7,181 Matching Annotations
  1. Jan 2023
    1. Author Response

      Reviewer #1 (Public Review):

      This well-done platform trial identifies that ivermectin has no impact on SARS-CoV-2 viral clearance rate relative to no study drug while casirivimab lead to more rapid clearance at 5 days. The figures are simple and appealing. The study design is appropriate and the analysis is sound. The conclusions are generally well supported by the analysis. Study novelty is somewhat limited by the fact that ivermectin has already been definitively assessed and is known to lack efficacy against SARS-CoV-2. Several issues warrant addressing:

      1) Use of viral load clearance is not unique to this study and was part of multiple key trials studying paxlovid, remdesivir, molnupiravir, and monoclonal antibodies. The authors neglect to describe a substantial literature on viral load surrogate endpoints of therapeutic efficacy which exist for HIV, hepatitis B and C, Ebola, HSV-2, and CMV. For SARS-CoV-2, the story is more complicated as several drugs with proven efficacy were associated with a decrease in nasal viral loads whereas a trial of early remdesivir showed no reduction in viral load despite a 90% reduction in hospitalization. In addition, viral load kinetics have not been formally identified as a true surrogate endpoint. For maximal value, a reduction in viral load would be linked with a reduction in a hard clinical endpoint in the study (reduction in hospitalization and/or death, decreased symptom duration, etc...). This literature should be discussed and data on the secondary outcome, and reduction in hospitalization should be included to see if there is any relationship between viral load reduction and clinical outcomes.

      This is an important point and we thank the reviewer for raising it. We agree that there is a rich literature on the use of viral load kinetics in optimizing treatment of viral infectious diseases, and we are clearly not the first to think of it! We have added the following sentence in the discussion.

      “The method of assessing antiviral activity in early COVID-19 reported here builds on extensive experience of antiviral pharmacodynamic assessments in other viral infections.”

      We agree that more information is needed to link viral clearance measures to clinical outcomes. We have addressed this in the discussion as follows:

      “Using less frequent nasopharyngeal sampling in larger numbers of patients, clinical trials of monoclonal antibodies, molnupiravir and ritonavir-boosted nirmatrelvir, have each shown that accelerated viral clearance is associated with improved clinical outcomes [1,4,5]. These data suggest reduction in viral load could be used as a surrogate of clinical outcome in COVID-19. In contrast the PINETREE study, which showed that remdesivir significantly reduced disease progression in COVID-19, did not find an association between viral clearance and therapeutic benefit. This seemed to refute the usefulness of viral clearance rates as a surrogate for rates of clinical recovery [16]. However, the infrequent sampling in all these studies substantially reduced the precision of the viral clearance estimates (and thus increased the risk of type 2 errors). Using the frequent sampling employed in the PLATCOV study, we have shown recently that remdesivir does accelerate SARS-CoV-2 viral clearance [17], as would be expected from an efficacious antiviral drug. This is consistent with therapeutic responses in other viral infections [18, 19]. Taken together the weight of evidence suggests that accelerated viral clearance does reflect therapeutic efficacy in early COVID-19, although more information will be required to characterize this relationship adequately.”

      2) The statement that oropharyngeal swabs are much better tolerated than nasal swabs is subjective. More detail needs to be paid to the relative yield of these approaches.

      The statement is empirical. We know of other studies in progress where there are high rates of discontinuation because of patient intolerance of repeated nasopharyngeal sampling. Not one of 750 patients enrolled to date in PLATCOV has refused sampling, which we believe is useful information for research involving multiple sampling. This is clearly a critical point for pharmacodynamic studies.

      We agree that the optimal site of swabbing for SARS-CoV-2 and relative yields for the given test requirements (sensitivity vs quantification) need to be considered, although the literature on this is large and sometimes contradictory.

      We have added the following line:

      Oropharyngeal viral loads have been shown to be both more and less sensitive for the detection of SARS-CoV-2 infection. Although rates of clearance are very likely to be similar from the two body sites, this should be established for comparison with other studies.

      3) The stopping rules as they relate to previously modeled serial viral loads are not described in sufficient detail.

      The initial stopping rules were chosen based on previously modelled data (reference 11). We have added details to the text (lines 199-219):

      “Under the linear model, for each intervention, the treatment effect β is encoded as a multiplicative term on the time since randomisation: eβT, where T=1 if the patient was assigned the intervention, and zero otherwise. Under this specification β=0 implies no effect (no change in slope), and β>0 implies increase in slope relative to the population mean slope. Stopping rules are then defined with respect to the posterior distribution of β, with futility defined as Prob[β<λ]>0.9; and success defined as Prob[β>λ]>0.9, where λ≥0. Larger values of λ imply a smaller sample size to stop for futility but a larger sample size to stop for efficacy. λ was chosen so that it would result in reasonable sample size requirements, as was determined using a simulation approach based on previously modelled serial viral load data [11]. This modelling work suggested that a value of λ=log(1.05) [i.e. 5% increase] would requireapproximately 50 patients to demonstrate increases in the rate of viral clearance of ~50%, with control of both type 1 and type 2 errors at 10%. The first interim analysis (n=50) was prespecified as unblinded in order to review the methodology and the stopping rules (notably the value of λ). Following this, the stopping threshold was increased from 5% to 12.5% [λ=log(1.125)] because the treatment effect of casirivimab/imdevimab against the SARS-CoV-2 Delta variant was larger than expected and the estimated residual error was greater than previously estimated. Thereafter trial investigators were blinded to the virus clearance results. Interim analyses were planned every batch of additional 25 patients’ PCR data however, because of delays in setting up the PCR analysis pipeline, the second interim analysis was delayed until April 2022. By that time data from 145 patients were available (29 patients randomised to ivermectin and 26 patients randomized to no study drug).”

      4) The lack of blinding limits any analysis of symptomatic outcomes.

      We added this line to the discussion:

      “Finally, although not primarily a safety study, the lack of blinding compromises safety or tolerability assessments.”

      5) It is unclear whether all 4 swabs from 2 tonsils are aggregated. Are the swabs placed in a single tube and analyzed?

      The data are not aggregated but treated as independent and identically distributed under the linear model. 4 swabs were taken at randomization, followed by two at each follow-up visit. We have added line 183:

      “[..] (18 measurements per patient, each swab is treated as as independent and identically distributed conditional on the model).”

      Swabs were stored separately and not aggregated.

      6) In supplementary Figure 7, both models do well in most circumstances but fail in the relatively common event of non-monotonic viral kinetics (multiple peaks, rebound events). Given the importance of viral rebound during paxlovid use, an exploratory secondary analysis of this outcome would be welcome.

      Thank you for the suggestion. We agree, although the primary goal is to estimate the mean change in slope. Rebound is a relatively rare event and tends to occur after the first seven days of illness in which we are assessing rate of clearance.

      Nevertheless, we agree that this is an important point. It remains unclear how to model viral rebound. In over 700 profiles now available from the study, only a few have strong evidence of viral rebound.

      Reviewer #2 (Public Review):

      This manuscript details the analytic methods and results of one arm of the PLATCOV study, an adaptive platform designed to evaluate low-cost COVID-19 therapeutics through enrollment of a comparatively smaller number of persons with acute COVID-19, with the goal of evaluating the rate of decrease in SARS-CoV-2 clearance compared to no treatment through frequent swabbing of the oropharynx and a Bayesian linear regression model, rather than clinical outcomes or the more routinely evaluated blunt virologic outcomes employed in larger trials. Presented here, is the in vivo virologic analysis of ivermectin, with a very small sample of participants who received the casirivimab/imdevimab, a drug shown to be highly effective at preventing COVID-19 progression and improving viral clearance (during circulation of variants to which it had activity) included for comparison for model evaluation.

      The manuscript is well-written and clear. It could benefit however from adding a few clarifications on methods and results to further strengthen the discussion of the model and accurately report the results, as detailed below.

      Strengths of this study design and its report include:

      1) Selection of participants with presumptive high viral loads or viral burden by antigen test, as prior studies have shown difficulty in detecting effect in those with a lower viral burden.

      2) Adaptive sample size based on modeling- something that fell short in other studies based on changing actuals compared to assumptions, depending on circulating variant and "risk" of patients (comorbidities, vaccine state, etc) over time. There have been many other negative studies because the a priori outcomes assumptions were different from the study design to the time of enrollment (or during the enrollment period). This highlight of the trial should be emphasized more fully in the discussion.

      3) Higher dose and longer course of ivermectin than TOGETHER trial and many other global trials: 600ug/kg/day vs 400mcg/kg/day.

      4) Admission of trial participants for frequent oropharyngeal swabbing vs infrequent sampling and blunter analysis methods used in most reported clinical trials

      5) Linear mixed modeling allows for heterogeneity in participants and study sites, especially taking the number of vaccine doses, variant, age, and serostatus into account- all important variables that are not considered in more basic analyses.

      6) The novel outcome being the change in the rate of viral clearance, rather than time to the undetectable or unquantifiable virus, which is sensitive, despite a smaller sample size

      7) Discussion highlights the importance of frequent oral sampling and use of this modeled outcome for the design of both future COVID-19 studies and other respiratory viral studies, acknowledging that there are no accepted standards for measuring virologic or symptom outcomes, and many studies have failed to demonstrate such effects despite succeeding at preventing progression to severe clinical outcomes such as hospitalization or death. This study design and analyses are highly important for the design of future studies of respiratory viral infections or possibly early-phase hepatitis virus infections.

      Weaknesses or room for improvement:

      1) The methods do not clearly describe allocation to either ivermectin or casirivimab/imdevimab or both or neither. Yes, the full protocol is included, but the platform randomization could be briefly described more clearly in the methods section.

      We have added additional text to the Methods:

      “The no study drug arm comprised a minimum proportion of 20% and uniform randomization ratios were then applied across the treatment arms. For example, for 5 intervention arms and the no study drug arm, 20% of patients would be randomized to no study drug and 16% to each of the 5 interventions. Additional details on the randomization are provided in the Supplementary Materials. All patients received standard symptomatic treatment.”

      2) The handling of unquantifiable or undetectable viruses in the models is not clear in either the manuscript or supplemental statistical analysis information. Are these values imputed, or is data censored once below the limits of quantification or detection? How does the model handle censored data, if applicable?

      We have added lines 185-186:

      “Viral loads below the lower limit of quantification (CT values ≥40) were treated as left-censored under the model with a known censoring value.”

      3) Did the study need to be unblinded prior to the first interim analysis? Could the adaptive design with the first analysis have been done with only one or a subset of statisticians unblinded prior to the decision to stop enrolling in the ivermectin arm?

      The unblinded interim analysis was done on the first 50 patients enrolled in the study. The study at that time was enrolling into five arms including ivermectin, casirivimab-imdevimab, remdesivir, favipiravir, and a no study drug arm (there were exactly 10 per arm as a result of the block randomization).

      The main rationale for making this interim analysis unblinded was to determine the most reasonable value of λ (this defines stopping for futility/success), which is a trade-off between information gain, reasonable sample size expectations, and the balance between quickly identifying interventions which have antiviral activity versus the certainty of stopping for futility.

      Once the value of 12.5% was decided, the trial investigators remained blinded to the results until the stopping rules were met and the unblinded statistician discussed with the independent Data Safety and Management Board who agreed to unblind the ivermectin arm.

      4) Can the authors comment on why the interim analysis occurred prior to the enrollment of 50 persons in each of the ivermectin and comparison arms? Even though the sample sizes were close (41 and 45 persons), the trigger for interim analysis was pre-specified.

      After the first interim analysis at 50 patients enrolled into the study, they were planned every additional 25 patients (i.e. very frequently). The trigger for the interim analysis was not 50 patients into a specific arm, but 50 patients in total, and thereafter were planned to occur with every 25 new patients enrolled into the study. In practice there were backlogs in the data pipeline (which we explain), and interim analyses occurred less frequently than planned- the second one being in April 2022.

      5) The reporting of percent change for the intervention arms is overstated. All credible intervals cross zero: the clearance for ivermectin is stated to be 9% slower, but the CI includes + and - %, so it should be reported as "not different." Similarly, and more importantly for casirivimab/imdevimab, it was reported to be 52% faster, although the CI is -7.0 to +115%. This is likely a real difference, but with ten participants underpowered- and this is good to discuss. Instead, please report that the estimate was faster, but that it was not statistically significant. Similarly, the clearance half-life for ivermectin is not different, rather than "slower" as reported (CI was -2 to +6.6 hours). This result was however statistically significant for casirivimab/imdevimab.

      Thank you for your comments. The confidence interval for casirivimab/imdevimab did not cross zero and was +7.0 to +115.1%, and we thank the reviewer for picking up the error in the results section (it was correct in the abstract) where it was written -7.0 to +115.1%. We have made this correction. Elsewhere, we have provided more precise language to discriminate clinical significance from statistical significance, as per the essential revisions.

      6) While the use of oropharyngeal swabs is relatively novel for a clinical trial, and they have been validated for diagnostic purposes, the results of this study should discuss external validity, especially with respect to results from other studies that mainly use nasopharyngeal or nasal swab results. For example, oropharyngeal viral loads have been variably shown to be more sensitive for the detection of infection, or conversely to have 1-log lower viral loads compared to NP swabs. Because these models look for longitudinal change within a single sampling technique, they do not impact internal validity but may impact comparisons to other studies or future study designs.

      We have added the following sentence to the discussion:

      “Oropharyngeal viral loads have been shown to be both more and less sensitive for the detection of SARS-CoV-2 infection. Although rates of viral clearance are very likely to be similar from the two sites, this should be established for comparison with other studies.”

      7) Caution should be used around the term "clinically significant" for viral clearance. There is not an agreed-upon rate of clinically significant clearance, nor is there a log10 threshold that is agreed to be non-transmissible despite moderately strong correlations with the ability to culture virus or with antigen results at particular thresholds.

      We agree. We have addressed this partly in our response to Reviewer 1.

      8) Additional discussion could also clarify that certain drugs, such as remdesivir, have shown in vivo activity in the lungs of animal models and improvement in clinical outcomes in people, but without change in viral endpoints in nasopharyngeal samples (PINETREE study, Gottlieb, NEJM 2022). Therefore, this model must be interpreted as no evidence of antiviral activity in the pharyngeal compartment, rather than a complete lack of in vivo activity of agents given the limitations of accessible and feasible sampling. That said, strongly agree with the authors about the conclusion that ivermectin is also likely to lack activity in humans based on the results of this study and many other clinical studies combined.

      As above this has been addressed in our response to Reviewer 1.

      Reviewer #3 (Public Review):

      This is a well-conducted phase 2 randomized trial testing outpatient therapeutics for Covid-19. In this report of the platform trial, they test ivermectin, demonstrating no virologic effect in humans with Covid-19.

      Overall, the authors' conclusions are supported by the data.

      The major contribution is their implementation of a new model for Phase 2 trial design. Such designs would have been ideal earlier in the pandemic.

      We thank the reviewer for their encouraging comments.

    1. Author Response

      Reviewer #1 (Public Review):

      Auxin-induced degradation is a strong tool to deplete CHK-2 and PLK-2 in the C. elegans germ line. The authors strengthen their conclusions through multiple approaches, including rescuing mutant phenotypes and biochemical analyses of CHK-2 and PLK-2.

      The authors overcame a technical limitation that would hinder in vitro analysis (low quantity of CHK-2) through the clever approach of preventing its degradation via the proteasome. In vitro phosphorylation assays and mass spectrometry analysis that establishes that CHK-2 is a substrate of PLK-2 nicely complement the genetic data.

      The authors argue that the inactivation of CHK-2 by PLK-2 promotes crossover designation; however, the data only indicate that PLK-2 promotes proper timing of crossover designation.

      We thank the reviewer for this point of clarification. While we believe that PLK activity is essential to inactivate CHK-2 and trigger CO designation, we agree that this has not been firmly established with the tools available to us, as elaborated below. We have revised the text to avoid overstating the conclusions.

      It is not clear whether the loss of CHK-2 function with the S116A and T120A mutations is the direct result of the inability to phosphorylate these residues or whether it is caused by the apparent instability of these proteins, as their abundance was reduced in IPs compared to wild-type. Agreed. The instability of the mutant proteins was a source of significant frustration during the course of this work, and limits the strength of our conclusions.

      The mechanism of CHK-2 inactivation in the absence of PLK-2 remains unclear, though the authors were able to rule out multiple candidates that could have played this role.

      Reviewer #2 (Public Review):

      In this manuscript, Zhang et al., address the role of Polo-like kinase signaling in restricting the activity of Chk2 kinase and coordinating synapsis among homologous chromosomes with the progression of meiotic prophase in C. elegans. While individual activities of PLK-2 and CHK-2 have been demonstrated to promote chromosome pairing, and double-strand break formation necessary for homologous recombination, in this manuscript the authors attempt to link the function of these two essential kinases to assess the requirement of CHK-2 activity in controlling crossover assurance and thus chromosome segregation. The study reveals that CHK-2 acts at distinct regions of the C. elegans germline in a Polo-like kinase-dependent and independent manner.

      Strengths:

      The study reveals distinct mechanisms through which CHK-2 functions in different spatial regions of meiosis. For example, it appears that CHK-2 activity is not inhibited by PLK's (1 and 2) in the leptotene/zygotene meiotic nuclei where pairing occurs. This suggests that either CHK-2 is not phosphorylated by PLK-2 in the distal nuclei or that it has a kinase-independent function in this spatial region of the germline. These are interesting observations that further our understanding of how the processes of meiosis are orchestrated spatially for coordinated regulation of the temporal process.

      Weaknesses:

      While the possibilities stated above are interesting, they lack direct support from the data. A key missing element in the study is the actual role of PLK-2 signaling in controlling CHK-2 activity and thus function. I expand on this below.

      Throughout the manuscript, the authors test the role of each of the kinases (CHK-2 or PLK-1, or 2) using auxin-induced degradation, which would eliminate both phosphorylated and unphosphorylated pools of proteins. This experiment thus does not test the role of PLK-2 signaling in controlling CHK-2 function or the role of CHK-2 activation. To test the role of signaling from PLK-2 or CHK-2, the authors need to generate appropriate alleles such as phospho-mutants or kinase-dead mutants. The authors do generate unphosphorylatable and phosphomimetic versions of CHK-2, however, they find that the protein level for both these alleles is lower than wild-type CHK-2 (which the authors state is already low). The authors conclude that the lower level of protein in the CHK-2 phospho-mutants is because the mutations cause destabilization of the protein. I am sympathetic with the authors since clearly these results make interpretations of actual signaling activity more challenging. But there needs to be some evidence of this activity, for example through the generation of a phosphor-specific antibody to phosphorylated CHK-2. While not functional, at least the phosphorylation status of CHK-2 would provide more information on its spatial pattern of activation and inactivation. In addition, it would still be of interest to the readership to present the data on these phosphor-mutant alleles with crossover designation and COSA-1::GFP. Is the phenotype of the WT knockin, and each of the phosphomutant knock-ins similar to auxin-induced degradation of CHK-2?

      We thank the reviewer for these comments. We have made several attempts over the past decade that have failed to elicit a CHK-2 antibody that works for either immunofluorescence or western blots, likely due to the very low abundance of CHK-2. This has discouraged us from investing yet more resources to try to develop a phospho-specific antibody. Moreover, our evidence suggests that phosphorylation may promote CHK-2 degradation. Since the phosphomutants of CHK-2 are not stable, we do not think knock-in of these phosphomutants will provide new insights.

      Given that the CHK-2 phosphomutants did not pan out for assessing the signaling regulation of PLK-2 on CHK-2, to directly assess whether PLK-2 activity restricts CHK-2 function in mid-pachytene but not leptotene/zygotene, the authors should generate PLK-2 kinase dead alleles. These alleles will help decouple the signaling function of PLK-2 from a structural function.

      Similarly, to assess the potentially distinct roles of CHK-2 in leptotene/zygotene and mid-pachytene it would be important to assess CHK-2 kinase-dead mutant alleles. At this time, all of the analysis is based on removing both active CHK-2 and inactive CHK-2 (i.e. phosphorylated and unphosphorylated pool) using auxin-induced degradation. The kinase-dead alleles will help infer the role of the kinase more directly. The authors can then superimpose the auxin-induced degradation and assess the impact of complete removal of the protein vs only loss of its kinase function. These experiments may help clarify the role of signaling outcomes of these proteins, vs their complete loss. For example, what does kinase dead PLK-2 recruitment to the synapsed chromosomes appear like? Are their distinct activities for active and inactive PLK-2 that are spatially regulated? The same can be tested for CHK-2.

      A kinase-dead allele of plk-2 has been generated in previous work and we have used it for other purposes. However, the fact that CHK-2 and PLK-2 are required for homolog pairing and synapsis, which are prerequisites for crossover designation, precludes their use here.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewers for their thorough and insightful evaluations of our manuscript and for their constructive feedback, which have significantly improved the quality of our manuscript. We were pleased to read that all three reviewers found our work novel, interesting, and relevant. In this revised manuscript, we have done our best to address all of the points raised by the reviewers by performing new experiments and revising sections of the text, as requested.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity):

      In this manuscript authors show that extracellular Mtb aggregates can cause macrophage killing in a close contact dependent but phagocytosis independent manner. They showed Mtb aggregates can induce plasma membrane perturbations and cytoplasmic Ca2+ influx with live cell microscopy. Next, the authors show that the type of cell death initiated by extracellular aggregates is pyroptosis and they partially supressed cell death with pyroptosis inhibitors. They also identified that PDIM, EsxA/EsxB and EspB all have a role in uptake-independent killing of macrophages even though their impact varies with respect membrane perturbation and Ca2+ influx. Finally, they used a small molecule inhibitor BTP15 to inhibit the effect of ESX-1 during the contact of the extracellular Mtb aggregates with the macrophages and they observed a substantial decrease in membrane perturbation and macrophage killing.<br /> The work describes a very interesting mechanism by which Mtb can kill macrophages that is possibly relevant in the context of infection.

      1. In general, there are two main issues with the experiments and the interpretation: the lack of quantitative analysis showing that in a population of macrophages the ones that are in contact with the aggregates die whereas the ones that are not in contact remain alive. This is currently not shown, and it should be added in figure 1.

      All our data are based on the visual inspection and annotation of time-lapse microscopy image series, from which it is conclusive that death happens more often among cells in contact with Mtb aggregates (see movies S3 and S6 for representative examples). However, we acknowledge the reviewer’s suggestion that quantitative data supporting this observation might help to convey this conclusion more effectively. Therefore, we have quantified the percentage of dead cells in: I) macrophages in uninfected controls; II) macrophages that establish contact with an Mtb aggregate; III) bystander macrophages that never contact an Mtb aggregate despite being in the same sample as the infected cells, in experiments with (figure 1D) or without (figure 1Q) cytochalasin D treatment. These data have been incorporated as two additional plots in figure 1 in the revised manuscript. We find that uninfected and bystander cells have similar survival probabilities over the time-course of an experiment, whereas most of the cells that physically interact with Mtb_aggregates die by the end of the experiment. To further validate these observations, we have also plotted the lifespans of infected cells vs. bystander cells without (figure S3A) and with (figure S3B) cytochalasin D treatment. In these plots, the lifespan of an individual cell is represented by a line; the fraction of the line coloured in black corresponds to the time spent as bystander and the fraction of the line in magenta corresponds to the time spent in contact with an _Mtb aggregate. We hope that these new data convincingly show that bystander cells (black lines) survive longer compared to cells that interact with Mtb aggregates (black-magenta lines).

      1. The second is the cell death mode, as the markers used are very different and considering different outcomes (e.g., apoptosis vs. necrosis) are relevant for the infection it is unclear what is being measured here and the impact on bacterial replication.

      As the reviewer points out, it has previously been shown that different cell death pathways can affect viability and propagation of intracellular bacteria (1, 2). Since in our experiments we are specifically analyzing extracellular bacteria, we cannot directly comment on how cell death affects intracellular bacterial replication. However, to address the reviewer’s comment, we have included additional data in figure S13A of the revised manuscript showing that specific inhibitors of cell death do not affect the growth or replication of extracellular Mtb. These results suggest that while these molecules do not affect Mtb growth per se, the suppression of these specific death pathways also does not significantly affect the microenvironment to alter Mtb growth (i.e., access to nutrients or molecules released by dead cells). In addition, we have included new data in figure S12 demonstrating the responsiveness of our isolated macrophages to the various cocktails of molecules typically used to induce apoptosis, pyroptosis, or necroptosis.

      The authors are showing that infection with Mtb aggregates increase the rate of the macrophage killing but how does this impact infection dissemination and replication of the bacterial aggregates? Is it beneficial for the aggregates? Did the authors check the growth rate of Mtb along with cytochalasin D?

      A previous study has shown that phagocytosis of Mtb aggregates leads to macrophage death more efficiently than phagocytosis of a similar number of individual bacteria (3). It has also been shown that Mtb growing on the debris of dead host macrophages forms cytotoxic aggregates that kill the newly interacting macrophages (3). These observations suggest a model in which host cell death induced by Mtb aggregates supports faster extracellular growth and propagation of infection (3). This study was cited in the Introduction section of our manuscript, and our data support these observations. In the revised manuscript, we show that single Mtb bacilli or Mtb aggregates induce macrophage death in a dose-dependent manner (figure S7A,B); however, bacterial aggregates kill more efficiently when compared to similar numbers of non-aggregated bacilli (figure S7A,B). We also show that infection with Mtb_aggregates leads to faster bacterial propagation compared to infection with similar numbers of individual bacteria (figure S7C,D). These observations, combined with our data showing that _Mtb aggregation also enhances uptake-independent killing of macrophages (figure 2), suggest that Mtb aggregates induce rapid host cell death, allowing the bacteria to escape intracellular stresses, grow faster outside host cells (figure S1B), and propagate to other cells. To address the reviewer’s concern whether cytochalasin D affects Mtb growth, the revised manuscript includes additional data confirming that cytochalasin D does not affect the growth of Mtb aggregates (figure S6).

      1. How did the authors quantify the interactions of Mtb with macrophages in Figure 1D?

      The interactions of Mtb with macrophages were quantified through manual annotation of the time-lapse microscopy image series. If the Mtb aggregates disaggregated upon interaction with the macrophage, resulting in redistribution of smaller aggregates of bacteria, we categorized them as “fragmented”. On the other hand, if the aggregates remained clustered, we categorized them as “not fragmented”. Representative snapshots of these two patterns are presented in figure 1E and 1F and we have included additional representative examples in movies S4 and S5 of the revised manuscript. These interactions are quantified and plotted in figure 1N of the revised manuscript (figure 1D in the original version).

      1. Is it enough to conclude with one example of SEM that the mycobacteria with different fragmentation discriminates if the bacteria is intracellular or extracellularly localised? Can authors use an alternative quantitative method to confirm the localization of the bacteria by a quantification by 3D imaging of these two phenotypes with a cytoskeleton marker (or may be even with tdTomato-expressing BMDMs)?

      In the revised manuscript, we provide additional examples of correlative time-lapse microscopy and SEM images (supplementary figure S5). As suggested by the reviewer, in the revised manuscript we further validate these conclusions using an alternative approach based on correlative time-lapse microscopy followed by confocal 3D imaging. After time-lapse imaging, we fixed the samples and labelled the plasma membrane of the macrophages with a fluorescent anti-CD45 antibody to define the cell boundaries and identify bacteria that are intracellular vs. extracellular. Representative images obtained using this approach have been added to figure 1 and additional examples are shown in supplementary figure S4 of the revised manuscript. The acquisition, processing, and analysis of these 3D images are time-consuming and prevent us from performing an exhaustive quantitative analysis. However, we are confident in our conclusions, since in all of the cells that we analyzed we found that aggregates that are not fragmented within 6 hours of stable interaction with macrophages are visible on the outer side of the plasma membrane.

      1. How do we know if the cell is lysed at 30 h in Supplementary Figure 1, did the authors use a marker to detect the cell lysis or is it based on just the observation from the live cell imaging? Movies in supplementary are actually not very informative as there are many ongoing events and it is hard to visualise what the authors claim. A marker of cell death in the movies should be used.

      In this study, we used brightfield time-lapse microscopy images to identify cell death. Dying macrophages rapidly change shape, lose membrane integrity, and stop moving. Moreover, the intracellular structures and bacteria also stop moving at the time of death of the host cell. While these events can be difficult to distinguish by examining individual snapshots, they are readily identifiable by careful frame-by-frame examination of time-lapse microscopy image series. To exemplify this process, in the revised manuscript we show in supplementary figure S2A how we identify macrophage cell death events. We also include Draq7 (a live cell-impermeable dye commonly used to identify dead cells by flow cytometry and microscopy) in the growth medium during time-lapse imaging in order to label dead macrophages. The timing of staining validates and confirms our strategy of using brightfield time-lapse images to define the time-of-death of individual cells. To further assist readers, in the revised manuscript we provide the time-lapse microscopy movie used to generate this figure (movie S4). Similar images and movies have also been added for cells treated with cytochalasin D (figure S2B; movie S7). As suggested by the reviewer, we also replaced figure S1A with a new figure that shows a representative example of an Mtb intracellular microcolony that, upon death of the host macrophage, grows and forms a large extracellular aggregate on the debris of the dead cell (Draq7-positive). Movie S2 was used to generate this figure. Finally, we replaced figures 1E,F with new figures incorporating the Draq7 staining to label macrophage cell death and we include the time-lapse microscopy movies used to generate these figures (movies S4, S5).

      1. Total macrophage killing after contact in Figure 1L is around 12 hours, whereas it is observed that the macrophage death after contact with cytochalasin D treatment in Figure 1M is even longer than 24 hours. The viability at 12 hours in Figure1M is as fragmented Mtb survival in Figure1L, why there is a difference in timing with respect to macrophage killing?

      We thank the reviewer for this interesting observation. Indeed, we find that macrophages treated with cytochalasin D do take longer to die upon establishing stable interaction with Mtb aggregates in comparison to untreated cells. Although we do not have a clear explanation for this difference in timing, we speculate that by inhibiting actin polymerization and consequently cell motility, cytochalasin D might slow the expansion of the macrophage plasma membrane and the establishment of a larger interface of contact between the cell and the bacterial aggregate, which could influence the timing of cell death.

      1. Did authors perform statistical tests for Figure 1D and Figure 1N? p-values should be added.

      Figure 1D (figure 1N in the revised manuscript) shows the percentage of interactions between macrophages and _Mtb_aggregates that do or do not lead to fragmentation of the aggregate. Each dot represents the percentage of these events in one experimental replicate. We included this plot to show that reproducibly in all our replicates approximately 20% of the interactions do not lead to fragmentation of the aggregate. Since the purpose of this plot is not to compare the “fragmented” and “non-fragmented” populations but rather to highlight the reproducibility of the phenomenon, we do not think it would be appropriate to add a p-value. However, figure 1N (figure 1Q in the revised manuscript) has been updated and modified to include statistical analysis and a p-value.

      1. In Figure 3, do the observations indicated in the Figure 3 happen in all the macrophages that are in contact with aggregates? This is unclear and critical to support the conclusions. Do all the macrophages that are in contact with Mtb aggregates become Annexin-V positive? In Supplementary Figure 2 there is some information regarding this question, but it will be important to show it as a percentage.

      In response to the reviewer’s suggestion, we have modified the figure to include quantitation of Annexin-V staining. Approximately 75% of the macrophages that interact with an Mtb aggregate show detectable local Annexin V-positive membrane domains at the site of contact with the aggregate during a typical 60 hour-long experiment. Since most of the macrophages show local Annexin V-positive membrane domains within the first 12 hours upon contact with an Mtb_aggregate (figure 3C), we used this criterion for comparison of different conditions or strains (for example, those shown in figure 6F). In addition, we added figure 3D, which shows the behaviour of 105 macrophages upon contact with _Mtb aggregates in a typical experiment. In this plot, each line represents the lifespan of an individual cell; the fraction of the line in black represents the time spent as bystander, the fraction of the line in magenta represents the time spent interacting with an Mtb aggregate, and the fraction in green represents the time upon formation of local Annexin V-positive membrane domains at the site of contact with the Mtb aggregate. We believe that this additional information further supports our conclusions that most of the cells in contact with an Mtb aggregate show local Annexin V-positive membrane domains and that cells that show this pattern die faster than cells that do not develop local Annexin V-positive membrane domains.

      1. Did the authors try to stain Mtb aggregates alone with Annexin-V as a control over the duration of the imaging?

      We thank the reviewer for suggesting this control. In supplementary figure S8C of the revised manuscript, we include a representative example of a time-lapse microscopy image series showing Mtb aggregates that never interact with a live macrophage althought they are adjacent to a dead cell. As observed in the Annexin V fluorescence images (yellow), these Mtb aggregates never become Annexin-V positive during the course of the experiment (60 hours).

      1. In Figure 4, did the authors continue to image the cells interacting with Mtb aggregates that do not die after Ca2+ accumulation in Supplementary Figure 3D? Do these cells recover from the plasma membrane perturbation? Did the authors consider using another marker for plasma membrane perturbation together with BAPTA?

      Unfortunately, we are not able to image macrophages stained with Oregon Green 488 Bapta-1 AM for more than 36 hours because they lose fluorescence over time, possibly due to partial dye degradation or secretion. Another issue is that macrophages do not establish synchronous interaction with Mtb aggregates (figure 3D; figure S3B). In order to pool together results from many cells, we analyze all the cells that interact with Mtb within the first 20 hours and we define as timepoint 0 the time at which each individual cell establishes interaction with the bacteria. To compare similar time windows for each cell, we use fluorescence values measured at 16 hours post-interaction with bacteria as a readout. This time window is sufficient to observe formation of local Annexin V plasma membrane domains and death in a relevant number of macrophages (figure 1P; figure 3D). Not all of the contacted cells die within the timeframe of our experiments; however, we believe that if we imaged cells that accumulate Ca2+ for longer durations, we would find that all such cells eventually die. This assumption is consistent with the observation that calcium chelation reduces inflammasome activation and death in macrophages in contact with Mtb aggregates (figure 5D; figure 4E).

      With respect to the reviewer’s query whether cells recover from plasma membrane perturbation, in our time-lapse microscopy experiments, we observe that when macrophages form local Annexin V-positive plasma membrane domains at the site of contact with Mtb aggregates, they never revert to an Annexin V-negative status afterwards (figure 3D; movie S7; movie S8). Our SEM data show that Mtb aggregates colocalizing with Annexin V-positive domains are not partially covered by intact membrane, in contrast to those associated with Annexin V-negative macrophages, although they do present vesicles and membrane debris on their surface (figure 3F,G ). In the revised manuscript, we include additional fluorescence microscopy images showing that Annexin V-positive foci colocalize with markers for the macrophages’ plasma membrane (figure S8A,B) as well as with more distal areas of the bacterial aggregates, where we do not observe any positive plasma membrane staining (figure S8B). Similarly, although _Mtb_aggregates that are never in contact with macrophages never become Annexin V-positive (figure S8C), we see that upon macrophage death, aggregates in contact with dead cells retain some Annexin V-positive material on their surface (figure S8C; movie S8). Vesicle budding and shedding is a common ESCRT III-mediated membrane repair strategy that allows removal of damaged portions of the plasma membrane and wound resealing (4). Thus, we think that in our experiments the Annexin V-positive foci might represent both damaged membrane areas and released macrophage plasma membrane vesicles that stick to the hydrophobic surface of the bacterial aggregates. This means that the time of appearance of Annexin V-positive domains marks the time when the macrophage membrane experiences a damaging event. Interestingly, we do not observe a gradual increase in fluorescence intensity of the Annexin V-positive domains, but rather multiple single intensity peaks over time (movie S8). This might suggest that multiple discrete damaging events occur over time.

      1. In Figure 5D-G it will be important if the authors include dots for each macrophage events for the contact conditions as well, as it was done for the bystander condition.

      We apologize for using a too-pale shade of magenta in the earlier version of the manuscript, which apparently made the dots in these figures hard to visualize. In the revised manuscript, we use a darker shade of magenta to show the dots corresponding to the fluorescence values of the macrophages in contact with Mtb aggregates.

      1. How did the authors discriminate between the macrophages that are in contact or not with Mtb aggregates after the staining with Casp-1, pRIP3 and pMLKL? Do the aggregates stay in contact even after the staining procedures? Representative images of the labelling should be included in this figure.

      Before fixation, we make sure to remove the medium gently to avoid disrupting the interactions between cells and bacteria. This step most likely removes the floating bacterial aggregates that are not in stable contact with cells but apparently does not detach aggregates that stably interact with cells. Our correlative time-lapse microscopy and immunofluorescence images (figure 1; figure S4), as well as our correlative time-lapse microscopy and SEM images (figure S5; figure 3F,G), confirm that Mtb aggregates that interact with cells during time-lapse imaging are retained on the surface of those cells upon fixation and processing for immunofluorescence or electron microscopy. As we can observe in figure 5B (cell indicated by the white arrow), Mtb aggregates are retained on the debris of dead cells. In figure 5 we distinguish between “in contact” macrophages and “bystander” macrophages by inspecting brightfield images showing the cells and the respective fluorescence images corresponding to the bacteria. If the body of a macrophage identified in the brightfield image overlaps with a bacterial aggregate identified in the fluorescence channel, we define the macrophage as “in contact”; otherwise, it is annotated as “bystander”. We provide representative images in figure S12 and we clarify the definition of “in contact” and “bystander” in the figure legend of figure 5.

      1. The labelling of Figure 5H needs to be corrected both in the text and in the figure legend.

      We thank the reviewer for bringing our attention to this error, which has been corrected in the revised manuscript.

      1. Pyroptosis inhibitors did reduce the percentage of cell death, but did it also reduce the number of Annexin-V positive domains? This is important as AnnexinV is a marker of apoptosis and the outcome for Mtb very different.

      As pointed out by the reviewer, Annexin V staining is often used as a marker for apoptosis. Typically, apoptotic cells stain positive for Annexin V but negative for other membrane-impermeable markers such as propidium iodide, because they expose phosphatidylserine (bound by Annexin V) on the outer leaflet of the plasma membrane without losing plasma membrane integrity (5). Apoptotic cells often look round and their plasma membrane is stained homogeneously by fluorescently labelled Annexin V (5). In our experiments, we observe that macrophages in contact with Mtb aggregates become Annexin V-positive; however, this happens only at the site of contact with the bacteria (figure 3A; movie S7). Only when cells die and get stained by membrane-impermeable dies such as Draq7 do they also get stained with Annexin V over the entire membrane debris. We thus use Annexin V staining as a marker for membrane perturbation rather than for cell death. If we were using the Annexin V as a marker for cell death, we would expect to see a reduction in Annexin V-positive cells in samples treated with pyroptosis inhibitors. In these samples, we do observe a reduced percentage of cell death in comparison to untreated controls; however, we still observe a comparable percentage of macrophages that stain positive for Annexin V locally, i.e., at the site of contact with bacterial aggregates (supplementary figure S13B). In line with this observation, treated vs. untreated macrophages in contact with Mtb aggregates accumulate similar levels of intracellular calcium. These observations are consistent with our model suggesting that contact with Mtb aggregates induces membrane perturbation, calcium accumulation, inflammasome activation, and pyroptosis in contacted macrophages. Since the death inhibitors used in our study specifically target pyroptosis effectors, we do not expect them to affect upstream events such as membrane perturbation and calcium accumulation.

      1. In Figure 6, The sections for Figure 6 are well described but kept relatively long with too many details, it will be helpful to the reader if the authors can combine the sections in one header.

      We agree that the text linked to figure 6 is long. We tried to make these sections as concise as possible; however, we are concerned that combining all of the sections under a single header might be at the expense of clarity. Thus, unless the reviewer objects, we would prefer to maintain the use of multiple headers.

      1. Figure 6F does not have a statistical test and p-value, it will be important to include the statistical test in the legend and p-values in the

      As recommended by the reviewer, we have analyzed the results in figure 6F by using a one-way ANOVA test and we have added the calculated p-values to the figure.

      Reviewer #1 (Significance):

      Based on the literature, Mtb infection and replication can trigger different types of cell death and most of the studies have addressed cell death only as an outcome of intracellular replication. This study shows another form of host cell death, associated only to extracellular bacterial aggregates that are in contact with macrophages. Plasma membrane damage initiating pyroptosis has been defined in: "Plasma membrane damage causes NLRP3 activation and pyroptosis during Mycobacterium tuberculosis infection" by K.S. Beckwith et al. (2020). However, the effect of extracellular bacteria on plasma membrane damage was not addressed before and this paper is addressing an important observation with respect to Mtb evasion and dissemination. These observations represent a novel and interesting aspect in the induction of macrophage cell death by Mtb and potentially relevant for the disease. If the authors consider the comments listed above, this manuscript will be a novel and relevant addition to the field of host pathogen interactions in tuberculosis.

      We thank the reviewer for their perspective and their positive comments about our work.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In this work, Toniolo and coworkers use single-cell time-lapse fluorescence microscopy to show that extracellular aggregates of Mycobacterium tuberculosis can evade phagocytosis by killing macrophages in a contact-dependent but uptake-independent manner. The authors further show that this process is dependent on the functionality of the ESX-1 type VII secretion system and the presence of mycobacterial phthiocerol dimycocerosate (PDIM). In essence the authors show that M. tuberculosis can induce macrophage death from the outside of the cell, and dissect the different players that are involved in the process.

      Major comments:

      1. I was intrigued by all the different findings of this work, which was done by using bone marrow derived murine macrophages, however, my first question to the authors is how they imagine that this process will take under an in vivo situation? Do they have evidence that these mycobacterial clumps may form during the initial infection process in the lungs? It would be important to provide more insights and discussion into this question in order to see how relevant the described details are inside the host organism.

      Formation of Mtb aggregates in tuberculosis lesions have been documented in several animal models (6, 7) and in humans (8–11). While it is unclear whether mycobacterial aggregates form during the earliest stages of infection, extracellular bacterial aggregates have been observed in animal models of infection within the first month post-infection, and they are often associated with necrotic foci. Moreover, masses of Mtb growing as pellicle-like aggregates are often observed on the surface of cavities in human tuberculosis patients. These observations confirm that Mtb aggregates can form during a tuberculosis infection and that a significant fraction of bacteria are extracellular during different stages of infection. As we observe that macrophages undergo contact-dependent uptake-independent death also in the absence of cytochalasin D in vitro, we assume that this may also happen in vivo when Mtb aggregates are formed or released outside host cells. This process may promote bacterial propagation at early stages of infection as well as at later stages when necrotic granulomas and cavities are formed.

      In the revised manuscript we present and discuss our observations in the context of the in vivo phenotypes reported in the scientific literature and we include additional references showing that extracellular Mtb aggregates are often observed in vivo. We also propose this concept already in the Introduction section to better link our observations to possible in vivo scenarios.

      Minor comments:

      Line 91: here the authors list the different forms of cell death that is induced by MTB infection, and it would be important to add apoptosis as a reported mechanism as well (References: PMID: 23848406, PMID: 28095608)

      As suggested by the reviewer, in the revised manuscript we have modified the Introduction section to include apoptosis as a Mtb-induced mechanism of macrophage death and we have cited the two publications recommended by the reviewer.

      1. Line 95: The secretion of EspE was mainly described in M. marinum while in members of the M. tuberculosis complex no virulence phenotype was reported to the best of my knowledge.

      In agreement with the reviewer’s comment, we have modified the sentence and removed EspE from the list of virulence factors.

      1. Lines 98: In the cited papers it is described that PDIM is required for phagosomal damage/rupture, however, the methods used there do not allow to specifically report about translocation. The wording should be adapted.

      We thank the reviewer for this insightful comment and we have modified the text accordingly.

      1. Line 206: Here it is described that in Figure 3A the BMDMs were expressing tdTomato fluorescence and the bacteria GFP, and the same is also repeated in the Figure legend of Fig3A. However, on the images, BMDMs are shown green and bacterial clumps purple (as also indicated in the description directly on the images) Please check and explain/correct this discrepancy.

      We apologize that the color scheme in figure 3A is confusing. In this figure we used tdTomato-expressing BMDMs and GFP-expressing bacteria; however, we have pseudo-colored the fluorescence images for the sake of consistency with the other figures in the manuscript, which always show bacteria in magenta. We have clarified this point in the figure legend of the revised manuscript.

      1. Line 304: Here the authors could mention that this finding is similar to results found previously in reference PMID: 28095608 and opposite to the results reported previously in PMID: 28505176.

      As recommended by the reviewer, we have added a sentence comparing our results with previous studies and we have cited the two references suggested by the reviewer.

      1. Line 321: It should be mentioned that CFP10 (EsxB) can also be secreted without its EsxA partner (under certain circumstances, i.e. when the EspACD operon is not expressed due to a phoP regulatory mutation (PMID: 28706226)). However, in Figure S7 an EspAdeletion mutant shows loss of EsxB secretion. This should be checked and discussed how the data here compare with data and strains published previously.

      We thank the reviewer for pointing out this interesting point. Our proteomics data revealed that both our esxA mutant and our espA mutants abolish secretion of both EsxA and EsxB, in line with previously published data (12–14). We do not know why the espA mutant behaves differently from the MTBVAC strain concerning secretion of EsxA and EsxB (although we note that regulatory mutations may have complex pleiotropic effects). In the revised manuscript, we have modified this section to include references highlighting that secretion of these proteins may be uncoupled in some circumstances.

      1. The finding that EspB can substitute the loss of virulence due to loss of EsxA/ESAT-6 secretion is astonishing and also is different to previous observations that strain H37Ra and MTBVAC (two attenuated strains that have no or very little EsxA secretion due to a regulation defect of the espACD operon PMID: 18282096; PMID: 28706226). How does the hypothesis put forward by the authors match with these previously published data ?

      We thank the reviewer for this interesting comment. We would like to clarify that we are not claiming that EspB and EsxA are in general redundant and that EspB can substitute EsxA as a virulence factor. In our experiments we show that EspB can induce contact-dependent uptake-independent death in macrophages in contact with Mtb aggregates in vitro even in the absence of EsxA; however, the precise role of EspB during infection in mice or humans remains to be elucidated and is outside the scope of this manuscript. A previous study comparing Mtb ESX-1 mutants with different secretion patterns linked EspB secretion to Mtb virulence in vivo (14); however, the behavior of an isogenic espB_deletion strain _in vivo was not reported. A M. marinum espB mutant was shown to have reduced virulence; however, in contrast to Mtb, deletion of espB also affects secretion of EsxA in this organism (15). As the reviewer points out, the Mtb strains H37Ra and MTBVAC do not secrete EsxA due to a mutated phoP gene. Previous literature has shown that espB expression is also dependent on PhoP (16). We thus speculate that these strains might behave similarly to our espA espB mutant strain in the context of contact-dependent uptake-independent induction of macrophage death, although we think that this point is outside the scope of our manuscript.

      1. In the same context, it is to notice that the authors report in the paragraph between lines 310-330 about EsxA/EsxB secretion, however, looking at the Western blots of figure S7, there is no blot showing results using an antibody against EsxA. Given the previously published results that EsxA/EsxB secretion may also be disconnected (PMID: 28706226), the wording of the text in this paragraph should be adapted or the results from Western Blots using EsxA antibodies be added.

      We agree with the reviewer’s comment. Unfortunately, we currently do not have access to a good antibody for EsxA. A commercial monoclonal antibody that was previously available for immunoblotting has been discontinued. We tried several other antibodies that were previously shown to work in M. marinum, but none of these antibodies were effective in M. tuberculosis. We agree that analysing secretion of EsxB alone might not be sufficient to support claims about EsxA secretion. For this reason, we performed quantitative proteome analysis of the secretome in all of the relevant mutant strains. In our revised manuscript, we are careful to make sure that whenever we refer to EsxA/EsxB secretion we always provide proteomics data to support our conclusions.

      1. Line 395: Here the authors write that BTP15, a small molecule that in a previous study was shown to inhibit EsxA secretion at higher concentrations (starting from 1.5 uM and higher). However, no effect on the expression of EsxA was described for that compound in reference PMID: 25299337. Thus the corresponding sentence in line 395 needs to adapted to that situation.

      We thank the reviewer for noticing this error, which we have corrected in the revised manuscript.

      1. Moreover, most concentrations of the compounds used are reported in uM, except for BTP15. It would be easier for the reader if the concentration used for BTP15 could also be reported in uM.

      As suggested by the reviewer, in the revised manuscript we report the concentration of BTP15 in μM.

      1. Line 475 The comment on the pore forming activity has to be made with caution, as recombinant EsxA produced from E. coli cultures has been shown to often retain detergent PMID: 28119503 that may be responsible for pore forming activity of recombinant EsxA observed in quite some studies, whereas EsxA purified from M. tuberculosis cultures did not show the detergent, but still retained membranolytic activity. This point should be clarified and discussed, and the wording adapted, as EsxA is not a classical poreforming toxin, but excerts the membrane-lysing activity together with other partners (PDIM) in a yet unknown way upon cell contact.

      We thank the reviewer for this comment. In the revised manuscript, we have modified the text accordingly and included the sugggested reference.

      Reviewer #2 (Significance):

      The findings in this work extend the current knowledge on cell infection by M. tuberculosis in a significant way and put extracellular M. tuberculosis clumps in a new context. These data obtained by single-cell time-lapse fluorescence microscopy also need to be discussed for predicting the relevance for an in vivo situation inside the host organism.

      As suggested by the reviewer, in the revised manuscript we discuss additional examples from the literature showing that Mtb aggregates can form during infection and that many bacteria are extracellular and associated with necrotic foci during different stages of the disease in animal models of infection and in human patients. We believe that these previously published observations support the in vivo relevance of the process we observe in vitro.

      Reviewer #3 (Evidence, reproducibility and clarity):

      This is an excellent study distinguished by the volume of observations, rigor of analysis and clarity of presentation. The results are novel, biologically interesting and pathophysiologically important. The ability of aggregated M. tuberculosis to kill macrophages has been reported, but the understanding was that proliferation of Mtb within macrophages killed them. Here, the authors observe that macrophages are susceptible to pyroptotic death triggered by contact with extracellular Mtb aggregates, and that this is not recapitulated by contact with a comparable number of Mtb as single bacilli. The authors go some way to tracing the mechanism and uncover a complex inter-dependence on PDIM and on components of the mycobacterial ESX-1 secretory system.

      The following comments will helpfully improve the study further.

      Major points

      1. The chief measurement in this study is death of individual macrophages as judged by the observer in videomicroscopy. However, the criteria for calling a macrophage "dead" are not defined with any morphological detail, beyond noting that the cell stops moving and lyses. Of course a cell will stop moving if it has lysed, but do not some if not most cells stop moving before they lyse? If so, lysis alone would seem to be the time-point marker for cell death. Yet from the images in Fig 1E and F, I cannot tell that the cells called "dead" have lysed. Watching the videos, the time of lysis is not clear to me. Eventually, shrunken cell bodies are obvious but it is not clear if these are residua of cells that had been said to "lyse" at an earlier time.

      In this study, we used brightfield time-lapse microscopy images to identify cell death. Dying macrophages rapidly change shape, lose membrane integrity, and stop moving. Moreover, the intracellular structures and bacteria also stop moving at the time of death of the host cell. While these events can be difficult to distinguish by examining individual snapshots, they are readily identifiable by careful frame-by-frame examination of time-lapse microscopy image series. To exemplify this process, in the revised manuscript we show in supplementary figure S2A how we identify macrophage cell death events. We also include Draq7 (a live cell-impermeable dye commonly used to identify dead cells by flow cytometry and microscopy) in the growth medium during time-lapse imaging in order to label dead macrophages. The timing of staining validates and confirms our strategy of using brightfield time-lapse images to define the time-of-death of individual cells. To further assist readers, in the revised manuscript we provide the time-lapse microscopy movie used to generate this figure (movie S4). Similar images and movies have also been added for cells treated with cytochalasin D (figure S2B; movie S7). As suggested by the reviewer, we also replaced figures 1E,F with new figures incorporating the Draq7 staining to label macrophage cell death and we include the time-lapse microscopy movies used to generate these figures (movies S4, S5).

      1. The use of BTP15 as a specific inhibitor of ESX-1 is problematic. The source of the compound is not stated.

      The BTP15 molecule was kindly provided by Prof. Stewart Cole, the corresponding author of the article describing the identification of this compound and its effect on Esx-1 secretion (17). We have included this information in the Materials and Methods section.

      1. The concentration used, 20 ug/mL, is well above the reported IC50 (1.2 uM) for its presumed target, a mycobacterial histidine kinase, and above the concentrations (0.3-0.6 uM) reported to inhibit Mtb's secretion of EsxA almost completely. It is concerning that the concentrations that were reported to work so well on the whole cell are lower than the IC50 for the presumed target, because uptake into Mtb and intrabacterial metabolism will typically lead to a lower potency for an inhibitor against the whole bacterium than against the isolated enzyme; and because 50% inhibition of an enzyme rarely gives a functional effect as complete as what is shown in the cited reference. In other words, it is not clear that the histidine kinase is the functionally relevant target of BTP15 in Mtb. The original report did not consider BTP15's possible effect on mammalian cells and the present authors likewise do not take that into consideration with respect to possible effects on the macrophages. No concentration-response or time course experiments with BTP15 are presented. Most important, unless I missed it, there is apparently no demonstration that the compound inhibited ESX-1-dependent secretion in the present authors' hands, no matter by what mechanism. Without this, I am reluctant to accept that the results with BTP15 demonstrate a dependence of extracellular-aggregate-induced macrophage death on ESX-1-mediated secretion from Mtb. I would recommend that the authors either provide a direct demonstration of BTP15's effect on ESX-1 dependent secretion at concentrations near those that worked on whole cells in the original report, or drop the BTP15 studies from the paper. That said, the genetic experiments remain unequivocal, so the paper's conclusions would not be affected.

      We agree with the reviewer that in the original version of our manuscript we did not provide direct evidence demonstrating that BTP15 inhibits ESX-1 secretion and that it does not affect the host cells. We addressed the first issue by quantifying (by Western blot) the secretion of EsxB and EspB in Mtb cultures treated with different concentrations of BTP15. We show that BTP15 reduces secretion of these two proteins in a dose-dependent manner. These data have been included in figures S21A-B of the revised manuscript. In line with this observation, we also show that BTP15 reduces uptake-independent killing of macrophages by Mtb aggregates in a dose-dependent manner (figure 6H). To show that the dose-dependent effect observed in macrophages does not depend on a direct effect of BTP15 on the host cells, we treated Mtb with different concentrations of BTP15 for 48 hours and removed the compound by washing the bacteria prior to infection. We observe that Mtb aggregates that have been treated with BTP15 show reduced uptake-independent killing of macrophages, even when bacteria have been pre-treated and the small molecule is not present during the incubation with the cells (figure S21C). We hope that these additional results provide clear evidence that BTP15 reduces Mtb-mediated contact-dependent uptake-independent killing of macrophages by inhibiting ESX-1 secretion, consistent with our genetic data. We think these results are important because they provide a chemical validation of our genetic data. To the best of our knowledge, BTP15 is the only available compound known to inhibit ESX-1 secretion, and in the revised manuscript we confirm that this compound has the previously described effect on Mtb also in our hands. Unfortunately, we had to use concentrations higher than those previously reported to inhibit ESX-1 secretion in order to achieve the observed effects. As we had access only to prediluted aliquots that had been stored for a long time, we cannot rule out the posibility that the compound might have undergone partial degradation during storage.

      1. The experiments, or at least the discussion, could consider what may distinguish single Mtb cells from aggregated Mtb in some way relevant to the present observations. The authors seem to assume that all the Mtb cells in their preparations are biochemically equivalent and that their distribution into single-cell or aggregate subpopulations is stochastic. What if it is deterministic instead? For example, what if these two subpopulations are defined by differential expression of PDIM, so that the greater macrophage-killing effect of aggregates than single cells in equivalent numbers reflects a greater amount of PDIM in the aggregates, rather than some sort of valency-of-contact effect? The authors could compare the PDIM-to-DNA ratio in the single cell and aggregated subpopulations, or at least discuss this possibility.

      We thank the reviewer for proposing this extremely interesting idea. In the revised manuscript, we have added a discussion of this point (lines 487-489) and we have floated various possible explanations. However, we believe that experimental dissection of the underlying mechanism could be a very lengthy undertaking and we hope that the reviewer will agree that this is outside the scope of the current manuscript.

      Minor points

      1. Some of the experiments compare "low", "medium" and "high" numbers of Mtb, but I could not find a definition of these numbers.

      We apologize for this oversight. In the revised manuscript, we have clarified the definition of these gates in the figure 2 legend.

      1. There seem to be no positive or negative controls for any of the antibodies used for cell staining (anti-cleaved caspase 1, antiphospho RIP3, anti-phospho MLKKL).

      As recommended by the reviewer, the revised manuscript includes controls for all of the antibodies used for immunostaining. In figure S12 we provide representative immunostaining images and fluorescence quantification of uninfected untreated macrophages (negative controls) and of uninfected macrophages treated with cocktails of molecules typically used to induce apoptosis, pyroptosis, or necroptosis (positive controls).

      Reviewer #3 (Significance):

      The results are novel, biologically interesting and pathophysiologically important.

      We thank the reviewer for their appreciation of our findings.

      References 1. H. Gan, et al., Mycobacterium tuberculosis blocks crosslinking of annexin-1 and apoptotic envelope formation on infected macrophages to maintain virulence. Nature Immunology 9, 1189–1197 (2008). 2. M. Divangahi, et al., Mycobacterium tuberculosis evades macrophage defenses by inhibiting plasma membrane repair. Nature Immunology 10, 899–906 (2009). 3. D. Mahamed, et al., Intracellular growth of Mycobacterium tuberculosis after macrophage cell death leads to serial killing of host cells. eLife 6, e22028 (2017). 4. A. J. Jimenez, et al., ESCRT Machinery Is Required for Plasma Membrane Repair. Science 343, 1247136 (2014). 5. M. van Engeland, L. J. W. Nieland, F. C. S. Ramaekers, B. Schutte, C. P. M. Reutelingsperger, Annexin V-Affinity assay: A review on an apoptosis detection system based on phosphatidylserine exposure. Cytometry 31, 1–9 (1998). 6. D. R. Hoff, et al., Location of Intra- and Extracellular M. tuberculosis Populations in Lungs of Mice and Guinea Pigs during Disease Progression and after Drug Treatment. PLOS ONE 6, e17550 (2011). 7. S. M. Irwin, et al., Presence of multiple lesion types with vastly different microenvironments in C3HeB/FeJ mice following aerosol infection with Mycobacterium tuberculosis. Disease Models & Mechanisms 8, 591–602 (2015). 8. Kaplan, G., et al., Mycobacterium tuberculosis Growth at theCavity Surface: a Microenvironment with FailedImmunity. Infection and Immunity 71, 7099–7108 (2003). 9. J. Timm, et al., A Multidrug-Resistant, acr1-Deficient Clinical Isolate of Mycobacterium tuberculosis Is Unimpaired for Replication in Macrophages. The Journal of Infectious Diseases 193, 1703–1710 (2006). 10. R. L. Hunter, Pathology of post primary tuberculosis of the lung: An illustrated critical review. Tuberculosis 91, 497–509 (2011). 11. G. Wells, et al., Micro–Computed Tomography Analysis of the Human Tuberculous Lung Reveals Remarkable Heterogeneity in Three-dimensional Granuloma Morphology. Am J Respir Crit Care Med 204, 583–595 (2021). 12. S. A. Stanley, S. Raghavan, W. W. Hwang, J. S. Cox, Acute infection and macrophage subversion by Mycobacterium tuberculosis require a specialized secretion system. Proc Natl Acad Sci USA 100, 13001 (2003). 13. S. M. Fortune, et al., Mutually dependent secretion of proteins required for mycobacterial virulence. Proc Natl Acad Sci U S A 102, 10676 (2005). 14. J. M. Chen, et al., Mycobacterium tuberculosis EspB binds phospholipids and mediates EsxA-independent virulence. Mol Microbiol 89, 1154–1166 (2013). 15. L.-Y. Gao, et al., A mycobacterial virulence gene cluster extending RD1 is required for cytolysis, bacterial spreading and ESAT-6 secretion. Mol Microbiol 53, 1677–1693 (2004). 16. V. Anil Kumar, et al., EspR-dependent ESAT-6 Protein Secretion of Mycobacterium tuberculosis Requires the Presence of Virulence Regulator PhoP. Journal of Biological Chemistry 291, 19018–19030 (2016). 17. J. Rybniker, et al., Anticytolytic Screen Identifies Inhibitors of Mycobacterial Virulence Protein Secretion. Cell Host & Microbe 16*, 538–548 (2014).

    1. ourfutures entangled together.Imbler 4

      We are still rapidly discovering new species and beings that we did not know to exist before on our planet. Their evolutionary trajectory may have split but they are ultimately tied to us by ancestry. However, we often look at creatures, especially new ones, and label them as alien, unsightly, repulsive, gross, creepy, etc. This destructive rhetoric leads to an irreverence toward their being. We do not know what the blob may feel, think, or do so why do we create this disconnect between them and ourselves? Respect can still be present without a connection but may take more effort. We must put in this effort, however, as they are ultimately our neighbors (evolutionarily and physically).

    2. confuse or repulse us

      This is abhorrence to the unknown is thought to be part of our psychology. We make patterns and molds in which to fit the world into. It makes internal processing much easier, especially when confronted with a dangerous situation. For example, when things, such as the sea-blob, go against our conceived notions of ocean animals we are more cautious as we don't know to internally lable them as "safe and cuddly" or "they will eat your arm off". (read: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4141622) I'm not a super big fan of this article as its very anthropocentric but it points to some ideas about patterns that have been thought to help us survive. While the above article may refute this, animals do a similar "patternization" of their surroundings. Its a shortcut to take in senses, really, but I think it often leads to, as Juliette said, schismogenesis. In my classes I haven't found much literature contradicting the "pattern" notions of sensing per psychology, as much as I don't like them. I'd be interested to see if other had come across any literature.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point by Point Description of Revisions

      We thank the reviewers for their time, effort and constructive input. Below, our responses are bolded with yellow highlighting, while the reviewers’ comments are italicized.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary

      The manuscript by Hays and colleagues described the spectrum of mutations that drive adaptation in nitrogen-limit environment by experimental evolution. The approach of serial transfer (fluctuating condition) allowed them to find that Ty insertion is the major mutation type for adaptive evolution. This was neither observed in nitrogen-limited condition when another experimental evolution approach, chemostat (non-fluctuating condition), was applied, nor in glucose-limited condition. The authors concluded that not only selection pressure itself but also how selection is applied are important to shape the adaptive events.

      Major points

      *Both serial transfer and chemostat are commonly used approaches of experimental evolution. In the manuscript, the authors refer serial transfer to "fluctuating" condition because the low nitrogen source would be consumed to none during the interval of transfers. I am wondering whether the authors have estimated the nitrogen uptake (consumption) during the transfer intervals and whether the nitrogen was exhausted within 48 hours. *

      We appreciate the reviewer’s question, and although we did not directly measure nitrogen consumption throughout this specific experiment, ammonium was the limiting nutrient in the defined medium which has been previously used to achieve transient nitrogen starvation conditions in other yeast experimental evolutions (Blundell et al. 2019). In that previous work, it was confirmed that addition of ammonium above 0.04% (up to 0.15%) led to additional rounds of doubling – confirming that the amount of ammonium provided was in fact the limiting nutrient. Finally, we point out that the adaptive mutations recovered in this study predominantly impact genes known to affect nitrogen catabolism, as is expected under nitrogen-limited evolution conditions.

      We’ve updated the methods section to ensure the rationale for this medium choice is clearly stated.

      Since this is not precisely controlled by experiment design, the "fluctuating" condition itself may be not stable during the long-term evolution. For example, as population evolved, the rate and the amount of nitrogen uptake might change. I feel a better experiment setup for "fluctuating" condition is like 24 hour "low-nitrogen (ammonium)" - 24 hour "no ammonium" and so on. If the adaptive mutations (e.g. adaptive Ty) specifically respond to such "fluctuating" condition rather than chemostat, the authors can measure their fitness in nitrogen starvation condition, which is expected to be fitter than mutants observed only in chemostat (e.g. copy number variation of nitrogen transporters).

      The reviewer correctly points out that nitrogen availability will change as the population adapts, and it is likely that some portion of the population become better at utilizing the newly available nitrogen upon transfer into fresh medium over time. This is in fact the intention of this experimental design. We have rephrased the text of the main paper to emphasize that our fluctuating conditions represent fluctuations in the nutrient availability in fresh medium upon transfer, and not strict oscillating nitrogen concentrations that cells experience locally throughout all generations.

      We note that in the reviewer-proposed experimental design (using 2 stages of low- and no- nitrogen media), that the low-nitrogen condition would still exhibit the same population-dependent nitrogen usage dynamics as the population adapts over time. We chose our evolution conditions to apply a selective pressure for cells to become best adapted to the environmental fluctuations associated with this transfer regimen, and we have updated the main paper to clarify this point. We thank the reviewer for helping us clarify this important point.

      The authors compared their results with published dataset using nitrogen-limitation chemostat and the mutation spectrum is different. In addition to the "fluctuating" and "non-fluctuating" difference as mentioned above, other factors need to be considered. First, the nitrogen-limited conditions in the two studies are different. The authors used 0.04% ammonium sulfate while Hong et al used "800 uM nitrogen regardless of the molecular form of the nitrogen", which may influence the mutation spectrum and need to be discussed. Second, bottlenecks were applied for each transfer in this study, in comparison with constant population size in chemostat, which will influence the efficiency of selection and further the evolutionary dynamics and outcomes. Thus, population size and bottlenecks need to take in to account to make comparisons of mutation spectrum.

      We thank the reviewer for their point: we have expanded the section of the main text addressing the differences in how serial transfer and chemostat conditions are applied, the media differences necessitated by such and specifically how the conditions between our study and the Hong et al study differ. We believe the additional detail now better highlights our point that how selection is applied shapes adaptive events, and we thank the reviewer for their helpful input.

      *The authors found that Ty mutagenesis accounts for a substantial number of adaptive mutations in nitrogen limitation. I am wondering for adaptive clones, whether Ty occurred independently or is more likely to co-exist with other drivers. *

      We appreciate the reviewer’s question. In the clones with adaptive Ty insertions, the only co-occurring adaptive mutation is autodiploidization. There were no additional mutational classes that were adaptive and co-occur with adaptive Ty insertions in our dataset. However, many novel Ty insertions are neutral, and these DO co-occur with beneficial mutations. These data are captured in Figure 5A, and in detail in Supplemental File 1. The blue bar in the adaptive haploids reflect neutral-fitness Ty insertions that co-occur with other mutations that drive fitness increase. These are distinct from the Ty insertions that are themselves responsible for the fitness increase, which are captured in the orange bar. We have clarified the text surrounding the Fig 5A results to better emphasize these findings.

      What is the distribution of number of clones with one, two, and multiple mutations? If there is co-existence of driver mutations, what is the relative contribution of each to adaptation? The phenotypic validation of Ty mutagenesis for adaptation is expected while it seems only one case was presented in Figure 2 (mep1Ty−731427).

      Aside from diploidization events, only one clone with two nitrogen-adaptive mutations was identified in this study: a double mutant with mutations in both gat1 and tor1. Please see Supplemental File 1 (which is sortable) for a complete outline of all clones with mutations and fitness remeasurements. In the case of diploids that have additional beneficial mutations, those data are shown in Figure 3 with diploids indicated as well as the ploidy of the secondary beneficial mutation, and again in detail in Supplemental File 1.

      The reviewer is correct in that only one Ty mutation was dissected and validated in Figure 2. However, we inferred adaptation by Ty insertion through the observation of parallel adaptation, and we fitness remeasurements of many independent Ty insertion mutants.

      Statistical analysis needs to be reinforced in the manuscript, including but not limited to Figure 2 fitness comparison among clones with different genotypes, Figure 5 Ty enrichment comparison, etc.

      We thank the reviewer for their helpful suggestion. We have updated figures and figure legends to more clearly include statistical comparisons between genotypes for Figures 2 and 5: specifically describing the analyses used and the associated p-values for differences between WT and adaptive alleles and significance of Ty class enrichments.

      Minor points

      We thank the reviewer for their detailed and careful edits below and have addressed them in the main text and figures as applicable.

      "For diploids, we only sequenced those with estimated fitness greater than diploidy alone would provide." Main text clarified with additional explanation

      "either through impacting alternate start (green triangle) or alternate stop sites (yellow and red triangles)." I do not see yellow and red triangles in Fig. 3. Legend updated to reflect current figure color palette.

      Fig.2. FCY2 mutant fitness can be added as well?

      Unfortunately, data for FCY2 backcrossed mutants were not generated

      "while we found only 212 novel Ty insertions in 488 glucose evolved clones (Figure 5B)" The value in the text does not match the one in the figure.

      We appreciate the reviewer’s attention to detail and have corrected the main text to match the correct value in Fig 5B.

      In addition to adaptive Ty insertion, what is the genome-wide distribution or characteristics of other Ty, especially for nitrogen-limited condition? Is that distinct from glucose-limited condition?

      Figure S5 addresses the major locations of Ty insertions upstream of tRNA genes, in both Glucose and Nitrogen limited evolutions, the insertion location previously published to be preferred; the only difference between glucose and nitrogen is that there are more in the nitrogen limited condition, though the profile of insertions upstream of tRNAs is essentially the same. In addition to insertions upstream on tRNAs, all other specific insertion locations are available in Supplemental File 1 and Supplemental File 4.

      "Studies determining at which step(s) of the Ty life cycle nitrogen starvation shapes ty activity would be needed to determine the specific mechanism underlying the increase in transposon insertions." Here "ty" => "Ty"

      Corrected! We thank the reviewer for their detailed reading.

      Reviewer #1 (Significance (Required)):

      The manuscript is a follow-up work of Levy et al. 2015 and Blundell et al. 2019. In general, the research is interesting and point out the important role of Ty for adaptive evolution in nitrogen-limited environment. It also compared the spectrum of adaptive mutations in response to nitrogen limitation by serial transfer (this work) and chemostat (especially the work of Gresham lab). The paper is well-written as well. Audience from the field of genetics, genomics and evolution will be interested in this work.

      My field of expertise: genetics, experimental evolution, budding yeast

      We thank the reviewer for their kind comments, constructive input, and generosity with their time.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Hays et al. sequence and analyze the mutational spectrum from a set of S. cerevisiae strains evolved in a nitrogen limiting environment, and detail genes that recurrently are found to be mutated in a fluctuating nitrogen limiting environment. These data are contrasted to evolution under glucose limited environments and non-fluctuating environments. Specifically, Hays et al. observe a high proportion of Ty element-mediated mutations arising from strains evolved under the fluctuating nitrogen limiting regime. Their fitness data are robust and clearly demonstrate that these mutations reproducibly lead to improved fitness under nitrogen limitation (based on the authors' defined criteria). Overall, the observed bias of the high proportion of Ty-mediated mutation in fluctuating nitrogen starvation is unexpected and an important finding. Further, the discussion was thoughtful and well executed in detailing interpretations of the data more broadly. We are generally positive about this work and find the analyses robust and convincing. The authors should address the concerns listed below prior to acceptance/publication.

      We thank the reviewer for their kind words and enthusiasm for our study, we have worked to address their constructive feedback as detailed below.

      Reviewer #2 (Significance (Required)):

      Major comments to be addressed:

      The claim that the 3' UTR Ty insertions in MEP1 are apparently gain of function is very interesting. The authors should consider performing RT-PCR or strand specific RNAseq to see whether the antisense transcript is reduced and the MEP1 transcript is increased in the presence of the 3' UTR insertion. This would provide much stronger support for their claim that MEP1 3' Ty insertions are gain of function. Orientation information is critical to provide!

      We agree that these future directions are exciting and of extreme interest! We however believe they are out of the scope of this current study which already includes substantial data and analysis. We note that we did not claim that the 3’ UTR insertions are gain of function – instead, we suggested that “Ty insertions in the 3’ region unique to the MEP1 locus may affect fitness in nitrogen limitation via a mechanism different than the putative gain of function missense mutations in the coding region itself”. We did not speculate on the mechanism by which these insertions are adaptive, but it is an active line of research and we look forward to discovering the mechanism.

      The authors seemed to miss a golden opportunity to measure Ty1 expression or transposition under fluctuating/non-fluctuating nitrogen starvation. Otherwise, the claims of increased Ty activity are unsupported. The authors measured an endpoint (Ty insertion), but this says nothing directly as to the rate of activity, although it is presumably correlated. However, based on the data one could argue activity may be equal in all environments, but the mutational events caused by Ty activity are uniquely selected for in fluctuating nitrogen starvation. As it stands, either model (increased activity vs. differential strength of selection) are equally likely. At a minimum, the authors should at least address this point.

      We appreciate the reviewer bringing this concern to our attention: we address the reviewer’s concerns in 3 ways: First, we’ve rephrased to more explicitly consider the possibility that the observed difference in novel Ty insertions could be driven at the level of selection, not activity. Second, we’ve clarified the main text to greater emphasize our reasoning for why we speculate the inference of greater Ty activity under nitrogen starvation may be more likely based on the level of presumptive neutral Ty insertions being greater in nitrogen than in glucose (even after normalization for the number of evolved generations). Third, we’ve performed additional experiments that support that, at least with an artificial retrotransposition reporter construct, these starvation conditions show additional Ty activity in nitrogen compared to glucose (note, we have not carried out such experiments in chemostats, and do not currently have a functioning chemostat set up). We’re including these results below, though have not included them in the manuscript, as we intend to generate additional data for a subsequent study to make these claims more robust. We feel that adding them to this manuscript would make it less focused.

      To assess Ty activity in yeast experiencing different nutrient conditions, we used a modified version of a plasmid-based Ty reporter created previously by Curcio and Garfinkel, 1991, PNAS 88(3):936-40. The original reporter construct used an inducible GAL promoter to initiate Ty transcription from the plasmid, and new Ty insertions confer the ability for the strain to grow on SC-His. To assess Ty activity induced by nitrogen limitation, we excised the GAL promoter and instead used the native Ty promoter from the insertion found at YPLWTy1-1. This Ty promoter was selected based on having recovered novel Ty insertions in evolved clones that originated from this locus.

      Plasmid pGS234 was created by replacing the promoter containing XhoI fragment from pGTy1mhis3-AI with XhoI fragment containing promoter from chromosomal location of YPLWTy1-1.

      Strains bearing the Ty reporter plasmid pGS234 were subjected to nitrogen limited media and glucose limited media to assess transposon activity in these conditions. We observe significantly more Ty activity from the reporter plasmid in nitrogen-limited conditions than in glucose limited conditions or in SC-ura medium (see Figure below).

      Panel A: Bars represent average of three WT strains with transposon reporter plasmid; each value is number of colonies on SC-His medium with each His+ colony representing independent Ty transposition events. Strains were grown in SC-Ura and then shifted to M14, M3 or SC-Ura as a control for 48 hours and plated on SC-His plates.

      Panel B. One WT strain with pGS234 was subjected to a fluctuation test (16x 5ml tubes) in M14 and M3 media. Each dot represents the number of colonies on each SC-His plate. Kruskal-Wallis chi-squared = 23.341, df = 1, p-value = 1.357e-06

      In line with the above, we think the authors should soften some points in the discussion as it stands. For example: "The significant increase of Ty activity under this specific fluctuating nitrogen-starvation..." We feel the data does not exclusively support increased activity of Ty, that would require the aforementioned assays. As it stands, we feel this is more appropriate: ": "The significant increase of Ty insertions under this specific fluctuating nitrogen-starvation..."

      We edited the main text to include this suggested language change.

      Minor comments to be addressed:

      Please provide a citation for the following statement "The single copy of Ty5 in the ancestor is known to be inactive and gives rise to no new insertions under either glucose or nitrogen limitation" - Voytas & Boeke. Nature 1992.

      We appreciate the reviewer catching this, and the reference has been added.

      We found the following to be a confusing sentence: "Indeed, if global Ty derepression reflects a host-parasite coevolution that minimizes host cost and maximizes potential for survival of both, the role of transposons in host evolvability is important (Levin and Moran 2011)."

      We have clarified this sentence by editing it to: “Indeed, the role of transposons in host evolvability is important: global Ty derepression could reflect host-parasite coevolution towards a less parasitic lifestyle: resulting in minimal host cost and maximized potential for survival of both, especially under detrimental environmental conditions (Levin and Moran 2011)”

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Hays et al. studied the genomic changes that lead to adaptation under fluctuating nitrogen starvation. In addition to loss of function alleles, the authors identified adaptive gain-of- function alleles. Furthermore, their results demonstrate that Ty and microhomology-facilitated mutations in several candidate genes contribute substantially (though not exclusively) to the adaptation under nitrogen-limited serial transfer. Importantly, a novel lineage tracking method provides high resolution fitness measurements.

      We appreciate the reviewer’s helpful edits in clarifying and improving the manuscript, and appreciate their time and constructive input.

      Despite the clear merits of the study, we also have a few relatively minor questions and suggestions

      • Please elaborate on the criteria they used to identify adaptive loci. The fact that these mutations occurred repeatedly is highlighted on Table 1, but perhaps numbers could also be included in the text, to increase clarity.* We have added the pertinent numbers to the main text to accompany the values captured in Table 1 and Supplemental file 1 and further emphasize selection criteria outline in the main text.

      • "Were also validated to a fitness effect of >0.01 in nitrogen-limited media". More details about the selection of this cut-off value need to be provided in either the text or the Methods section to increase clarity.*

      We agree and have clarified the limit of detection used in the methods section.

      • In Figure 3 it seems that the type of observed mutations was less important compared to the gene where the mutation occurred. Therefore, it seems that some genes, e.g. GAT1, contribute more to the observed fitness change. It would be beneficial if the authors discussed this observation.*

      We thank the reviewer for their observation and have included some additional discussion in the main text around the per-locus fitness observations as shown in Figure 3.

      • What was the reason to select samples from the 88th generation for glucose and from the 192nd generation for nitrogen, as presented in Figure 5? How does this affect the observations?*

      We thank the reviewer for their question: these generations were determined to best capture peak adaptive diversity (as discussed in Blundell et al 2019), based on population barcode dynamics in the original evolutions (Levy et al 2015, Blundell et al. 2019). The challenge is balancing picking a time point late enough, such that there are sufficient numbers of adaptive clones within independent lineages, yet early enough that few mutations have occurred (ideally only a single adaptive mutation per sequenced clone) and that no very fit clones have taken over the population. Because the fitness effects of beneficial mutations in glucose limited media were larger than in nitrogen limited media it was necessary to choose a later timepoint in the Nitrogen limited evolutions, to allow for there to be a sufficient fraction of the population carrying adaptive mutations. We believe this peak diversity makes these samples the most relevant for broadly assessing the adaptive mutational spectra.

      • The use of statistics is not always clear. Please provide a clear indication of the statistical methods/tests used, eg for Figure 5.*

      We thank the reviewer for this important point and have updated figures 2 and 5 and their corresponding legends for clarity surrounding statistical analysis used.

      • The authors could include a supplementary Table, summarising their findings on GAT1 locus, since the text is extensive and it is difficult to put all the information into perspective.*

      We note that row one of Table 1 in the main text is exactly this overview of the mutations observed at the GAT1 locus. These mutations plus specific location and their fitness remeasurements are shown in Figure 3 panel A, and detailed descriptions of the mutations for each clone are also available in the sortable table in Supplemental File 1. For these reasons we’ve not included an additional GAT1-specific table.

      • The introduction is extremely detailed and informative, but at the same time quite lengthy; shortening it and only keeping the most relevant parts may increase readability.*

      We appreciate the reviewer’s perspective but have not made substantial changes to remove information from the introduction as we feel that each of the subsections of the introduction are necessary to provide the appropriate context to the study.

      • More detailed figure legends (which should also include a brief mentioning of the statistics & sample size) would benefit comprehensibility. For example the black lines in Figure S4 are not described anywhere in the text.*

      We agree and have added further description of statistics used in legends throughout. Description of the black lines in Figure S4 has been included.

      • "Many of the 332 clones ... were beneficial" à rephrase.*

      We have updated this sentence to clarify our intent.

      Reviewer #3 (Significance (Required)):

      Apart from the elegant characterization of adaptive mutations, perhaps the most important part of the study is that it highlights the importance of a particular selection regime. Together, the findings extend our knowledge on this important topic.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Hays et al. sequence and analyze the mutational spectrum from a set of S. cerevisiae strains evolved in a nitrogen limiting environment, and detail genes that recurrently are found to be mutated in a fluctuating nitrogen limiting environment. These data are contrasted to evolution under glucose limited environments and non-fluctuating environments. Specifically, Hays et al. observe a high proportion of Ty element-mediated mutations arising from strains evolved under the fluctuating nitrogen limiting regime. Their fitness data are robust and clearly demonstrate that these mutations reproducibly lead to improved fitness under nitrogen limitation (based on the authors' defined criteria). Overall, the observed bias of the high proportion of Ty-mediated mutation in fluctuating nitrogen starvation is unexpected and an important finding. Further, the discussion was thoughtful and well executed in detailing interpretations of the data more broadly. We are generally positive about this work and find the analyses robust and convincing. The authors should address the concerns listed below prior to acceptance/publication.

      Significance

      Major comments to be addressed:

      The claim that the 3' UTR Ty insertions in MEP1 are apparently gain of function is very interesting. The authors should consider performing RT-PCR or strand specific RNAseq to see whether the antisense transcript is reduced and the MEP1 transcript is increased in the presence of the 3' UTR insertion. This would provide much stronger support for their claim that MEP1 3' Ty insertions are gain of function. Orientation information is critical to provide!

      The authors seemed to miss a golden opportunity to measure Ty1 expression or transposition under fluctuating/non-fluctuating nitrogen starvation. Otherwise, the claims of increased Ty activity are unsupported. The authors measured an endpoint (Ty insertion), but this says nothing directly as to the rate of activity, although it is presumably correlated. However, based on the data one could argue activity may be equal in all environments, but the mutational events caused by Ty activity are uniquely selected for in fluctuating nitrogen starvation. As it stands, either model (increased activity vs. differential strength of selection) are equally likely. At a minimum, the authors should at least address this point.

      In line with the above, we think the authors should soften some points in the discussion as it stands. For example: "The significant increase of Ty activity under this specific fluctuating nitrogen-starvation..." We feel the data does not exclusively support increased activity of Ty, that would require the aforementioned assays. As it stands, we feel this is more appropriate: ": "The significant increase of Ty insertions under this specific fluctuating nitrogen-starvation..."

      Minor comments to be addressed:

      Please provide a citation for the following statement "The single copy of Ty5 in the ancestor is known to be inactive and gives rise to no new insertions under either glucose or nitrogen limitation" - Voytas & Boeke. Nature 1992.

      We found the following to be a confusing sentence: "Indeed, if global Ty derepression reflects a host-parasite coevolution that minimizes host cost and maximizes potential for survival of both, the role of transposons in host evolvability is important (Levin and Moran 2011)."

    1. ests, "the Charlie Chan character became institutionalized as the nonthreatening Asian (read: a physical wimp, a sexual deviant,and a political yes-man)."20 Indeed, while Chan's expertise as a detective may be theresult of his "Chinese" understanding of human nature, his appeal to Americanaudiences was the fact that he was a polite, soft-spoken, well-groomed, familyman who had adopted middle-class, American v

      I think that it is kind of interesting that if you fall into the categories of being well groomed and polite then you could be seen as a stereotype. while I feel that while we shouldnt be restrained into any type of role or stereotype I also feel that it is strange to say that doing certain things someone does out of free will will make you a stereotype or almost like a traitor to your origins.

  2. westernsydney.pressbooks.pub westernsydney.pressbooks.pub
    1. Systematic Review Subject Guide Endnote Subject Guide Discipline specific Subject Guides Databases by title

      I wonder if we should specify these LibGuides - as I think the outcome of the LibGuide review may mean these could change??

    1. Jan. 22. To set down such choice experiences that my own writingsmay inspire me and at last I may make wholes of parts. Certainly it isa distinct profession to rescue from oblivion and to fix the sentimentsand thoughts which visit all men more or less generally, that thecontemplation of the unfinished picture may suggest its harmoniouscompletion. Associate reverently and as much as you can with yourloftiest thoughts. Each thought that is welcomed and recorded is anest egg, by the side of which more will be laid. Thoughts accidentallythrown together become a frame in which more may be developedand exhibited. Perhaps this is the main value of a habit of writing, ofkeeping a journal,—that so we remember our best hours and stimulateourselves. My thoughts are my company. They have a certainindividuality and separate existence, aye, personality. Having bychance recorded a few disconnected thoughts and then brought theminto juxtaposition, they suggest a whole new field in which it waspossible to labor and to think. Thought begat thought.

      !!!!

      Henry David Thoreau from 1852

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors use an unclassified quaranjavirus, Wǔh�n mosquito virus 6 (WuMV-6), to demonstrate the possibility of orthomyxvirid global transmission dynamic analyses. The focused surface protein analysis strongly indicates a vertebrate host for WuMV-6 in addition to the insect host. The analysis is then expanded to other quaranjaviruses, which differ considerably in their surface glycoproteins, indicating a complex evolution. Finally, the authors scientifically demonstrate that orthomyxovirids are undersampled and hence that this family will have to expand considerably in the future.

      Major comments: none

      We thank the reviewer for a succinct summary of our study and we are very glad the key messages were sufficiently clear.

      Minor comments: The article lacks precision and hence some global edits are in order. Generally:

      1. For clarity to the reader, please introduce the family Orthomyxoviridae, i.e., its current official composition (i.e., 9 genera, 21 species, and 22 viruses) so the reader is not confused by terms such as "quaranjavirus" or "isavirus" etc.).

      This is a fair request though we would prefer to err on the side of caution with regards to the precise number of taxonomic ranks given the flux viral taxonomy has experienced and in light of the deluge of new taxa being discovered all the time. We refer to the “traditional” view of orthomyxovirid taxonomy at the genus level, encompassing the genera described up until 2011.

      After that, please clearly indicate which viruses are classified and which ones are not. For instance, the main virus dealt with in this paper is unclassified, and so are Astopletus and Ūsinis viruses.

      We do not think this is reasonable since virtually all RNA viruses discussed in the text are not classified and their status as such has little bearing on any of our findings.

      Please ensure correct spelling, including diacritics, of the viruses and abbreviations throughout: Wǔh�n mosquito virus 6 (WuMV-6); H�běi orthomyxo-like virus 2 [note the deletion of one "virus"]; Wēnlǐng orthomyxo-like virus 2

      Thank you for the comment, we have added the diacritics where we could identify them but may have missed some.

      For orientation of the reader, please refer to family groups of viruses as -virids (e.g., "orthomyxovirids", "human coronavirids", "some rhabdovirids"). This way it is clear to the reader that, for instance, "quaranjaviruses" refers to a genus-level group

      Thank you, we agree that this adds much needed precision in terminology.

      "influenza" is a disease. There are several viruses that can cause influenza; they belong to four different genera. Please scan for "influenza" and replace each either with a virus name (for instance, in the abstract, "...RNA viruses containing influenza A virus" or with a genus name (e.g., "alphainfluenzaviruses")

      Our apologies for that misnomer. The text has been corrected.

      Please ensure the differentiation of taxa (concepts), such as species, and viruses (things). Orthomyxoviridae cannot infect anything, it can also not be sampled etc. Orthomyxovirids, the physical members of Orthomyxoviridae can infect things. Most instances of "Orthomyxoviridae" should be replaced accordingly.

      Thank you for the comment, this has been corrected as suggested.

      In particular:

      1. The title doesn't make much sense. Orthomyxovirids are not taxonomically incomplete - they are things that we simply may not have samples or may have characterized incompletely. Also, the analyses are largely restricted to quaranjaviruses. Hence, I would suggest "...genome evolution, and broad diversity of quaranjaviruses"

      Our apologies for the confusion. The analyses we carried out to quantify evolutionary orthomyxovirid diversity likely waiting to be discovered was carried out on all known (at the time) members of ____Orthomyxoviridae____ and thus the title must still refer to the entire family rather than quaranjavirids. We felt that the term “taxonomic incompleteness” imparts on the reader exactly what the reviewer refers to, namely that new taxonomic ranks are likely to come as more evolutionary diversity gets uncovered. Alternative and more precise formulations, like referring to evolutionary incompleteness or something similar, would miss the fact that it is taxonomy that discretises the otherwise continuous evolutionary change.

      Abstract: genomes are not employed and do not make money. Please replace "employed" with "used"

      We have to respectfully disagree since the definition of the word “employ” also includes the meaning “to make use of”.

      Re: point 6 above, Introduction: species/families etc. cannot be discovered. They are being established by people for viruses that may be discovered. Please fix here and elsewhere (in most cases, "species" should be replaced with "viruses")

      We agree that taxonomic ranks are designated and not discovered and have changed the text accordingly.

      P3, second paragraph: please place "jingmenviruses" in quotation marks as this is not an official term (yet). Please add "potentially" ("as potentially causing human disease"). Even the authors only speak of an "association" and do not fulfill Koch's postulates

      We have to respectfully disagree here too. “Jingmenviruses” as a term is unambiguous in referring to a group of related segmented flaviviruses even though the groups is not officially assigned a taxonomic rank. We have altered the text to add uncertainty to the claim that jingmenviruses cause disease in humans.

      P3, top right column: "e.g., the tick-borne Johnston Atoll quaranja- and thogotoviruses" is ambiguous. Please change to "e.g., the tick-borne quaranja- and thogotoviruses" or list particular viruses and clarify which belong to which genus

      Apologies for the confusion. We fixed this instance.

      P3, right column "smaller number" - change to "lower number"

      We have altered the offending sentence in response to reviewer 2 and this combination of words is no longer present.

      P3, right column "or only the polymerase" - makes no sense to the reader as it has not been introduced; and grammatically needs to be improved as the polymerase is also encoded on a segment. Likewise, PB1 makes no sense to unacquainted reader - maybe add a few sentences to the intro right after the family introduction on general genome composition and that PB1 is part of the polymerase holoenyzme?

      We have altered the offending sentence in response to reviewer 2 but we take the point. We’ve added detail about the RNA-directed RNA polymerase of orthomyxovirids to the introduction.

      P4: the Ebola virus glycoprotein is called GP1,2 [with 1,2 in subscript] (also Figure 2 legend)

      Respectfully, while the reviewer is technically correct in that the glycoprotein of Ebola virus is referred to as GP_1,2 in proteomics literature (the 1,2 referencing the protein held together by a cysteine bridge post-cleavage), calling it GP is not out of place in evolutionary studies and the term “Ebola virus GP” is unambiguous to the reader.

      P4: please change "West Africa" to "Western Africa" (the designation of the area by the UN)

      Unfortunately, while we agree that the reviewer is correct in that the UN refers to the region as “Western Africa”, references to the “West African Ebola virus epidemic” are ubiquitous in the literature and thus we do not see the reason to change the term here either.

      P6: change "with Rainbow / Steelhead trout orthomyxviruses" to "with mykissviruses (rainbow trout orthomyxovirus and steelhead trout orthomyxovirus)" [note that virus names are not capitalized except for proper noun components; hence also "infectious salmon anemia virus, bottom right column]

      While we recognise that viruses related to infectious salmon anaemia virus discovered in trout have received a separate taxonomic designation we feel very strongly about not mentioning it in our manuscript. Our fear is that “mykissviruses” have been designated too hastily on the basis of a handful of representatives and that relatives discovered in the future may show an indiscernible continuum between “mykissviruses” and isaviruses, invalidating the former as a valid term. We would therefore strongly prefer to keep references to specific viruses rather than a taxonomic designation that may disappear so that a future reader may have an easier time with our study.

      P6, right column: please change "RNA-dependent" to the IUPAC/IUB-correct "RNA-directed"

      Done.

      Figure 2 is too small. I could not figure out B with or without my confocals... Likewise S2, S3 are way too small. In Fig 2 legend, please place "spike" into lower case

      We understand the reviewer’s concern here but Figure 2B was a compromise between vertical space available on a page, the number of taxa in the PB1 tree, and what we thought important to communicate - the variation in segment number across orthomyxoviruses and mapping of PB1 diversity to gp64 diversity. This was done at the expense of individual taxon name visibility whilst fully zoomed out. To remedy this Figure 2B was rendered in 300 dpi resolution such that zooming in will show individual taxon names clearly. We ultimately hope to publish our study in an online-only journal where printing will not present an issue. Likewise for figures S2 and S3. We have changed “Spike” to be lower case in the legend.

      Figure 3: correct spelling of virus names (from top to bottom): rainbow trout orthomyxovirus, infectious salmon anemia virus, influenza C virus, influenza D virus, influenza A virus, influenza B virus, Wēnlǐng orthomyxo-like virus 2, Dhori virus, Thogoto virus, Jos virus, Aransas Bay virus, ... Johnston Atoll virus, Quaranfil virus, H�běi orthomyxo-like virus 2, Hǎin�n orthomyxo-like virus 2, Wǔh�n mosquito virus 6. Also apply to S6 and others where applicable.

      The names for viruses in Figure 3 were taken directly from their NCBI records and since we do not show their accessions there is no other way to disambiguate them to the reader. We have, however, added the necessary diacritics where appropriate.

      [PS: based on the somewhat backward, non-UNICODE editorial manager system, I am worried that the diacritics in virus names above are not rendered corretly. If so, please look up the Pinyin spelling of Wuhan, Hainan, Wenling etc. - easiest way is to search Wikipedia for the terns and then identify the Pinyin spelling, which is typically pointed out]

      CROSS-CONSULTATION COMMENTS

      I think we (all reviewers) are all largely in agreement - this is a very useful study; the manuscripts just needs various adjustments. I agree with the requests of the other two reviewers.

      Reviewer #1 (Significance (Required)):

      The strength of the paper is that it provides a road map on how undersampled taxa may be analyzed and which kind of information can be gleaned from these analyses. The paper also demonstrates that the analysis of seemingly "unimportant" viruses can prove important. The limitation of the paper is that there is no true novel revelation here. The sampling sites of WuMV-2 GenBank records already suggest broad distribution, which often goes along with sequence diversity; the continued discovery of orthomyxovirids in metagenomic studies clearly implied undersampling (but it is nice to have this "gut feeling" scientifically fortified now). The paper is useful for evolutionary virologists, virus taxonomists, orthomyxovirid specialists, and invertebrate virologists.

      We respectfully disagree with the reviewer and believe they may have missed an important point raised by our study. We do not claim that a global distribution of WuMV6 is what makes it remarkable but that its sampled diversity is 1) sufficient to calibrate molecular clocks (in our experience this is not always the case for arthropod viruses) and 2) that WuMV6 has reached its current global distribution ____recently____.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This is a nice example of bringing together a variety of data from metatranscriptomic studies to answer fundamental evolutionary questions in the field of viral evolution. There is a focus on a single virus family, and although some might see this as a little restrictive, I think the 'deep-dive' presented in this paper leaves space for a relatively detailed and comprehensive analysis. No doubt, other studies will gain inspiration from the approach presented here and expand this work to other viral groups.

      Overall, the paper is very well written, and the figures are of a very high quality. It is a shame that there are only 3 main figures in the paper because the supplementary figures are well presented and informative.

      We thank the reviewer for the kind words.

      The manuscript discusses the importance of host quite a bit, and for that reason it would have been nice to try and incorporate the host of the various viruses into the figures somehow (perhaps as a supplementary, since the trees are already quite busy). This might help orientate the reader).

      While we appreciate that host information is of interest, we foresee several issues. For one, we refer to broad host classes (essentially arthropod versus vertebrate) because they are largely determined by membrane fusion protein classes, the actual focus of our study, which exhibit strong phylogenetic signal. Secondly, host information in metagenomic studies can be imprecise, incorrect or unavailable.

      I have some minor comments or suggestions for the authors to consider below. Note, please use line numbers in the future for your submissions.

      A paragraph in the discussion laying out the limitations of this approach would be useful to the reader and would make this excellent paper even more robust.

      Thank you for the suggestion. We presume the reviewer is referring to our interpolation of orthomyxovirid diversity and included a few sentences about the limitations of this approach in the Discussion.

      Pg 3. The sentence starting 'The vast majority of known orthomyxoviruses use one...' should be made into two sentences to make it easier to read. A second sentence for the arthropod description is the obvious edit.

      We appreciate the suggestion and have included it in the manuscript.

      Pg 3. 'The number of segments of orthomyxoviruses with genomes known to be complete varies from 6 to 8'. Rephrase to - 'Orthomyxoviruses genomes are known to have 6-8 segments, but many metagenomically discovered viruses in this group have incomplete genomes...etc...',

      Thank you for the suggestion, it has been included.

      Figure 1 - what do the white triangles mean? Are these the directions of reassortment? This should be explained in the legend...

      We apologise for the omission, this is now explained.

      New Zealand is covered up by the circular tree. It looks like there is a point which is partially obscured.

      The reviewer spotted a mistake on our part here. The figure included the coordinates for Wellington, New Zealand when the detection was actually in Wellington Shire, Australia. This has been fixed.

      PD analysis - t I think you assume that viruses are static in this analysis. As we all know, they continue to mutate and eventually new species will evolve. Is it possible to consider the mutation rate in this analysis and the evolution of new variants/ eventually leading to new species? It might be complicated, and maybe a matter for future work, but it might be worth discussing this as a limitation at the very least. Especially when extrapolating to the future (although you do not extrapolate too far, so maybe this is not an issue here...). You could choose to discuss this in relation to the bird analogy (which was great), and compare the rate of mutation which will lead to the evolution of new species on a totally different time scale.

      We appreciate the point raised by the reviewer and while we wholly agree that the possibility of new viral taxa arising over time is an important caveat, we felt the discussion ends up being rather short. On one hand taxa definitions for different viral groups can be different, and on the other speciation in RNA viruses is difficult to place in absolute time because of a phenomenon called time-dependence of evolutionary rates. Methods accounting for the latter using sophisticated models or external calibration points would seem to imply that speciation timescales exceed those of research.

      Discussion: When discussing the hypothesis that WMV6 diversity is a result of repeat exposure to vertebrate hosts, can you also describe the alternative hypothesis here, and why the evidence leads you to put more weight on the former.

      This is a fair question and we have mentioned an alternative hypothesis in the discussion that’s been brought up by our colleagues before. It’s a hypothesis that alternating between different hosts induces divergent selection pressures on gp64. We contend that since gp64 proteins are thought to use a highly conserved host receptor (NPC1) we think it likely that no major changes are required when switching hosts. We are open to discussing other alternatives if the reviewer has suggestions.

      CROSS-CONSULTATION COMMENTS

      Seems like we are all in agreement and that after some minor adjustments this will be an excellent contribution.

      Reviewer #2 (Significance (Required)):

      Please see my review above. I did not use your formatting suggestions since I only saw it upon completing my review.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary

      This manuscript describes the use of data from metagenomic analyses to make inferences about the evolutionary and geographic history of the Orthomyxoviridae family of viruses and their hosts. Data from Wuhan Mosquito Virus 6 (WMV6) derived from various RNA-seq analyses is used to analyse loss and gain of virus segments over time, the time since the last common ancestor of these segments and the selection pressure acting on different genes. These results are used to hypothesise about which species have vectored this virus in the past and their geographic distribution. The additional phylogenetic diversity provided by characterisation of additional viruses of this species is quantified and projected into the future to demonstrate the value of further work in this area. The study also demonstrates more generally the benefit of additional sequencing and of characterising viruses in metagenomic datasets, even in cases where novel viruses are not identified.

      Major Comments

      The methodology in this manuscript appears to be sound and the results support the conclusions. Appropriate and detailed analyses have been performed and are described in detail. Code is provided to allow the results to be reproduced. The figures are informative and very well presented. I do not think any additional analyses are required.

      We thank the reviewer for the kind words.

      Minor Comments

      The manuscript is a little hard to follow in places. I think a brief introduction of WHV6 in the introduction section would help with this - where has it been isolated previously and what is known about its evolutionary history (if anything), how is it related to other Orthomyxoviruses. This information is included later but it would improve the flow of the paper to include it in the introduction.

      We apologise for the inconvenience and agree with the reviewer. We have improved the flow of the manuscript per reviewer suggestion.

      I think including a little more about the Method in the Results section would also be helpful, to save the reader jumping back and forth in order to understand the results. For example, at the beginning of the results section, briefly detailing how many samples were included, their broad geographic location and what the analysis is intended to show (e.g. "three full length sequences isolated from China, seven from Australia [...], between 1995 and 2019, were used to generate a reassortment network, in order to show.....") would be helpful. Each of the subsections of the Results would benefit from something similar.

      Apologies for the lack of clarity on our part. We have added more methodological information to each section in the results.

      Although it is clear in the Materials and Methods which datasets have been included, it is less apparent why these were selected. For example, in Figure 1A there are five countries listed - are these countries for which a particularly large amount of full length sequences were available or for which any full length sequence is available? Similarly, for Figure 1B, are these all of the countries where a dataset has originated containing any segment of WHV6?

      The confusion is entirely our fault as we have clearly not provided sufficient detail. This has been fixed now by explaining this better in the methods and Figure 1 legend.

      In the Discussion, it is stated that the frequency and fast evolution of WMV6 place it uniquely to enable tracking of mosquito populations, however there is no evidence presented to support this - does WMV6 evolve faster or occur more frequently than other mosquito RNA viruses?

      Our apologies for the jump in logic. We now expand on what we meant by the following sentence in the discussion: “In our experience, metagenomically discovered RNA viruses can be rare or, when encountered often, do not always contain sufficient signal to calibrate molecular clocks (Webster et al. 2015).”

      CROSS-CONSULTATION COMMENTS

      I also agree with the requests of the other two reviewers and that the manuscript will be in great shape once these are included.

      Reviewer #3 (Significance (Required)):

      This manuscript is very interesting, for the specific results presented here but, more importantly, in opening up further avenues for investigation. The study provides a proof of concept for using viruses derived from metagenomic data for specific and detailed evolutionary and ecological analyses of a single species. The scope of the analysis performed on WMV6 is not particularly broad, but it differs from the typical analysis of viruses in metagenomic datasets, which tends to focus on identification and characterisation of novel viruses only. I believe that this work is valuable to others working in the field, reveals additional potential in existing data and could provide inspiration for many future studies. To my knowledge, it is one of the first studies to focus on a single, fairly under-studied virus, and draw ecological conclusions based on only bioinformatic analyses.

      I think the results presented here for WMV6 may be of interest to a specialised audience, but that the manuscript overall is valuable to a broad audience, including ecologists, evolutionary biologists and virologists conducting fundamental science research.

      We appreciate the reviewer’s kind words.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors use an unclassified quaranjavirus, Wǔhàn mosquito virus 6 (WuMV-6), to demonstrate the possibility of orthomyxvirid global transmission dynamic analyses. The focused surface protein analysis strongly indicates a vertebrate host for WuMV-6 in addition to the insect host. The analysis is then expanded to other quaranjaviruses, which differ considerably in their surface glycoproteins, indicating a complex evolution. Finally, the authors scientifically demonstrate that orthomyxovirids are undersampled and hence that this family will have to expand considerably in the future.

      Minor comments:

      The article lacks precision and hence some global edits are in order. Generally:

      1. For clarity to the reader, please introduce the family Orthomyxoviridae, i.e., its current official composition (i.e., 9 genera, 21 species, and 22 viruses) so the reader is not confused by terms such as "quaranjavirus" or "isavirus" etc.).

      2. After that, please clearly indicate which viruses are classified and which ones are not. For instance, the main virus dealt with in this paper is unclassified, and so are Astopletus and Ūsinis viruses.

      3. Please ensure correct spelling, including diacritics, of the viruses and abbreviations throughout: Wǔhàn mosquito virus 6 (WuMV-6); Húběi orthomyxo-like virus 2 [note the deletion of one "virus"]; Wēnlǐng orthomyxo-like virus 2

      4. For orientation of the reader, please refer to family groups of viruses as -virids (e.g., "orthomyxovirids", "human coronavirids", "some rhabdovirids"). This way it is clear to the reader that, for instance, "quaranjaviruses" refers to a genus-level group

      5. "influenza" is a disease. There are several viruses that can cause influenza; they belong to four different genera. Please scan for "influenza" and replace each either with a virus name (for instance, in the abstract, "...RNA viruses containing influenza A virus" or with a genus name (e.g., "alphainfluenzaviruses")

      6. Please ensure the differentiation of taxa (concepts), such as species, and viruses (things). Orthomyxoviridae cannot infect anything, it can also not be sampled etc. Orthomyxovirids, the physical members of Orthomyxoviridae can infect things. Most instances of "Orthomyxoviridae" should be replaced accordingly.

      In particular:

      1. The title doesn't make much sense. Orthomyxovirids are not taxonomically incomplete - they are things that we simply may not have samples or may have characterized incompletely. Also, the analyses are largely restricted to quaranjaviruses. Hence, I would suggest "...genome evolution, and broad diversity of quaranjaviruses"

      2. Abstract: genomes are not employed and do not make money. Please replace "employed" with "used"

      3. Re: point 6 above, Introduction: species/families etc. cannot be discovered. They are being established by people for viruses that may be discovered. Please fix here and elsewhere (in most cases, "species" should be replaced with "viruses")

      4. P3, second paragraph: please place "jingmenviruses" in quotation marks as this is not an official term (yet). Please add "potentially" ("as potentially causing human disease"). Even the authors only speak of an "association" and do not fulfill Koch's postulates

      5. P3, top right column: "e.g., the tick-borne Johnston Atoll quaranja- and thogotoviruses" is ambiguous. Please change to "e.g., the tick-borne quaranja- and thogotoviruses" or list particular viruses and clarify which belong to which genus

      6. P3, right column "smaller number" - change to "lower number"

      7. P3, right column "or only the polymerase" - makes no sense to the reader as it has not been introduced; and grammatically needs to be improved as the polymerase is also encoded on a segment. Likewise, PB1 makes no sense to unacquainted reader - maybe add a few sentences to the intro right after the family introduction on general genome composition and that PB1 is part of the polymerase holoenyzme?

      8. P4: the Ebola virus glycoprotein is called GP1,2 [with 1,2 in subscript] (also Figure 2 legend)

      9. P4: please change "West Africa" to "Western Africa" (the designation of the area by the UN)

      10. P6: change "with Rainbow / Steelhead trout orthomyxviruses" to "with mykissviruses (rainbow trout orthomyxovirus and steelhead trout orthomyxovirus)" [note that virus names are not capitalized except for proper noun components; hence also "infectious salmon anemia virus, bottom right column]

      11. P6, right column: please change "RNA-dependent" to the IUPAC/IUB-correct "RNA-directed"

      12. Figure 2 is too small. I could not figure out B with or without my confocals... Likewise S2, S3 are way too small. In Fig 2 legend, please place "spike" into lower case

      13. Figure 3: correct spelling of virus names (from top to bottom): rainbow trout orthomyxovirus, infectious salmon anemia virus, influenza C virus, influenza D virus, influenza A virus, influenza B virus, Wēnlǐng orthomyxo-like virus 2, Dhori virus, Thogoto virus, Jos virus, Aransas Bay virus, ... Johnston Atoll virus, Quaranfil virus, Húběi orthomyxo-like virus 2, Hǎinán orthomyxo-like virus 2, Wǔhàn mosquito virus 6. Also apply to S6 and others where applicable.

      [PS: based on the somewhat backward, non-UNICODE editorial manager system, I am worried that the diacritics in virus names above are not rendered corretly. If so, please look up the Pinyin spelling of Wuhan, Hainan, Wenling etc. - easiest way is to search Wikipedia for the terns and then identify the Pinyin spelling, which is typically pointed out]

      CROSS-CONSULTATION COMMENTS

      I think we (all reviewers) are all largely in agreement - this is a very useful study; the manuscripts just needs various adjustments. I agree with the requests of the other two reviewers.

      Significance

      The strength of the paper is that it provides a road map on how undersampled taxa may be analyzed and which kind of information can be gleaned from these analyses. The paper also demonstrates that the analysis of seemingly "unimportant" viruses can prove important. The limitation of the paper is that there is no true novel revelation here. The sampling sites of WuMV-2 GenBank records already suggest broad distribution, which often goes along with sequence diversity; the continued discovery of orthomyxovirids in metagenomic studies clearly implied undersampling (but it is nice to have this "gut feeling" scientifically fortified now). The paper is useful for evolutionary virologists, virus taxonomists, orthomyxovirid specialists, and invertebrate virologists.

  3. jeffreycwitt.com jeffreycwitt.com
    1. Can you think of a way our social identity (e.g. our identify as “students”, Baltimoreans, Americans, etc.) depends on your understanding of the past?

      The way we identify socially may be viewed entirely differently to different individuals depending on their understanding of the history /past surrounding your identity. You may have one understanding of the past surrounding who you are, for example you may view you identity positively after understanding your history, but someone else may view your social identity negatively because they interpret your past differently, or with bias.

    1. Skip to content Toggle Menu Primary Navigation HomeReadAdminSign out Search in book: Search Want to create or adapt books like this? Learn more about how Pressbooks supports open publishing practices. Book Contents Navigation Contents IntroductionIsrat Jahan Oeeshi and Stefanie Panke I. Instructional Strategies and Engaging Pedagogies 1. Education in EmergenciesMursal Amanzai and Fatima Qasemi2. Problem and Project Based LearningAmena Karimi; Fatima Nasiry; and Zainab Mirzaie3. Educating Bilingual learners in SchoolMahroosa Noori and Masoona Noori4. Design Thinking for Creativity and Innovation at SchoolsZulikha Malekzai II. Teaching and Learning with Technology 5. Educational Technology and Mobile LearningMasturah Pakbin Alizada; Tamana Setayesh; and BIBI LINA AZIZI6. Effective Online LearningMahdia Ahmadi and Fatima Ameri7. Teaching Writing Skills with Blended Learning Approach for Schools' learnersSima Ahmadi and Arezo Sultani8. Open Pedagogy: Collaborative Open Access Textbook DesignStefanie Panke and Israt Jahan Oeeshi III. Inclusion, Wellbeing and Community Building 9. Community Building and Parents CommunicationNILOFAR SHEWA and Roshan Gul Haidari10. Social Emotional Learning and Mental WellbeingAlizeh Sultan; Sawdah Hoque; and Khurshid Arif11. Inter-Group Empathy and Inclusive Learning in Secondary SchoolsFatima Mohammadi and Ritu Tripura Appendix Strong Schools Inclusion, Wellbeing and Community Building 10 Social Emotional Learning and Mental Wellbeing LEARNING OBJECTIVES, Introduction, Components of SEL - 5 components, PILLARS OF SEL - Three Pillars, Childhood Development and SEL, MENTAL WELLBEING AND TRAUMA NAVIGATION/HEALING, Conclusion, Review Questions, Mental Health Check In Activity, Key Terms Alizeh Sultan; Sawdah Hoque; and Khurshid Arif Learning Objectives  After reading this chapter you will be able to Define Social Emotional Learning (SEL) Discuss the importance of SEL in schools. Identify and deal with challenges faced by students and educators in school settings Learn about and apply conflict resolution and mediation. Effectively promote SEL in school Navigate the Teacher-Student-Parent dynamic Utilize SEL to ensure the mental well-being of students. Train stakeholders to increase safety in schools.   Abstract: Social-Emotional Learning and Well-being is not only an emerging research agenda, but also a critical issue concerning the individual as well as societal development, because how the issue is viewed has huge theoretical as well as practical, and even vital implications. In academic, while some argue that social-emotional learning and well-being is in the subjective perception of one’s life or psychological functioning, others argue that social-emotional learning and well-being is in the objective conditions and the broader environment. Many programs have been developed to help schools enhance students’ health and reduce trauma, bullying, violence, and ill-being. How should educators set it up for students in schools? This article describes the importance of social-emotional learning (SEL) in schools, well-being, the best educational practices for applying conflict resolution in schools, mediation in schools, the teacher-student-parent dynamic, and trauma. The SEL framework can be used to guide teachers in schools and parents outside of the schools to deal with problems caused by ill-being (mental illness) trauma and violence, and use mental health (mental well-being) to teach social-emotional learning.   Introduction  Social and emotional learning (SEL) aims to help students better understand their thoughts and emotions, become more self-aware and develop more empathy for others in their community and the larger world, takes these differences into account and helps put all students on an equal footing to succeed. SEL is the process by which children and adults regulate their emotions, set goals, demonstrate empathy, build and maintain healthy relationships, and make constructive choices (CASEL, 2022). Nowadays students frequently meet people for the first time in the classroom who come from various backgrounds, hold different opinions, and have unique abilities. It can be difficult for students to adapt to a new and diverse community, to feel at home there and to show respect and understanding of people with different viewpoints and values. Social and emotional learning (SEL) aims to help students better understand their thoughts and emotions, become more self-aware and develop more empathy for others in their community and the larger world, takes these differences into account and helps put all students on an equal footing to succeed. SEL is the process by which children and adults regulate their emotions, set goals, demonstrate empathy, build and maintain healthy relationships, and make constructive choices (CASEL, 2019). Recently, an increasing number of educational institutions are realizing that competence in socio-emotional development and academic performance are closely intertwined. 7.4% of American children between the ages of 3 and 17 have been diagnosed with a behavioural problem, which means that at least two students in a class of 30 would benefit from positive reinforcement if they had behavioural issues (Banks, 2022). In addition to student education, educational institutions are now regarded as “an important if not central arena for health promotion and primary prevention” (Roeser, 2000). SEL is a methodology that teaches students of all ages how to better understand their emotions, feel them fully, and show empathy for others. These learned behaviours are then applied to assist students in making positive, responsible decisions, developing frameworks for achieving their goals, and developing positive relationships with others. SEL should be included in the school curriculum because it improves the quality of education at schools as it improves the school & class environment, reduces behavioural problems and boosts the academic achievement of students. SEL with an integrated, coordinated approach not just teaches children how to pass exams, but also teaches them how to develop life skills and deal with social challenges. SEL helps students maximize their potential in school, but also throughout their lives. A meta-analysis of 213 school-based, universal social and emotional learning (SEL) programs involving 270,034 kindergartens through high school students found that students significantly improved their social and emotional skills, attitudes, behaviour, and academic performance, resulting in an 11-percentile-point increase in achievement (Durlak et al., 2011). By incorporating SEL into the school curriculum, students will learn how to control their emotions and behaviours in order to reduce stress. Some of these skills can assist children in self-regulating when they are required to do so independently. In order to find solutions, they will sometimes be better able to communicate their thoughts and feelings to others. According to Durlak et al. (2011), the four components of SAFE—active learning, focused activities to develop one or more social skills, sequenced activities that lead to skills in a coordinated and connected way, and explicit targets regarding specific skills—are present in the most successful SEL programs.  Teachers who use SEL curricula ensure that high expectations are communicated to students by eliciting their thoughts, displaying model work, providing specific feedback to spur improvement, and emphasizing that making mistakes is an important part of learning (Paterson, 2021). Social Emotional Learning: What is SEL and Why SEL Matters – Video  Components of Social-Emotional Learning (SEL): There are many components to SEL but most literature states 5 key components. they are – Figure: Five main components of social-emotional learning (MLSD, 2022). Self-Awareness: Self-awareness is a conscious awareness of one’s own strengths, weaknesses, actions, and presence. Self-awareness necessitates a clear understanding of your mental and emotional states. Recognizing your emotions and how they influence your behaviour; recognizing your strengths and weaknesses in order to gain confidence in your abilities. Self-Management:  Self-management consists of setting and achieving goals, as well as taking responsibility for your thoughts, emotions, and actions in various situations. Self-management is built on self-awareness. If students can accurately pinpoint their feelings and how they influence their behaviours, they will be better able to act on them. It can be very empowering to assist them in developing their capacity to manage their emotions and behaviour. Social Awareness:  Social Awareness is the ability to put yourself in the shoes of another person who comes from a different background or culture than you. To act with empathy and integrity in your home, school, and community. Recognizing others, understanding the perspectives of others and empathizing with them, including those from diverse backgrounds, cultures, and contexts, is what social awareness is all about. Relationship Skills:  Relationship skills are the ability to form and maintain healthy relationships with people from various backgrounds. This competency focuses on knowing when to ask for or offer assistance, listening to others, and communicating with them. Students who participate in SEL learn how to handle conflicts in their relationships more effectively, making it easier for them to maintain their friendships (Banks, 2022). Because of their problem-solving abilities, they can work in groups and even enjoy teamwork in the classroom. Responsible Decision-Making: Responsible decision-making refers to the ability to decide how to appropriately act or react in a given situation based on learned behaviours such as ethics, safety, weighing consequences, and the welfare of others as well as yourself. To make responsible decisions, students need to develop critical thinking, open-mindedness, sound judgment, reason, problem-solving, and solution-finding skills. They need to think about others as well as themselves when making decisions.   Other components that are important to keep in mind (especially for school children) are – Stress Management:  People of all ages, particularly children, are affected by trauma and stress. According to a 2014 American Psychological Association study, many teenagers have even higher stress levels than adults. Stress management is tools, strategies, or techniques that help you feel less stressed and lessen the negative effects of stress on your mental or physical health. There are numerous techniques for dealing with stress. These techniques include behavioural, emotional, and mental ones. The first step in assisting your students in overcoming their stress is to comprehend what is causing it. Peer pressure, abuse, and familial expectations can all have an impact on your student’s mental and emotional well-being. Stress management in the classroom necessitates consistent effort. A teacher’s creativity and patience are frequently required. To reduce classroom tension, incorporate these strategies into your lesson plans. Conflict Resolution and Mediation: There will always be disagreements between children, no matter how minor. For example, during recess kids can argue about anything, whether it’s a toy or another student. Conflict resolution techniques can be used in the classroom to keep the peace among the students. Peaceful conflict resolution refers to working through a problem or conflict in an early childhood setting in a way that does not negatively impact anyone involved on a physical, emotional, or social level. When disputes are settled amicably, children can gain confidence in their ability to handle situations and relationships. Teaching young people how to navigate conflict is one of the most important components to facilitate their growth. Even some adults have poor meditation skills and can’t navigate their professional and personal life properly. When we’re so angry that we can’t think straight, it’s difficult to solve a problem. This is why teaching young people from an early age to resolve conflicts and how to overcome their negative emotions and act accordingly is of utmost importance. Although Mediation is frequently described as a win-win process, with a focus on reaching a mutually satisfactory conflict resolution, the research shows that there are additional and possibly larger benefits for students who facilitate mediation as well as the school community as a whole (DeVoogd, 2016). Teaching students to get some perspective and put themself in the other person’s shoes and think beyond just one incident is crucial. It is important to build their listening and problem-solving skills through conflict resolution. Conflict resolution is an important personal skill and in schools, the role of peer mediation in conflict resolution is also important. Mediation is an important method that requires focus, an open mind, and the willingness to compromise. Both parties in a conflict need to give work towards a solution with respect in order to not waste time and reach a satisfactory solution. And having a mediator to help with conflict resolution is a good way to ensure that. The researcher DeVoogd (2016) states that “Student mediators also demonstrate better attendance than non-mediators and report feeling safer and more connected in positive ways to their school, with a sense of belonging.”  This shows that mediation training and being mediators, in general, is useful. But I think we should give more importance to building each individual student’s conflict resolution skill than just having some students in the role of mediators. Teaching students to get some perspective and put themself in the other person’s shoes and think beyond just one incident is crucial. It is important to build their listening and problem-solving skills through conflict resolution.   Here is a video that will explain the importance of SEL  in our Children and Adult life. SUBTOPIC – PILLARS OF SEL – Three Pillars: Social and emotional learning (SEL) is a term that broadly refers to the process through which people learn and put into practice a variety of social, emotional, and associated skills, attitudes, behaviours, and values that assist guide pupils. This involves having ideas, emotions, and behaviours that help people do well in school. But SEL has been described in a number of different ways (Humphrey et al., 2011). In today’s increasingly diverse world, children frequently meet people for the first time in the classroom who come from a variety of different origins, have diverse ideas, and have special talents. Social and emotional learning (SEL) aims to assist students in better understanding their thoughts and emotions, growing in self-awareness, and developing more empathy for others in their community and the wider world in order to take into account these differences and help put all students on an equal footing to succeed. These skills can be fostered in the classroom to help kids become stronger, more effective, self-conscious, and socially aware citizens in the years to come. Learn more about the significance of social-emotional learning and the advantages it has in the classroom and outside of it. Here we discuss the three pillars of the SEL which are Culture, Adult Skills and Curriculum. Culture From a cultural perspective, the kind of skills associated with SEL seems to be based on a theory of emotions that views them as internal, individual states that call for active management control in order to be channelled in socially beneficial, healthy ways. The main focus is on controlling or containing emotions that can “boil over,” leading people to behave irrationally (Lakoff & Kovecses, 1987). The SEL literature frequently suggests verbalization or visualization methods that involve verbalizing feelings, using visualization techniques, or engaging in breathing or counting activities. Children in the primary grades should be able to recognize and appropriately label simple emotions such as sadness, anger, and happiness, according to CASEL’s 2007 assessment of essential skills in emotional identification, labelling, and discussion. Students must “recognize and appropriately describe emotions and how they are linked to action [as well as] use language skills to understand other people’s thoughts and perspectives,” such as being able to employ “I messages” while discussing feelings, according to Illinois state requirements (Illinois State Board of Education, 2006). Another part of connection skills is that “students should be able to describe ways to make and keep friends” (CASEL, 2007). The norms surrounding emotional expression, emotional experience, and emotional regulation are, however, strongly conditioned by culture, as research on emotion in non-Western cultural contexts has long demonstrated (Briggs, 1998; Chao, 1995; Lutz, 1987, 1988; Markus & Kitayama, 1994; Miller, 1982, 1996; Shweder & LeVine, 1984; White, 1987). Not all cultures share the same regulatory or expressive behaviours (like talking) of the White, American middle class, nor do they interpret emotional experience in the same ways (see also Ballenger, 1992). According to Wierzbicka (1994), the Anglo script for emotional expression places a significant focus on behavioural control and the notion that speaking about one’s emotions qualifies as adequate expression, in contrast to other cultural scripts for this purpose (p. 178). She makes a compelling case for the cultural influences on the relationship between emotion and language in ways that directly contradict the universalizing assertions of a lot of psychology research on emotions in cognition. In addition to stressing the need for SEL training to be “culturally relevant, empowering children within their own cultural surroundings,” Denham and Weissberg (2004) also raise the potential that “some SEL categories may be unique to the child’s home culture” (p. 41). Adult Skills Any classroom in the world, from the most basic, without walls, to the most complex, needs good relationships between teachers and students in order for learning to occur. The combination of skills that enables kids to collaborate with others, study effectively and play vital roles in their families, communities, and workplaces is known as social-emotional skills or “emotional intelligence.” According to research, social and emotional learning can be taught to pupils, and their presence in classrooms and schools enhances academic performance. Students are more likely to retain and apply what they are taught when academic and social-emotional learning are both incorporated into the educational process. Additionally, they weave into their education a sense of accountability, compassion, and interest in the welfare of others as well as their own. Thus, learning can be said to affect both the “brain” and the “heart,” which leads to better-run classrooms and motivated students. Therefore, academic and social-emotional learning are intertwined in every school, everywhere. Curriculum SEL can be incorporated into a school’s curriculum even though it is not a defined subject like math or history. Students may be more likely to participate and may be less prone to mentally drift off during their classes when teachers personalize and relate academic topics to them. SEL can have a beneficial lifetime influence by encouraging self-awareness, empathy, and emotions of safety and inclusion in the classroom. SEL is approached from various angles. A more formally specified period of the school day, sometimes taught in homeroom, is devoted to SEL by certain teachers. To help students better understand the SEL basic skills, these lessons are repeated throughout the rest of the school day. In order to foster a sense of community or common ground between students of different ages, teachers may choose to assign students to write or journal about their thoughts and feelings regarding a certain SEL topic. Other teachers incorporate SEL teachings into topics that are more formal, like math, history, or literature. As an example of SEL in action, assigning a group project where students self-delegate roles to work together for the benefit of the group, having students role-play historical figures to comprehend the motivations behind their actions, or having students conduct formal interviews with one another to gauge current events are all examples of SEL in action. (“What is social-emotional learning (SEL): Why it matters,” 2022) What is social-emotional learning (SEL): Why it matters. (2022, August 17). National University. https://www.nu.edu/blog/social-emotional-learning-sel-why-it-matters-for-educators/ SUBTOPIC – Childhood Development and SEL SEL in the context of daily classroom instruction includes daily check-ins with students, embedded SEL content in the Reading and English Language Arts, and general awareness of the social well-being of students in their virtual/classroom environment (even if only with a simple greeting enquiring about their emotional state upon arrival to the class or an enquiry regarding the previous evening). According to a study by Babalis et al. (2013), SEL definitely affects primary school pupils’ emotional competence and academic achievement. Another study by Cook (2014) found that school culture and practices hinder students’ academic achievement. This suggests that, in the absence of SEL institutional practices and curriculum-supported content, students would struggle academically because their social and emotional needs are not met in the classroom. This study found that English language learners have been more negatively impacted by the absence of SEL-supported surroundings and material in schools. In addition to SEL having an effect on these learners’ academic achievement, additional factors include acculturation difficulties, encountering racism and discrimination, and poverty (as described in Benner & Graham, 2011) also negatively affect students’ academic performance. SUBTOPIC –  Mental Wellbeing and Trauma Navigation/Healing Wellbeing, Well being or Well-being  According to the Merriam-Webster Dictionary, the correct term is well-being, not wellbeing or well being, and it is hyphenated. The noun “well-being” is defined as “the state of being healthy, happy, or prosperous.” Being can be a verb or a noun, whereas well is an adjective. To create a noun, a hyphen must be added. Now, what is the difference between mental health and mental well-being? While we wish that was the case, the definition of mental well-being does not imply that life is trouble-free. Instead, it indicates that you are equipped to deal with whatever life throws at you. These are abilities that you can pick up and hone, making your mental health better today than it was yesterday  (Slade, 2010). On the other hand, your mental well-being is your state of mind. It resembles physical health in many ways, but only with regard to your mind. It fluctuates daily, just like your physical health does. In general, your mental health is influenced by your experiences, your environment, your relationships, and the strength of the community in which you live. You need to be as healthy as possible on both counts because your physical and mental health is closely related (Lawrence, et al., 2017). [Video podcast 1] (Mental Health & Emotional Well-being ,2022) Finally, mental well-being can be mental health but mental health cannot be mental well-being, because they are related but both of them are independent ( Wheeler, 2021). Definition and Meaning of Well-being Well-being is often described as the state of being comfortable, healthy or happy. A feeling of health and vitality that results from your thoughts, emotions, actions, and experiences is referred to as well-being. When we are in a state of well-being, we frequently feel joyful, healthy, socially connected, and purposeful (Lawrence, et al., 2017). These skills include:  Self-fulfilment The realization that you are part of something bigger  Ability to care for yourself independently Identifying and employing character strengths Accurate perception of reality  Desire to learn new skills  Emotional resilience  Interested in the world around you Recognizing and staying true to your values Forming and maintaining healthy relationships Having a sense of hope Understanding that happiness comes from within Being determined Taking action to improve your life [Video podcast 2] As ill-being is a lack of prosperity, happiness, or health. But well-being is often described as the state of being comfortable, healthy or happy. Well-being is completely the opposite of ill-being (Headey, Holmström , & Wearing , 1984). for more information, look at the figure. Why do we have to pay attention to the well-being of children in schools? Schools have a responsibility to consider ways to enhance the educational process while also paying attention to the students by safeguarding their mental health, especially well-being. Well-being is a useful strategy for enhancing the health of school-aged children. “The fact that we as leaders have been entrusted with the most priceless resource in the world—children—is the most vital reason to care about wellbeing. Every student, regardless of how they frustrate, perplex, delight, or impress us, has inherent value and potential. Because we genuinely care about the students entrusted to us, are aware of our moral duty to care for those under our stewardship, and want what is best for them both now and in the future, we should therefore place a priority on well-being” (Dewey, 1897, p. 78). How to apply Well-being in schools?  These techniques can be used to implement well-being in schools: Mental well-being training: To combat ill-being, educational facilities can offer teachers training courses. It is crucial to create a place for specialized mental health professionals in educational settings. However, all teachers must believe they have received the necessary instruction and encouragement to help their students with their mental well-being. Implement mental health, and well-being into the curriculum:  The influence of mental health and well-being on a person’s quality of life must be emphasized to students. Students’ increased mental health literacy will increase their personal awareness of particular problems. This can be accomplished by including specialized lessons on mental health and relationship education in school curricula. Promote healthy eating: Both students and staff must eat healthfully. by including lessons on healthy eating in the curriculum, starting cooking groups, and providing nutritious food on school grounds and university campuses. Encourage students and staff to stay hydrated: Drinking enough water every day is vital for both mental and physical health.                                                                                     What Is Trauma? Trauma is an important phenomenon these days among students, teachers, and parents, so firstly what does it mean? We can say, trauma is a mental sickness, it should be therapy as soon as possible. There are several methods to teach how to remove trauma from your society. Some experiences that can be bad accidents or bad actions in your life are called Trauma. A traumatic accident is because of bad that happened in the past, like: when a person is driving, one day he has an accident and after that, it has a bad effect on his morale and he can’t continue driving. (Escudero & Wong – RAND Corporation, 2001). Trauma is an emotional and normal response to miserable events such as violence, abuse, losing close relatives, conflict, and natural disasters. Trauma can be acute, chronic, and complex based on the types of experienced events. Everybody may underlie trauma in their life. But children are the most vulnerable part who are adversely affected by trauma. Parents and communities are responsible to facilitate their children with the basic concept of stress, anxiety, and trauma and letting them know that it is a normal reaction to different occurrences. Children need to be able to identify their reactions toward stress and release their stress by different methods which need to be taught to them in schools. So, providing mental and emotional health support related to children and youth cognitive behaviours is a significant issue that is required to be considered in schools’ curricula. Adding emotional and psychosocial support subject to the schools has a huge impact on the mental and physical well-being of students, making them more resilient, confident, innovative, and critical thinking (Escudero & Wong-RAND Corporation, 2001). According to the sources that have been provided in the reading sections in relation to the cognitive behaviour of children. There are many factors that impact the well-being of children even during adolescence; like safe, protective, and healthy environments especially in schools, social and emotional supports, and strategies on how to overcome stressful and traumatic events. One main reason a student is passive in learning, making relationships, and behaving properly is adverse childhood experiences such as neglect, war, domestic violence, and harassment  (Slade, 2010). Studies presented that children, youth, and teachers who trained with the emotional and psycho-social program had a significant decrease in their stress and trauma. They are empowered, confident, concentrated, optimistic, able to make relationships and enhance their ability to take part in the communities’ decision-making.  All the development programs for releasing tensions and stress are implemented in advanced countries’ schools with the main target groups of students, teachers, and parents to raise their skills in coping with stress, shifting their mindset into restorative practices, mindful breathing, and enabling them to better self-manage resources. All the 15 development skills programs validate the Whitaker Peace & Development Initiatives (WPDI) program that the major focus is the psychological, social, and emotional well-being of children as well as amplify the knowledge of parents on how to communicate friendly with their children, to identify their problems and provide the solution for them (Escudero & Wong – RAND Corporation, 2001). Students from developing countries are adversely affected by mental and psycho-social health problems. As an example, in Afghanistan students every day underlie conflict, child labour, early forced marriages and violence. As a result, they are either aggressive or isolated. They lose their self-esteem, concentrate on certain issues, and lose hope for the future (Ibrahim et al, 2020). Therefore, there is a dire need of adding emotional and social support subjects into Afghan schools’ curricula. Adding psycho-social support subjects in schools would help students with mental health problems, who are affected by ongoing conflict and disaster. In my country, students need special education and psychological and emotional support. Unfortunately, these important subjects have never been addressed in our curriculum at schools (Ibrahim et al, 2020). That’s why millions of students, in spite of endless attempts, could not get promoted at schools, and this resulted in a high drop-out rate in Afghanistan. Sadly, girls’ drop-out rate from school is much higher than boys’ due to many cultural reasons that do not let adult girls have access to education facilities. How to deal with Trauma? In addition to the previously mentioned methods of conflict resolution and mediation:  By Speaking with therapists or social workers. These are excellent resources for more information about identifying and comprehending the effects of trauma in addition to giving specific information about your students. Ensure order and consistency. On the board, write the agenda. Use exit and entry procedures. A student may feel more secure if she knows what to expect. warnings before changes in activity. If you’re going to do something unexpected like turn off the lights or make a loud noise, let someone know in advance. Develop their skills and passions. To support a positive self-concept, concentrate on one area of competence and promote its growth. Create a backup plan. Make it possible for a student to leave the classroom if she becomes agitated or overwhelmed. Set aside a location inside or outside the school so that you will know where to look for her if she needs to calm down or take a sensory break. You can also give a student access to a box or kit of sensory-calming equipment (Silly Putty, coloring, puzzles). Show them how to look after themselves. One of the most crucial things to keep in mind (Venet, 2014). Trauma causes challenging behavior There is proof that trauma exposure impairs the stress response system’s ability to regulate itself, which can result in impulsivity and poor emotional control (Tarullo & Gunnar, 2006; Bright & Thompson, 2018). Young people who have experienced trauma are therefore more likely to exhibit internalizing or externalizing behavioral issues in response to subsequent stressful events (Wilton, 2020). Conclusion: Good mental health is the key to living a good life and social-emotional learning is important for constructing and maintaining good mental health. Teaching young kids how to handle mental stress and navigate all the ups and downs of life is essential and including SEL in the school curriculum is the best way to make sure that young kids are learning the skills to take care of their mental well-being. Integrating SEL into the school curriculum is essential but we have to also be careful in how we implement it. The teacher has to be properly trained so they don’t accidentally turn any issues worse. There also has to be a balance between collective and individual well-being. And to achieve the best outcome both school teachers and staff need to be trained properly. Review Questions:  Think about the class you are in or teaching, do you/or your school incorporate SEL in the curriculum? If SEL is incorporated is there any gap? How can the curriculum be improved to maximize the benefit of SEL and how do you think SEL should be modified for your context?   Mental Health Check In Activity – Organize Mental  Health Awareness session / Play Mental Health Management Bingo with students. Introduce your topic Know Your Feelings and  Mental Health Management Bingo Map out how the activity will work and its steps for the students/teachers. Students will explain and draw 3 of their coping mechanisms for stress or another mental health issue. We can put all of the mechanisms on a bingo board and play MHM BINGO. To play, students require a copy of each sheet and a pencil, and each Bingo worksheet will contain a list of positive coping mechanisms that are related to maintaining good mental health. Students can check on each box as the teacher/facilitator explains and the class as a whole discusses each coping mechanism, its benefits and other implications. It’s easy for students to play, and just as easy for teachers or parents to join in! You can find a summary and some complimentary resources & information about our Chapter here. Here is a podcast episode for you where we talk about SEL with Expert Psychologist Nabila Afroz from Asian University for Women – Audio Playerhttps://pressbooks.pub/app/uploads/sites/2449/2022/09/SEL_074255.m4a00:0000:0000:00Use Up/Down Arrow keys to increase or decrease volume.   Earn A Badge:   You can get a micro-credential after finishing this chapter. All you have to do is write a review on this chapter, and answer the review questions. Our strong school team will issue you a badge through Badgr. Email sawdah.hoque@gmail.com if you wanna earn a badge for this chapter.     Key Terms:  Trauma: Is the response to a deeply distressing or disturbing event that overwhelms an individual’s ability to cope, causes feelings of helplessness, diminishes their sense of self and their ability to feel a full range of emotions and experiences Conflict: mental struggle resulting from incompatible or opposing needs, drives, wishes, or external or internal demands Conflict resolution: It is conceptualized as the methods and processes involved in facilitating the peaceful ending of conflict and retribution SEL(Social Emotional Learning): Social–emotional learning is an educational method that aims to foster social and emotional skills within school curricula. Mediation: intervention in a dispute in order to resolve it Maslow’s model: Maslow’s hierarchy of needs is a theory by Abraham Maslow, which puts forward that people are motivated by five basic categories of needs: physiological, safety, love, esteem, and self-actualization. Classroom Management: refers to the wide variety of skills and techniques that teachers use to keep students organized, orderly, focused, attentive, on task, and academically productive during a class. Reference: Collaborative for Academic, Social, and Emotional Learning. (2019). What is SEL? Retrieved from https://casel.org/what-is-sel Committee for Children (2020). Building a foundation of success. Retrieved from https://www.cfchildren.org/what-is-social-emotional-learning/schools/ Banks, A. (2022), 6 benefits of social and emotional learning in the classroom. Insights to Behavior. Retrieved October 25, 2022, from https://insightstobehavior.com/blog/6-benefits-social-emotional-learning-classroom/ DeVoogd, K., Lane-Garon, P. and Kralowec, C.A. (2016), Direct Instruction and Guided Practice Matter in Conflict Resolution and Social-Emotional Learning. Conflict Resolution Quarterly, 33: 279-296. https://doi.org/10.1002/crq.21156 Dewey, J. (1987). My pedagogic creed. The School Journal, 54(3), 77–80. Durlak, J. A., Weissberg, R. P., Dymnicki, A. B., Taylor, R. D., & Schellinger, K. B. (2011). The impact of enhancing students’ social and emotional learning: a meta-analysis of school-based universal interventions. Child development, 82(1), 405–432. https://doi.org/10.1111/j.1467-8624.2010.01564.x Lawrence, C., Sajni, G., Pierre, H., Alison, R., Bruce, A., & Dora, M. (2017). Health and Wellbeing. Esearch Gate, 236-254. MLSD. (2022). Social-emotional learning. [Infographic].  Social-Emotional Learning – Mental Wellness – Medical Lake School District. Retrieved January 1, 2023, from https://www.mlsd.org/apps/pages/index.jsp?uREC_ID=1726968&type=d&pREC_ID=1897652 Paterson, J. (2021). Three principles for using SEL in the classroom. NEA. Retrieved October 24, 2022, from https://www.nea.org/advocating-for-change/new-from-nea/three-principles-using-sel-classroom Roeser, R. W., Eccles, J. S., & Sameroff, A. J. (2000). School as a context of early adolescents’ academic and social-emotional development: A summary of research findings. The elementary school journal, 100(5), 443-471. Slade, M. (2010). Mental illness and well-being: The central importance of positive psychology and recovery approaches. BMC health services research, 1-14. Venet, A. S. (2014, September 14). 8 Ways to Support Students Who Experience Trauma. Retrieved from edutopia: https://www.edutopia.org/discussion/8-ways-support-students-who-experience-trauma WHEELER, K. (2021, June 17). Well-being. Retrieved from HappiFul: https://happiful. Wilton, J. (2020). Trauma, challenging behaviour and restrictive interventions in schools. Centre for Mental Health, 1-24. About the Authors name: Alizeh Sultan institution: Asian University for Women My name is Alizeh Sultan. I am an Afghan girl who was born in 1996 in Maidan Wardak, Afghanistan’s Provence. Kabul University is where I earned my bachelor’s degree in journalism. I have some experience in journalism. During my time at university, I did some work-study. In Nai, I worked as a reporter, announcer, and program manager. I was working as an interviewer for PHC (Pearl Horizon Consulting) with some powerful women in Afghanistan. I was also a member of IWA (Integrity Watch of Afghanistan) for about two years. I observed the teacher’s teaching method, the school environment, the curriculum, and the student’s school situations. During my service, I enjoyed being a part of Afghanistan’s educational system. name: Sawdah Hoque Sawdah Rubai Bente Hoque is currently a graduate student at Asian University for Women (AUW) pursuing a degree of MA in Education. She completed her Bachelor’s at AUW majoring in Environmental Science in 2021. Born in Chittagong, Bangladesh, Sawdah wishes to work on reforming the Education System of Bangladesh, raising awareness about Mental Health, and creating an inclusive curriculum that ensures students’ social-emotional well-being. name: Khurshid Arif institution: AUW This is Khurshid Arif and she is originally from Afghanistan, Ghazni province but she grew up and did her studies in Balkh province, Mazar-e-Sharif city. She has graduated from Balkh University, faculty of Mining and Environment Engineering department of Petroleum Engineering. During her studies she participated in many leadership program like AWDP( Afghanistan Workforce Development Program) and WLD( Women’s Leadership Development) which she learned various methods to improve her leadership skills. After her graduation, since she was one of the top students at the university, she started her work as an Assistant professor at Balkh University. This journey last only for about one year and she lost her job because of Covid19. Then she started her new job at Afghan-Turk Maarif Girls High school as mathematics teacher. She had worked at Afghan-Turk Maarif girls’ high school for one year and then she applied to the MA program in Bangladesh and got selected and now she is doing her Masters there. These experiences teach her a lot and she is very happy about it .For instance She learned how to interact with her students, how to learn from them, how to be flexible and how to deal with many conflict while teaching students. Edit Previous/next navigation Previous: Community Building and Parents Communication Next: Inter-Group Empathy and Inclusive Learning in Secondary Schools Back to top License Strong Schools Copyright © by Alizeh Sultan; Sawdah Hoque; and Khurshid Arif. All Rights Reserved. Share This Book Share on Twitter Powered by Pressbooks Guides and Tutorials |Pressbooks Directory |Contact Pressbooks on YouTube Pressbooks on Twitter

      Dear Alizeh, Sawdeh and Khurshid, I read your chapter twice carefully and I hope I have provided constructive feedbacks. In general, I can point out that while I was reading, I noticed your countless efforts in writing this chapter. Stay successful and talented.

    1. Tis but fortune, all is fortune. Maria once told me she did affect me, and I have heard herself come thus near, that should she fancy, it should be25 one of my complexion. Besides, she uses me with a more exalted respect than anyone else that follows her. What should I think on ’t?

      Malvolio, despite being a character who should not be trifled with, falls for a poorly planned prank, first and foremost due to his desire for power (which is evident in his verbal juggling), which he believes will only come to him through his marriage to Olivia. Additionally, because Shakespeare wanted to mock the Puritan tendencies of the time, he purposefully created Malvolio to fall for this prank in this scene. A typical set of issues are brought up by the practical prank played on Malvolio, including identity instability, the significance of clothing in establishing one's identity and place, and the illusions and delusions we allow ourselves to fall into in the name of love. Malvolio succumbs to the seduction of romance just like everyone else, including Orsino and Viola. He is as romantic as anyone, despite his outward puritanism, albeit his idea of wedlocking Olivia is motivated more by social aspiration than by love. Malvolio's self-delusion is caused by his desire to surpass his class, but it also helps to explain why Sir Toby and the others find his fantasy so absurd. Malvolio is not a good match for Olivia due to both his undesirable demeanour and the fact that he lacks aristocratic blood. He is an ordinary person, whereas Olivia is a lady. They find it offensive that Malvolio would envisage Olivia and him getting married. We may remember how intrigued Olivia is when she learns from young Cesario, on whom she has a crush, that he is a "gentleman"—meaning that he is of noble birth—in an earlier scene. A noblewoman marrying a lower-class man would have been quite unusual in the class system of Shakespeare's day

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      All the conclusions are based on solid evidence and convincing, and the methodology are in detail to follow or repeat. The writing of the manuscript is logical and easy to follow.

      We thank the reviewer for these comments

      1. The mutation experiments indicated that nkd enhanced the phenotype of scr, but there is no leaf phenotype variation in nkd muations, this is some way difficult to understand, it would much better if the authors can give much more explanation in the discussion.

      We have added more discussion on this point. One possibility is that collectively the four genes function redundantly, however, due to the transcriptional negative feedback loop discovered here (Figure 3B), when NKD genes are mutated then SCR expression is enhanced, hence phenotypic perturbations are less likely to be observed than when SCR genes are mutated.

      2.The word green millet in the first paragraph should be changed to green foxtail. Millet means domesticated small cereal grains, such as foxtail millet, finger millet, proso millet etc.

      We thank the reviewer for this feedback and have made the suggested change.

      Reviewer #1 (Significance (Required)):

      The manuscript, which titled Mutations in NAKED-ENDOSPERM IDD genes reveal functional interactions with SCARECROW and a maternal influence on leaf patterning in C4 grasses by Hughes et al., first reported that SCR works regulating both leaf inner pattern and epidermal stomatal patterning in the C4 model plant green foxtail. The functional difference of this gene in Setaria from that in maize and rice indicated that the inner leaf cell patterning regulation of SCR is not a characteristic of C4 Species; this gave us insight understanding of the complex of C4 leaf cell patterning. In addition to this important discover, the authors found that mutations in NKD IDD genes enhance loss of function scr phenotypes in the leaves of C4 maize and Setaria but not in the C3 rice, indicating NKD IDD was involved in the leaf cell patterning in C4 species, but no in C3. They also identified a maternal effect on cell-type patterning in maize leaves that are initiated during embryogenesis.

      We thank the reviewer for their kind comments and suggestions.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The leaf anatomy that distinguishes C4 from C3 plants has been known for decades, with veins in C4 plants separated by 1 to 3 (generally 2) mesophyll cells whereas those in C3 plants are considerably farther apart. This anatomical pattern appears to be critical for the function of the C4 pathway, which under some environmental conditions is a more efficient way to fix carbon than the C3 pathway. Despite the obvious importance of close vein spacing, the genetic mechanisms that control it have been surprisingly difficult to untangle. The statement on the bottom on p. 2 ("To date, very few regulators of cell-patterning in inner leaf tissues have been identified...") is an understatement. The paper by Hughes et al. offers a major step in uncovering the basis of C4 vein spacing.

      We thank the reviewer for this feedback and agree that this work represents a major step forward in understanding C4 vein spacing.

      The authors build on their previous work in Scarecrow-like proteins in maize and rice. In maize, SCR controls patterning of the mesophyll, whereas in rice it controls development of stomata. This paper pursues the possibility that the differences in SCR roles may have to do with interacting proteins. Based on work in Arabidopsis the authors focus on proteins with an indeterminate domain (IDD) and specifically on the NAKED ENDOSPERM genes.

      The paper presents an analysis of an impressive set of mutants in three species. A major step in this paper is the comparison among three species of grasses - maize, rice, and Setaria - rather than the more common two species, usually maize and rice. Maize and rice differ in photosynthetic pathway but they also differ in many other traits that reflect the ca. 50 million years since their last common ancestor. Setaria is, like maize, C4 and the two species are more closely related to each other than either is to rice, although they represent two independent acquisitions of C4. This paper shows that SCR orthologs control stomatal patterning in both rice and Setaria implying that the stomatal function of SCR may be ancestral in the grasses and also is not directly connected to photosynthetic pathway.

      The availability of allelic combinations of SCR and NKD in maize in particular permits the inference of possible maternal effect on the vein spacing phenotype, although exactly how this happens remains unclear.

      The discussion provides a careful and logical assessment of the state of knowledge on SCR and IDD proteins in general, and the new data on SCR and NKD in particular. Many questions remain unresolved, and many additional experiments could be suggested. However, the power of the genetics and the phenotypic analysis together provide a novel direction for research on vein spacing. I will refrain in this review from suggesting what additional information would be nice to have since I think a review should assess the quality of the paper as it stands, not as it could be with months more of work.

      My only really substantive suggestion is that the micrographs of the Setaria leaves need to be improved. Specifically, in Figure 6E it is hard to see the details of the fused veins. Either the section is too thick or the camera was not focused properly. Because this image in particular is central to the entire paper I would recommend aiming for the clarity of the images of Zea cross sections, which are fine.

      We thank the reviewer for this suggestion. Obtaining leaf cross section micrographs from the Setaria scr1;scr2;nkd mutants was extremely challenging as the growth phenotype is so severe (Figure 5), meaning that the available leaves are small and extremely fragile. Multiple attempts to fix and section leaves using a microtome failed, with leaves consistently collapsing. In our hands, Setaria is not as amenable to fresh vibratome sectioning as maize, and combined with the additional challenges of handling the tiny triple mutant leaves mean that the resultant images are not of the same quality as the maize figures. We have included a supplemental figure (Figure S8) with additional examples of fused veins identified in our screening.

      Very minor point: p. 3 - "double Zmscr1;Zmscr1h mutants" - what does the "h" in Zmscr1h refer to?

      In this context h refers to this gene being a homeologous gene duplicate, as first explained in Hughes et al. (2019). We have included an explanation in the revision.

      Reviewer #2 (Significance (Required)):

      Strengths of the paper are 1) the inclusion of three species to help determine which aspects of the gene function may be ascribed to C4; 2) thoughtful and comprehensive genetic analysis; 3) careful sections of leaves; 4) outlines of the limitations of the approach. Limitations (several of which the authors acknowledge in the Discussion) include a general lack of molecular genetic data (protein interactions, DNA binding sites, RNA-seq, etc.). While this information would be great to have, I think the strength of the genetics is such that the paper will be foundational for future work in any case. The one bit of additional data that would be ideal would be information bearing on the two mechanistic hypotheses laid out on p. 10. The model that SCR and NKD promote cell division and specify mesophyll identity is the opposite of the model that SCR and NKD inhibit vein formation. An experiment that helped point the reader toward one or the other of these models would be very valuable.

      We agree that an experiment that could distinguish these possibilities would be extremely valuable, and will undoubtedly be the subject of future experimentation.

      The paper fills a critical gap. Little to nothing is known about how the internal anatomy of leaves is patterned and the data presented provide evidence that SCR and NKD are two important players. The paper also provides a conceptual advance in offering a couple of genes and some plausible mechanisms of how they might function.

      The audience will be primarily developmental geneticists and physiologists. The paper addresses an important problem that is of broad interest to developmental biologists and is potentially important for global agriculture.

      We thank the reviewer for their kind comments and suggestions.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript of Hughes et al. aimed to demonstrate the functional interactions between Naked-Endosperm IDD genes and the transcription factor SCARECROW and a maternal effect on leaf patterning in C4 grasses. To this end, the authors conducted a greenhouse and labor experiment to create mutants of related genes and assess the expression of these genes through qRT-PCR combined with fluorescence microscopic images in Rice, Maize, and Setaria. They found an increase in the proportion of fused veins with no intervening mesophyll cells in scr;nkd mutants in C4 species (Maize and Setaria) but not in C3 species (rice). In the end, they revealed a maternal effect of derived NKD on patterning cells in leaf primordia during embryogenesis.

      Major comments - Optional: the authors should have conducted a whole transcriptome experiment through RNA-seq on the mutants as compared to the controls to check if these genes were significantly up-related followed by qRT-PCR for validation. By doing so, the authors should be able to get a broad overview of all key plays involved in leaf patterning.

      We agree with the reviewer that it would be useful to have this data, and such an approach will undoubtedly inform future research.

      • Optional: although the authors may evoke the statistical significance of observing fused veins in mutants sr;nkd, the presence of fused veins in one mutant Svscr1;Svscr2 and Zmscr1-m2;Zmscr1h-m1 may contradict the claim that the authors made regarding the association between scr and nkd. Moreover, the sampling size is not also large enough to draw a substantial conclusion.

      We disagree with the reviewer that our sampling size is not large enough to draw a substantial conclusion. In maize we surveyed 11 quadruple mutants and 588 veins. Although this phenotype is occasionally seen in Zmscr1;Zmscr1h mutants, it is far more penetrant in Zmscr1;Zmscr1h;Zmnkd1;Zmnkd2 quadruple mutants and easily distinguished by eye when viewing each mutant, the statistical analysis only serves to make this point. In Setaria we agree that the differences are less stark, and the sampling size is necessarily lower due to the challenges of working with the triple mutant leaves which are extremely small and fragile (far more so than the maize quadruple mutant leaves). We have already included discussion as to why the phenotype may be less penetrant in setaria. Together we think that the fact the direction of the phenotype matches that of maize is convincing evidence that the increase in fused veins is a real consequence of combining the scr and nkd mutations.

      • There are two copies of nkd in maize but only one copy in rice and Setaria. Does the presence of two copies in maize has any evolutionary or functional meaning? Does the presence and absence of one or two copies has any effect on leaf patterning? It would be interesting to discuss this in the discussion section.

      We thank the reviewer for this comment and have added discussion of this in the manuscript. This situation is common in maize, which underwent a more recent whole genome duplication since its divergence from rice and setaria. Most of these gene-pairs function redundantly, however, there is evidence of functional divergence in terms of expression in some gene-pairs. We have added a sentence in the results explaining why we think the presence of two NKD gene copies in maize is unlikely to have functional significance in this case.

      • The methods section is not easy to read for a non-specialized audience. I suggest providing an explanation of the abbreviations used to describe mutants.

      We thank the reviewer for this suggestion and have made the suggested change.

      • For the results section, you should provide a table summarizing the differences between mutants and controls regarding the leaf structure.

      We have added such a table at the end of the results section and referred to it in the discussion.

      Minor comments: - "Zmscr1-m2;Zmscr1h-m1 seed were" seeds instead

      We have made the suggested change.

      • "Loss of NKD gene function enhances SCR mutant phenotypes in maize and setaria" This section is confusing because several perturbations were observed in triple mutants of Setaria and quadruple mutants of Maize as compared to their double mutants (Svscr1;Svscr2 and Zmscr1;Zmscr1h). You should rewrite this subtitle for clarity.

      We have changed this sub title to read “In maize and setaria, but not in rice, nkd loss of function mutations enhance scr mutant phenotypes”

      • "The accumulation of transcripts in the ground meristem cells" How do you estimate the accumulation of transcripts? What do you mean by the accumulation of transcripts? What do you consider transcripts?

      We use this term as opposed to ‘gene expression in the ground meristem cells’ because we do not know whether the presence/absence/level of detectable RNA is regulated by transcriptional or post-transcriptional mechanisms.

      Reviewer #3 (Significance (Required)):

      The manuscript of Hughes et al. is very interesting in the context of C4 photosynthesis research because there are many transcription factor candidates involved in the development of C4 leaf anatomy but few of them have been validated. However, a whole comparative transcriptome of mutants and controls should provide a broad overview and probably new insight into key players involved in leaf patterning.

      We agree with the reviewer that this would be of great interest, but we feel it is beyond the scope of this study and will be a productive avenue of future research.

      This study goes far beyond the simple validation by outlining the potential interactions between transcription factors. The authors made a substantial effort by combining gene expression results with visual data that strengthen the quality of this manuscript. Therefore, this manuscript is very interesting for the C4 research communities and for the field of developmental biology.

      We thank the reviewer for their kind comments and suggestions.

      A plant biologist working on the evolution and regulation of morphological characters using transcriptomics and genomics.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Major comments: The main conclusions of this work are that promoters of the different classes of genes display differing usage of GTFs and cofactors to promote transcription and likely recruit polymerase by different mechanisms. The in vivo experiments using factor depletion offer strong evidence that certain factors including TBP/TRF2 are differentially required for transcription at the housekeeping/developmental gene classes. The in-depth analysis of different promoter types combined with the genetic approaches outlined above provide compelling mechanistic insights into promoter-specific engagement of regulatory factors. In general, the data supports the authors' suggestions.

      One important shortcoming of these experiments is in the in-vitro DNA binding analysis of GTFs at differing core promoter contexts. The lack of GTFs binding to the housekeeping promoters may be a reflection of low intrinsic transcription activity. If the housekeeping promoters don't assemble active transcription complexes in this in vitro system but the developmentally-regulated promoters do, then a simple comparison of proteins bound to each promoter type is potentially misleading as to the factors required for transcription. For example, results of the in-vivo analysis suggest that the +1 nucleosome is an important factor in the positioning of the transcription start site at housekeeping promoters, therefore the use of chromatinized templates rather than naked DNA would likely better reflect the intrinsic binding properties of factors at promoters.

      We thank the reviewer for highlighting that the in vivo experiments constitute strong evidence for the differential requirements of certain factors at different promoter types and that our work provides compelling mechanistic insights into promoter-specific engagement of regulatory factors. We are also grateful to the reviewer for pointing out that we had not sufficiently clearly explained the aim and rationale of the initial in vitro DNA binding analyses (Figures 1 & 2). These which were not meant to assess different factor requirements but to assess if short core-promoter DNA is sufficient recruit transcription-related proteins, as had been reported for TATA promoters, and whether different core-promoter types differ in this ability. We therefore based the in vitro DNA binding assays on the fact that 121bp-short TATA core-promoter DNA is able to recruit and assemble the PIC even in the absence of activators, i.e. when the core promoters are transcriptionally inactive, and assayed all other core-promoter types under identical conditions. Interestingly, while the TATA core promoters enrich for canonical PIC components as expected, housekeeping promoter DNA does not, suggesting that the core-promoter DNA fragments’ abilities to recruit and assemble the PIC differs.

      We agree with the reviewer that one could possibly find conditions in which the different promoter types are active in vitro, e.g. by providing activators or chromatinized templates, and we hope that our explanations above clarify why this has not been the goal of these analyses. As the reviewer pointed out, we assay functional requirements of various TFs and GTFs in vivo in the remainder of the manuscript. We revised the manuscript to improve clarify the aim and scope of these sections (pages 4-9) and are grateful to the author for allowing a discussion of this topic as alternative (see below), many thanks

      One way to address this issue is to test transcription activity of the promoters used in the mass spec analysis. After incubation of promoters with extract, add NTPs and quantitate the basal transcription activity of each type of promoter. If they are the ~same - great. If not, at a minimum, the authors need to acknowledge this as a limitation of the study. The suggested transcription experiment is a simple extension of the work already completed.

      As outlined above, we deliberately assay all core promoter types under identical conditions, such that differences in protein binding reflect the different DNA fragments distinct functional properties. Please also note that while all core-promoter fragments are transcriptionally inactive, they can be activated by input from a strong enhancer (please see Supplementary Figure 2C; housekeeping and developmental core promoters can be induced to comparable levels, and thus weaker binding of GTFs to housekeeping promoters is not a reflection of weaker inducibility or activity).

      We note that all statements and claims are strictly in line of what we tested, namely the core promoter DNA’s ability to recruit transcription-related proteins in vitro. However, we agree with the reviewer that the notion that the core promoters are assayed under identical conditions but are not active is important and discuss it in the main text (pages 8 – 9) and the ‘limitations of this study’ section.

      The authors suggest from the depletion experiments of TBP/TRF2 that the factors are functionally redundant since the level of transcription for target genes recovers after prolonged depletion, though there is not specific functional evidence to support this claim. A suggested experiment to test the functional redundancy of TBP/TRF2 at subsets of genes is to assess the levels of proteins and/or protein binding to promoters after factor depletion. For instance, is there a global upregulation/stabilization of TBP after TRF2 depletion? Or is there an increase in TBP binding at promoters? These can be addressed by western blot for overall protein levels and ChIP-seq or similar method to assess binding to promoters, which are fairly straightforward experiments given that the cells lines have already been produced.

      We thank the reviewer for suggesting potential compensatory mechanism regarding the redundancy of TBP and TRF2 at a subset of tested promoters. To address the question regarding the stability of TBP or TRF2 in the absence of one or the other, we have performed label-free quantitative mass spectrometry on the TRF2-AID cell line and examined TBP levels (Supplementary Figure 4E). We do not see a stabilization of TBP upon the depletion of TRF2 with auxin. The apparent functional redundancy (e.g. Fig. 4J) thus indeed suggests that there might be increased TBP binding. Unfortunately, we are not able to directly test this experimentally due to a lack of resources. We now add a discussion of the potential compensatory mechanisms to the main text (page 14), many thanks.

      A discussion would be appreciated on the generality of the suggested mechanism in metazoans. For example, is DREF conserved only in insects but could other eukaryotes use a similar mechanism at housekeeping genes?

      We agree that some of the specific TFs don’t have one-to-one orthologs outside insects, yet that other prominent features of Drosophila housekeeping promoters are shared more widely. We now discuss the parallel between dispersed patterns of initiation at different promoter types across species, including Drosophila housekeeping and vertebrate CpG island promoters. We also provide an outlook towards future functional, biochemical and structural studies that might reveal more diverse transcription initiation mechanisms at the different promoter types in our genomes (pages 23-24).

      Minor comments: The manuscript is very difficult to read. One major problem is the large number of figures - many of which are not essential for understanding the results. I strongly suggest that the authors think carefully about which figures to include in the manuscript and keep only the most important.

      We agree that the manuscript is complex with six main figures and several different approaches, including biochemistry and mass spectrometry but also genomics and bioinformatics. In addition, the manuscript includes in vitro tests of DNA-protein binding and in vivo assays to probe functional requirement (by depletion) and sufficiency (by recruitment). These different assays assess different properties and complement and validate each other, which is why we feel they are required. We hope that the clarification of the different aspects and their purpose makes the manuscript more easily accessible, many thanks.

      Second, the legends on many of the graphs are very tiny and difficult to read.

      We have revised the figures to improve font size and readability of the figures, many thanks.

      Third, it would greatly help readability if the main figures and legends were imbedded in the manuscript and if the supplemental figures + legends were in a separate document. We have now included the main figures and legends into the manuscript, thanks.

      Fig 4E: very difficult to understand what was done.

      We now add further explanations to the figure legend to describe the different promoter groups compared in the analysis of ChIP-seq coverage of TBP and TRF2. Fig 4A vs G: why are ~ the same number of genes affected by TRF2 vs TBP + TRF2 depletion? I got the impression from the text that there should be a large difference in the number of affected genes.

      We had the same prior expectation, but indeed observed a similar number of downregulated genes upon TRF2 depletion versus TBP and TRF2 double depletion. This may partly be technical, e.g. relating to clonal selection of the different AID-cell lines or thresholding effects, but is likely explained by the relatively few TBP dependent genes (200) that don’t contribute substantially to the larger group of TRF2 dependent genes (3826). The observed number 3935 is 98% of the sum, even ignoring potential overlap. We now clarified this in the text. Fig 5A and similar figures: include the number of affected genes in the figure.

      We added the number to the figure, thanks. Fig S2C: hard to understand what was done from the legend.

      We have added additional explanations to the figure legend, thanks. Fig S2F and similar figures: hard to distinguish the legend and the green colors used. Proofreading: Add citation for Cut&run in the methods.

      We did not analyze CUT&RUN data, however ATAC-seq and ChIP-seq data sets are cited.

      In supplemental Fig1a, the percentage of "INR only" is greater than 100%.

      We thank the reviewer and fixed the typo.

      Supplemental Fig 1a legend-should 170,000 protein coding genes read "17,000"? Santana et al. reference on pg 8 should read 2022.

      We thank the reviewer and fixed the typos Readability: The categorizations of genes classes based on core promoter elements is somewhat unclear-from 1a, is it the case that all TATA contain INRs? A different way of representing the data to capture overlaps in motifs other than a pie chart may better convey these motif relationships. Work could be done to increase clarity in general on the promoter motif subtypes used and how mutually exclusive these elements are in the tested subsets.

      We thank the reviewer for the suggestion. We have added a heatmap in Supplementary Figure 1A showing the percent match score to motif PWMs across Drosophila promoters. As the reviewer suspects, most developmental core promoters have a high-scoring INR motif and some have an additional TATA box or DPE motif. We have also revised the remainder of the text and rewritten the methods section regarding the motif analysis (pages 36 to 38) to improve clarity. Many thanks. Figure 5: authors state "all protein coding genes" are downregulated with TFIIA depletion, though it appears some transcripts are unchanged or upregulated in 5B/C. Suggest change in language.

      We thank the reviewer for this comment. Less than 70 genes are not downregulated upon TFIIA depletion, and manual inspection shows that these genes include intronic non-coding RNAs such as tRNAs that hinder accurate PRO-seq quantification. However, we agree with the reviewer and revised the text to reflect that essentially all promoters are downregulated, affecting all promoter types. A discussion on the developmental context of the S2 cell line seems appropriate. If S2 cells represent a late stage developmental cell line, would the authors expect the relative utilization of cofactors to be the same/different in other cellular contexts?

      We thank the reviewer for this comment. We indeed expect the relative utilization of cofactors to be the same I most cellular contexts and now added a discussion with relevant references (page 23), many thanks.

      Reviewer #2

      1. The DNA affinity purification method is excellent as a discovery method, but it has some potential caveats. One is that it cannot capture remodeling events that could potentially remove otherwise stably bound factors to allow for transient PIC assembly and gene activation. It is possible that some of the insulator factors such as BEAF-32 and Ibf1/2, which selectively bind housekeeping sequences, could prevent or reduce binding by PIC factors. This could occur if BEAF-32 and/or Ibf1/2 inhibit PIC assembly if bound to DNA and if these factors bind housekeeping promoters with high affinity and slow off-rates. That is, in live cells, a competition could exist between binding of these enriched housekeeping factors and PIC assembly. By contrast, this caveat is not relevant at developmental promoters due at least in part to low/sub-nM TBP binding affinity. Ultimately, this is a minor concern but the authors should address in the article to inform readers about potential limitations of the experiments.

      We thank the reviewer for highlighting that DNA affinity purification is an excellent discovery method and for pointing out important differences between such in vitro assays and the in vivo situation. We agree and interpret our results from the DNA affinity purification carefully and specifically regarding differences observed for different types of core promoters under identical experimental conditions. We now highlight these differences more clearly throughout the relevant sections on pages 4-8 and expand the discussion of this issue in the ‘limitations of the study’ section. Many thanks.

      1. More information about how the PRO-seq spike-ins were implemented is recommended. For example, were they fit to a linear regression of read counts/chromosome between all samples, or did they take all hg19 reads as raw fold-change of all samples compared to a control replicate?

      We thank the reviewer for addressing the insufficient information provided about the spike-ins used for PRO-seq. We have added this information to the materials and methods section: We calculated the ratio of spiked-in reads representing the percentage of reads mapping to the human genome over all reads. This ratio was used to determine a scaling factor representing the fold-change of total transcriptional output between the auxin-treated sample and the control samples.

      1. Figure S1C should be cited (not S1B) to support the statement "Mutating either the TATA box or DRE motifs reduced TBP or DREF binding to control levels..."

      We thank the reviewer for this correction and implemented the correct panel citation.

      The authors could note that TATA box mutants still show slight enrichment for TBP compared to negative controls.

      We now note this in the figure legend and explain that it is consistent with TBP binding to non-TATA-box developmental core promoters (Figure 2 B & E).

      In Figure 2A, it would help to remind readers here that TATA, DPE, INR = developmental and TCT, Ohler1/6, DRE = housekeeping.

      We thank the reviewer for this suggestion and implement it

      Figure S2A shows only 121bp and 350bp DRE core promoters but the text refers to 450bp and 1000bp sequences as well. Can the authors show representative results from these longer sequences?

      We thank the reviewer for pointing out these inconsistencies, which we now fixed by revisions to the text and supplementary figures.

      1. In comparing data in Fig 2B and 2E, it seems the statement "the ChIP signals reflected the differential binding preferences observed in vitro for the respective promoter subtypes" should be modified. It is true to an extent but it is more nuanced than indicated by the text.

      We have reworded the section and now discuss the observed trends for GTFs and TFs.

      In Fig S2I, Ohler1 + Ohler6 and TCT are difficult to distinguish because of color scheme choice.

      We agree and now explain in the figure legend that the brighter green corresponds to the Ohler1/6 promoters and the darker green to the TCT promoters, we have additionally edited the legend for better color visibility, many thanks.

      In Fig 3F, perhaps add that Gld has TATA and Fit2 has DRE?

      We now indicate the presence of TATA-box and DRE motifs in the figure, thanks.

      Fig 5D: legend is cut off in the Figure. We thank the reviewer for this comment and now fixed the cropped legend. 11. Fig S2B needs more description and clarification in the main text and the legend. We now deleted Fig.S2B. 12. Page 8, 2nd paragraph "avoiding potential" should be replaced with "minimizing" or similar. We thank the reviewer for this comment and have changed the word choice. 13. Page 16, penultimate paragraph: "Essentially" should be replaced with "Essentiality"

      We thank the reviewer for this comment and correct the wording.

      Reviewer #3

      1. The authors perform a k-means clustering of PWM match scores within 17,000 promoter sequences. They describe in the Methods section that this data revealed 9 groups of promoters. However, although it is likely that several of these promoters contain matches for multiple core promoter motifs, the promoter classes are simply named DRE-promoters, TATA-promoters, TCT-promoters, etc., disregarding any combinatorial association. Furthermore, the clustering data is not visualized to support this naming. The authors should at least provide a heatmap showing the PWM match scores for these clusters and indicate which promoters were used. This is crucial for interpretation of results. We thank the reviewer for pointing out the description of the motif analysis lacked clarity and that the clustering of Drosophila promoters should be visualized. We agree and now provide the k-means clustering heatmap of all 17118 protein coding gene promoters, visualizing the position-weight-matrix (PWM) scores matches for the different promoter motifs in Supplementary Figure 1A. This visualization confirms the reviewer’s suspicion that core-promoter motifs often co-occur in the same core-promoter. For example, TATA promoters typically contain TATA-boxes and INR motifs, etc, which is now clearly seen in the newly provided heatmap. We have also revised the main text, figure legends and have rewritten the method section (pages 36-38) to clarify the analysis of motifs throughout the manuscript. Many thanks.

      2. Relatedly, this paper uses a seemingly over-simplified terminology to describe promoters as housekeeping or developmental. While this terminology has been used in several studies from the Stark lab, this is not well supported by data and the usage of this terminology will likely lead to confusion among readers. Here, housekeeping seems to refer solely to the presence of a motif match in the promoter sequence rather than to ubiquitous expression across cell types. Similarly, developmental promoters seem to refer to anything that is not housekeeping. Are S2 cells best reflecting the activity of developmental genes? What about genes that are not expressed as part of a specific developmental trajectory, but still cell-type restricted? Since focus here is on the behavior of promoters with respect to their core promoter elements, why not just refer to them according to their promoter elements? A good example where the developmental versus housekeeping distinction is not useful is the authors' desire to generalize differences observed in Figure 2B, in which it is quite obvious that there is no clear developmental versus housekeeping split. Rather the data demonstrate that TATA-containing and DRE-containing promoters behave differently.

      We thank the reviewer for raising a concern about the terminology of functionally distinct promoter types in Drosophila. The use of functionally distinct promoter types enriched in different motifs is built on extensive evidence by our lab and others (e.g. the Ohler or Kadonaga groups) that found extensive agreement between promoter sequence, promoter function, initiation pattern, gene annotation, and ubiquitous vs. cell-type-restricted activities. Ubiquitously active housekeeping promoters tend to contain the TCT, DRE and Ohler 1/6 motifs, while cell-type-restricted developmental promoters tend to contain TATA-box, DPE and INR motifs (Arnold & Zabidi, Nat Biotech 2017, Haberle et al. Nature 2019, Ngoc et al. Genetics 2019, Ohler et al. Genome Biol 2002, Ohtsuki et al. Genes & Dev 1998, Rach et al. Plos Genetics 2011).

      We find that the terminology is simple and thus accessible for the non-specialist reader. We agree with the reviewer that clarity is key and revise the introduction of the terminology to clarify that it is based on multiple lines of evidence. We also clarify that Figure 2B – in contrast to the reviewer’s claim – does support a clear developmental versus housekeeping split (please see the dendrogram on top of the heatmap). We now clarified this in the main text and legend to Figure 2B, many thanks.

      1. The authors state that the "prevalent model" in the community is that PIC assembly is the same at all promoters. This is not true. For instance, it is well established that certain core promoter elements have a strong positional effect on TSS selection, while dispersed promoters lack strong positional features. What is less known is how the dispersed pattern, e.g. of non-TATA promoters, arises. The authors should more clearly specify the unknowns and the novel findings of their paper.

      We agree with the reviewer that certain core promoter elements have strong positioning effects on TSS selection and that these occur in promoters with focused initiation patterns such as TATA promoters and developmental non-TATA promoters (e.g. promoters with INR and/or DPE motifs). We also agree that it is unclear how dispersed patterns at housekeeping promoters arise, especially because the initiation sites don’t co-occur with the TF motifs present in these promoters (e.g. DRE or M1BP motifs; see Figure 6A).

      However, the question we address goes beyond TSS selection: we have not seen any study of PIC recruitment and assembly at any promoter with dispersed initiation pattern and the idea of a single uniform Pol II PIC assembly has been the predominant view of transcription initiation during the past two decades (Schier & Taatjes, Genes & Dev 2020). Here, we provide evidence that protein recruitment and GTF usage differs between promoter types, which has mechanistic implications beyond TSS choice alone. In particular, we show that at least two modes of transcription initiation exist that differ between focused developmental and dispersed housekeeping promoters, whereby the developmental promoter DNA directly engages the Pol II PIC via TBP and TFIID, while the housekeeping promoter DNA does not and instead, housekeeping promoters recruit TFs, which recruit COFs and TFIIA. This is exciting and inconsistent with uniform GTF recruitment and assembly, and we hope that this work motivates the study of these different PIC assembly mechanisms at different promoter types.

      One of the major claims made by the authors in the paper is that PIC is recruited directly or indirectly depending on the presence of TATA or DRE. However, their approach seems to pick up a lot of indirect bindings, especially for TATA. This raises concerns of potential biases, which if addressed would strengthen the author's claims. The results do not exclude that TFIIA is directly recruited to TATA but might simply reflect stronger binding to other factors compared to DRE. It is also puzzling that DRE is the only one selected for further validation as it appears to have the lowest affinity for PIC binding and the focus on Ohler1/6 motifs in the final model. Disclaimer, this reviewer is not an expert on DNA-affinity purification assays.

      We thank the reviewer for pointing out that we had not sufficiently clearly explained the DNA affinity purifications. They were performed under identical conditions for all promoter types, such that the differential binding to TATA vs DRE promoters reflects the respective promoter DNA’s affinity to various transcription-related proteins – they are key results of our work. Please note that, despite the high number of TATA interactions, many of these interactors are expected and reflect the binding of multi-subunit protein complexes such as the Mediator and TFIID (please see Figure 2B) and reflect the fact that we did not purify the PIC nor reconstitute it from purified components but determine nuclear proteins that bind to TATA-box promoter DNA. We now introduce and discuss these aspects more clearly.

      It is possible that the fewer interactors found for housekeeping promoters stem from lower affinity of the PIC, the lack of chromatin, or the stable binding of sequence-specific binders such as DREF, BEAF-32 and M1BP in our assay (please see our response to reviewer 2 above). As these result from identical experiments under identical conditions, the fewer interactors for housekeeping promoters are also an important result that likely reflects lower affinity or more transient binding. We now clarify these results and their interpretation in the main text and discuss differences between this assay and transcription in vivo in the “limitations of the study” paragraph.

      As the reviewer might appreciate, the follow up experiments, including the creation of AID cell lines, PRO-seq, etc., are a lot of work such that we did them for promoters at the two extreme ends of the spectrum and their respective DNA-binding factors TBP and DREF identified in Figure 1. We think that these representatives sufficiently strongly demonstrate that PIC assembly and factor requirement is distinct for different promoter types, many thanks.

      Their final model is supported by results by Baumann et al (2018), which directly shows binding and interactions between M1BP, putzig, gfzf and TRF2. However, these factors bind to Ohler1, while most of the work within this study (Figures 1, 3) focused on DRE. How do DRE-containing promoters fit with the final model? Currently, these promoters are not even represented in the model figure.

      We thank the reviewer for pointing out that the final model highlights the Ohler 1 motif but omits the DRE motif. Based on the functional analyses shown in Figure 6 (pages 19-21), we think that the different motifs function equivalently in recruiting housekeeping cofactors and activating housekeeping transcription and have now included DRE motifs in the final cartoon. Our original choice was indeed based on the fact that previous reports from Baumann et al 2018 corroborate our findings for M1BP. As DRE promoters also recruit and depend on TRF2 (Hochheimer et al. Nature 2002), we now show a model by which housekeeping DRE promoters recruit a TRF2 containing PIC through TFIIA, but would like to stress that both likely function equivalently, leading to dispersed initiation. We also revised the data presentation and the final discussion regarding these promoters, many thanks.

      Minor comments

      1. The TSS patterns of promoters were evaluated using STAP-seq (in vitro data) and developmental CAGE data. For the purpose of the paper and to match the in DNA-affinity purification data better, it would be more reasonable to make use of S2 cell CAGE data (e.g. Rennie et al, 2018 PMID: 29659982).

      We thank the reviewer for bringing up this point. For figure 6 we have used CAGE data from Drosophila embryos instead of S2 cells in order to capture a larger proportion of expressed developmental genes and their promoters, rather than just the ones that are expressed in S2 cells. As promoter motifs are found in stereotypical positions in relation to the TSS (Ohler et al. Genome Biol 2002) and because non-S2-cell core promoters can be activated in STAP-seq (Arnold 2017; Haberle 2019), our use of CAGE data from Drosophila embryos allows us to base all subsequent analyses on many more core promoters and also exclude any cell-type specific effects that may arise in TSS selection.

      Previous models on TSS selection within non-TATA promoters have highlighted the dinucleotide frequency of +1 nucleosomal DNA as a strong positional feature. Here, the authors investigate this model using a rather weak analytical approach. We know that nucleosomes can vary between cells (fuzzy positioning). Variability across promoters may cause larger variability in relative TSS positioning. Hence, what is observed here as a TSS spread relative to the +1 nucleosome positioning might in fact be caused by averaging. A more suitable approach would be to analyze the positional cross-correlation between TSS locations (e.g. revealed by CAGE reads) and nucleosomal positions (e.g. revealed by MNase-seq reads). This would better support claims regarding specific TSS positioning with respect to nucleosome positioning.

      We agree that the analysis of cross correlation between TSS locations and nucleosomal positions at individual promoters would provide a more precise measure of TSS positioning relative to the nucleosome. We had originally chosen a visualization that more directly assesses whether the +1 nucleosome determines the TSSs by centering on the predicted +1 positions. In response to this comment, we have performed two additional analyses: a cross-correlation analysis on CAGE and Mnase-seq read coverage in relation to the dominant CAGE TSS (new Supplementary Figure 6I) and a TSS-centric analysis of Mnase-seq coverage (new Supplementary Figure 7. Both analyses agree with the original analysis and we thank the reviewer for pointing out how to strengthen this analysis.

      The cross-correlation analysis reveals a peak in the mean correlation score 125 base pairs downstream of housekeeping TSS (at TCT, Ohler1 and DRE) promoters but not downstream of developmental promoters (TATA-box, DPE and INR), in line with housekeeping TSS being positioned upstream of the +1 nucleosome.

      The analysis assessing +1 nucleosome positions as derived from MNase-seq coverage relative to the position of the dominant TSS reveals the expected phasing of downstream nucleosomes in housekeeping promoters but not at developmental promoters. Many thanks.

      It is interesting that tethering of housekeeping-associated coactivators leads to a higher positional dispersion compared to the result of developmental-associated coactivators. However, the positional TSS dispersion of housekeeping promoters seems to always be larger than that of developmental promoters regardless of coactivator recruitment. Can the authors explain these results?

      We agree that CAGE data typically show TSS dispersion at housekeeping promoters, yet this reflects the promoters’ transcriptionally active states during which endogenous TFs and coactivators are present. Our analyses are based on short, transcriptionally inactive core promoters that can be activated by cofactor recruitment, leading to the observed outcomes. We now clarify this in the manuscript and highlight that the differences in focused versus dispersed patterns occur even on the very same DNA sequences upon the recruitment of developmental or housekeeping activators (e.g. Fig. 6F).

      The authors seem to suggest that positional dispersion of TSSs within housekeeping promoters is due to stochastic initiation after non-positional specific PIC recruitment mediated via certain co-activators. If TSS selection is truly stochastic, why do these promoters then have dominant TSSs?

      We thank the reviewer for pointing out that our phrasing might have suggested that TSS selection was entirely random or stochastic, which is neither true for STAP-seq nor for endogenous CAGE data. In fact, not all positions have the same probability to initiate transcription, but certain positions or nucleotides seem to be inherently favored. We speculate that favorable positions relate to the local DNA structure, the energy barrier landscape for both DNA helix melting to occur and for the first phospho-diester bond to form (e.g. Dineen, D. et al. NAR 2009 and Vanaja, A. et al. ACS Publications 2022). We now added this discussion and the corresponding references to our manuscript (page 21).

      The authors find Chromator as a likely cofactor for indirect recruitment of TFIIA to housekeeping promoters. BEAF-32 is another factor the authors highlight as being enriched at housekeeping promoters (DRE promoters). Both of these factors have previously been considered insulator proteins or architectural proteins involved in the formation of chromatin folding (Ramirez et al, 2018, PMID: 29335486; Wang et al, 2018. PMID: 29335463). Could the authors comment on this link with their own findings?

      We thank the reviewer for addressing the importance of chromatin topology in the light of our findings, which we now discuss in the main text (pages 22-23).

      1. Can the authors justify PWM match thresholds used and why these were changed from Haberle et al 2019?

      We thank the reviewer for pointing out that these changes had not been justified. We adjusted them to be more stringent (e.g. DPE) or sensitive (e.g. TATA-box) exclusively for the motif enrichment analysis, which we did outside the rule-based promoter-annotation effort. These adjusted thresholds reflect the motifs vastly different information contents, which is low for DPE and high for TATA-box motifs.

      Figure related comments/concerns: • General: Sometimes wrong ordering of figure panels with regards to their first mention in the main text, varying font sizes, and minimal figure legends that are often inconsistent (e.g. PRO-seq is sometimes specified when used, but not always) • Typo: Supp Fig 1: INR only 121.37% • Fig 1E not explained, what does x axis describe and how is it calculated? • Figure 2C-D: The CAGE signal is poorly visualized in panel C, it also poorly describes that this is supposedly done using a pool of promoters. Where is the 450bp blot (it seems plausible that the 450bp fragment could actually facilitate a luciferase signal in Fig S2-B)? How was this pool selected, is it exclusively based on DRE-containing promoters? • Fig 2D: apparent gel leakage and loading on the second panel is low. Preferably, provide positive control on the same gel. • Figure 4C: all classes are negatively affected by TRF2 depletion, thus enrichment (4B) makes little sense here • Figure 5C: Missing axis labels • Figure 6F: A y scale would help here

      We thank the reviewer for these recommendations and have implemented all of them.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this study, mice were exposed to a specific form of so-called Intermittent Fasting (IF) and the effects of IF on adult neogenesis in the hippocampus were determined. The specific IF protocol used had no effect on activation, proliferation, or maintenance of adult Neural Stem Cells (aNSCs) and displayed a decrease in number of new neurons in the neurogenic niche but only after 1 month of the IF protocol. These results contrast previously published results from multiple studies that concluded that IF promotes survival of new neurons and by extension promote adult neurogenesis. The unresponsiveness of aNSCs or their immediate cell progeny, the Intermediate Neural Progenitors (IPCs), to IF is a novel finding. The authors make several relevant points in the discussion about the publication bias towards positive results (or omission of negative results), which may reinforce established dogmas. However, the presented results did not convincingly demonstrate that the absence of effects of IF on aNSCs or adult neurogenesis is simply not a result of a specific IF paradigm, which is not robust enough to elicit changes in adult neurogenesis. In other words, there is a lack of positive controls and alternative protocols that would rule out that the observed absence of effects is not a consequence of type II error (the error of omission), or more colloquially, a consequence of false negatives.

      We thank the reviewer for acknowledging the importance and novelty of our findings. On them being the result of a specific IF paradigm, we must point out that we used the same IF paradigm as in previous studies that had shown changes in neurogenesis upon IF. We do not claim that IF is unable to increase neurogenesis in all conditions, but report that IF is not a reliable method to increase adult neurogenesis (in particular, every-other-day intermittent fasting with food re-administration in the evening). We have repeated the experiment multiple times in different strains, always with enough animals to make our experiments conclusive and we never observed an increase in adult neurogenesis, effectively ruling out that our results are a false negative. Of note, even if other protocols might indeed increase neurogenesis (which we never claimed cannot) that would not make our results a false negative.

      Major Comments:

      1. Protocol-driven absence of effects: The absence of IF effects on aNSCs and IPCs observed in this study does not lend it the authority to conclude that aNSCs are resilient to IF or all IF paradigms and protocols. The absence of IF effects on aNSCs and neurogenesis could be specifically related to the chosen IF paradigm. Indeed, not all previous studies that observed IF-driven effects on adult neurogenesis used the same "night-time every-other-day fasting" protocol chosen in this study. For example, Brandhorst et al., 2015 (cited in this paper) used 4 days of IF 2x per month and observed an increase of DCX+BrdU+ cells. On the other hand, certain previous studies used the same or similar IF protocol used here, but often with longer duration or with a post-fasting ad libitum feeding period, which may be responsible for the pro-neurogenic or pro-survival effects. In fact, the authors acknowledge this in the discussion (page 7, lines 289-290 and 292-294). Why would the authors then not include similar feeding/IF paradigm in their study and determine if these would generate effects on survival of new neurons but also on aNSCs and/or IPCs?

      As just stated above, we never claimed that aNSCs are resilient to all IF paradigms. We refer to fasting in general in the introduction but quickly focus on every-other-day fasting throughout the paper and directly compare our results only to similar IF paradigms. We chose the most commonly used IF paradigm that had been shown to increase adult neurogenesis. As the reviewer points out, we speculate in the discussion that a refeeding period may explain the differences between our results and others. This is because a post-fasting ad libitum period was introduced in the study published in Dias et al. 2021. We are currently analysing a new experiment in which we replicate the IF protocol in that study, which we will include in our revised version.

      In addition, the authors acknowledge that the chosen IF paradigm may have affected the stress levels or behaviour of mice (page 9, lines 372-378). Why did they not test if their IF protocol does not increase stress or anxiety of mice by simple behaviour tests such as open field or elevated T maze?

      While testing all possible causes for the lack of positive results in our experiments is not viable, we do agree with the reviewer that stress levels might indeed influence the outcome of the experiments. We will collect blood from ad libitum-fed and fasted mice to analyse the levels of stress hormones (e.g. corticosterone). The results will be included in our revised version. These measurements will give us a more accurate reading of stress levels than behavioural tests. Of note, regardless of the outcome of this experiment, our conclusions will remain identical. We will not be able to compare stress levels with previous publications, as they were not tested. And if the protocol did increase stress levels, it would still argue that IF is not a reliable method to increase neurogenesis (as presumably might or might not increase stress to levels that affect neurogenesis).

      Alarmingly, the used IF protocol does not result in changes in final weight or growth curves (S.Fig.2), which is surprising and raises a question the used IF protocol is robust enough or appropriate.

      We were also surprised by the lack of change in the final weight our IF mice respect to control. Differences in final weight between different labs despite using the exact same protocol are one of the reasons why we conclude that this IF paradigm is not a robust intervention. However, we are not the first ones to report little or no difference in weight upon IF in C57BL6/J mice (Goodrick et al., 1990 and Anson et al., 2003) and this would not be a reason to dismiss the experiment since the benefits in crucial circulating factors induced by IF seem to be independent of weight loss (Anson et al., 2003).

      Finally, the authors acknowledge that their own results do not support well-established findings such as aging-related reduction in number of aNSCs (page 4, lines 177-179). This again questions whether the selected protocols and treatments are appropriate.

      As we already discuss, we believe this might be due to a difference between strains in the time when aNSC numbers decline. Nevertheless, we will complement our current data by counting the number of aNSCs at 1 and 3 months post-tamoxifen (3 and 5 month old mice) using GFAP, Sox2 and Nestin triple stainings (as suggested by another reviewer).

      Lack of topic-specific positive controls: The authors successfully demonstrated that the used IF protocol differentially impacts the adipose tissue and liver, while also inducing body weight fluctuations synchronized with the fasting periods. However, these peripheral effects outside the CNS do not directly imply that the chosen IF protocol is robust enough to elicit cellular or molecular changes in the hippocampus. The authors need to demonstrate that their IF protocol affects previously well-established CNS parameters associated with fasting such as astrocyte reactivity, inflammation or microglia activation, among other factors. In fact, they acknowledge this systemic problem in the discussion (page 8, lines 359-360).

      We fully agree with the reviewer in that even though the chosen IF protocol induces peripheral effects, it is not robust enough to elicit cellular or molecular changes in the hippocampus, and this is precisely the message of our paper. We have looked for references showing the influence of IF on astrocyte reactivity or microglia activation, but the studies we found so far look at the effects of IF and other forms of fasting in the CNS in combination with pathologies such as Alzheimer’s disease, Multiple Sclerosis, physical insults or aging (Anson et al., 2003; Chignarella et al., 2018; Rangan et al., 2022; Dai et al., 2022. Reviewed in Bok et al., 2019 and Gudden et al., 2021). Fasting seems to reduce astrocyte reactivity, inflammation or microglia activation in these pathological situations respect to the same pathology in ad libitum mice, but its effect in control, healthy mice is far less clear. In fact, the only reference that we could find where healthy mice were included in the analysis showed that these benefits only happened in the context of the injury (Song et al., 2022).

      Problematic cell analyses: Cell quantification should be performed under stereological principles. However, the presented results did not adhere to stereological quantification. Instead, the authors chose to quantify specific cell phenotypes only in subjectively selected subsets of regions of interest, i.e., the Subgranular Zone (SGZ). This subjective pre-selection may have been responsible for the absence of effects, especially if these are either relatively small or dependent on anatomical sections of SGZ. For example, IF may exert effects on caudal SGZ more than on rostral SGZ. But if the authors quantified only (or predominantly) rostral SGZ, they may have missed these effects by biasing one segment of SGZ versus other. The authors should apply stereological quantification at least to the quantification of new neurons and test if this approach replicated previously observed pro-survival effects of IF. Also, the authors should describe how they pre-selected the ROI for cell quantification in greater details.

      We did analyse only the more septal region of the hippocampus, which we will make clear in the text. As also suggested by other reviewers, we will include stereological counts of the neuronal output of aNSCs in the revised version. As for selecting the SGZ for aNSC counts, this is the standard in the field, as one of the criteria to identify aNSCs is precisely the location of their nucleus in the SGZ. Neuroblasts and new neurons were counted both in the SGZ and the granule cell layer. There was no subjective pre-selection of areas of interest since we counted the whole DG in each section and not a specific random region.

      Alarming exclusion of data points: There appears to be different number of data points in different graphs that are constructed from same data sets. For example, in the 3-month IF data set in Figure 4, there are 14 data points for the graph of Ki67+ cells (Fig.4B), but 16 (or 17) data points for the graph of DCX+ cells (Fig.4D). How is that possible? If data points were excluded, what objective and statistical criteria were applied to make sure that such exclusion is not subjective and biased? In fact, the authors state that "Samples with poor staining quality were also excluded from quantifications" (page 12, line 528-529). Poor preparation of tissue is not only suboptimal but not a valid objective reason for data point exclusion. This major issue needs to be explained and corrected.

      As we disclose in the methods, those stainings that did not work were excluded. This was done always before counting. Different samples were used in different counts because of the variability of staining quality between different antibodies. We will look back into the samples that failed in at least one of the stainings and exclude them from all counts, so that only samples for which all stainings worked are considered. These revised graphs will be provided in our revised version of the manuscript.

      Different pulse-and-chase time-points: One of the reasons why this study has found that aNSCs may not be responsive to IF could be the use of less appropriate pulse-and-chase time-points either after EdU or after Tamoxifen for cell lineage tracing. The authors observed that IF has negative effects on new neurons initially (Fig.4F). Similarly, it is well established that voluntary physical exercise affects SGZ adult neurogenesis only during the first 2 weeks. After this period, the neurogenic effects of exercise are diminished beyond observational detection (i.e., van Praag's and Kempermann's papers in the past 25 years). These two arguments suggest that the observed absence of aNSC responsiveness might be a consequence of the chosen EdU administration and the EdU pulse should not be administered 15 days after Tamoxifen/IF protocol start but earlier, in the first week of the IF protocol. In fact, the decreased number of new neurons during the initial IF phase may not be only a consequence of reduced survival but of higher aNSC quiescence during the first week of the IF protocol.

      We fully agree with the reviewer that BrdU or EdU pulses can give a biased view of the effects of any intervention on neurogenesis and that the EdU and Tamoxifen protocols would not allow us to detect an increase in neurogenesis during the first few days of IF. We cannot rule out that IF has a transient effect on aNSCs at some point of the treatment, but this hypothetical effect does not seem to have any consequences on neuronal output or aNSC maintenance. As for the effects on neurogenesis in the longer IF treatments, we used the same EdU protocol as in previous publications: administration after 2/3 months of IF and analysis after one month of chase.

      Discussion needs more specificity and clarity: The authors claim that the absence of IF effects on neurogenesis is multi-layered (including the influence of age, sex, specific cell labelling protocols etc.) but they do not specifically address why certain studies did find IF-driven neurogenic effects while they did not. In addition, some statements and points in the discussion are not clear. For example, when the authors refer to their own experiments (page 8, lines 331-334), it is not clear, which experiments they have in mind.

      We will double check our discussion and improve its clarity and direct comparison to other studies.

      Minor comments:

      1. Change in the title: The authors have shown that a very specific IF protocol does not affect aNSCs but initially decreases number of new neurons in SGZ. The title should reflect this. For example, it could state "Specific (night-time every-other-day) fasting does not affect aNSCs but initially decreases survival of new neurons in the SGZ".

      We find our title, together with the abstract, clearly and faithfully represent our findings and would rather prefer to keep our current title unmodified.

      Data depiction: Data in 3 datasets were found not normally distributed (Fig. S5A, B and S6A) and were correctly analysed with non-parametric tests. However, the corresponding graphs wrongly depict the data as mean +/- SD while they should depict median +/- IQR (or similar adequate value) because non-parametric statistical tests do not compare means but medians.

      We thank the reviewer for spotting this, we will correct the graphs in Fig. S5A, B and S6A.

      Statistical analysis: For ANOVA, the F and p values are not listed anywhere. The presented asterisks in the graphs are only for non-ANOVA or ANOVA post-hoc tests. This does not allow to judge statistical significance well and should be corrected.

      Again, thanks for spotting this, we will include them.

      Asymmetric vs Symmetric cell divisions: Representative images in Fig.2B suggest that IF may affect the plane of cell division for the Type-1 aNSCs. The plane of cell division is an indirect indicator of symmetric vs asymmetric (exhaustive vs maintaining) modes of cell division. Is it possible, IF influences this, especially during the first week of IF (see major comment 5)?

      This is an interesting hypothesis. However, since we do not see any effects on aNSC maintenance, it is unlikely that IF produces any long-lasting effects on the mode of division of aNSCs. In general, we did not notice a difference in the plane of division of aNSCs between control and IF mice, although we did not systematically test for this (would require specific short EdU pulses to capture aNSCs in M-phase). In Figure 2B, the two stem cells shown in the control are unlikely to be the two daughter cells after the division of one aNSC, as one of them is positive and the other negative for Ki67. We only pointed to the second one to show a Ki67-negative aNSC. We will emphasize this in the figure legend.

      Improved and more accurate citations: Some references are not properly formatted (e.g., "Dias", page 7, line 288). Some references are included in generalizing statements when they do not contain data to support such statements. For example, Kitamura et al., 2006 did not determine the number of new neurons (only BrdU+ cells) in the SGZ, yet this reference is included among sources supporting that IF "promote survival of newly born neurons" (page 2, line 60). Authors should be more careful how the cite the references.

      Thanks for spotting these mistakes, we will correct them and check again all our references. As for the sentence where the Kitamura paper is cited, most of the other references also use only BrdU+ cells while concluding that IF enhances the survival of new neurons. We will change new neurons for new cells to reflect this, which we already bring up in the discussion (see also extended discussion in previous BioRxiv version).

      How do the authors explain that they observe 73-80% caloric restriction and yet the final body weight is not different between IF and control animals? Would it suggest that the selected IF protocol or selected diet are not appropriate (see major point 4)?

      We also found this surprising and were expecting a change in overall activity in IF mice, which we did not observe. Many factors might play a role, like, as the reviewer suggests, changes in stress levels, which we will measure and show in the revised version.

      Given that aNSCs rely more on de novo lipogenesis and fatty acids for their metabolism as shown by Knobloch et al., Nature 2013 and given the interesting changes in RER with the IF shown in this study, it would be interesting to see whether there are differences in Fasn expression in aNSCs between control and IF animals (see minor point 4).

      This is an interesting suggestion but given that we see no effect on aNSCs, we find it’s unlikely and unnecessary to test for Fasn expression differences in our IF protocol.

      Determining apoptosis in the SGZ by picnotic nuclei (Figure S6A) should be supplemented by determining the number and/or proportion of YFP+ cells positive for the Activated Caspase 3.

      We previously found that counting picnotic nuclei is a more accurate and sensitive readout of cell death in the DG, as cells positive for caspase 3 are extremely rare due to the high efficiency of phagocytosis of apoptotic cells by microglia (see Urbán et al., 2016).

      Reviewer #1 (Significance (Required)):

      General assessment:

      This study concludes that aNSCs do not respond to the intermittent fasting. This expands and supplements previous findings that suggest that the intermittent fasting promotes adult neurogenesis by increasing survival and/or proliferation in the Subgranural Zone. The study is well designed, however, over-extends its conclusions beyond a specific fasting paradigm and does not acknowledge serious limitations in the experimental design and analyses. In fact, until major revision is done, which would rule out that the absence of effects of fasting on aNSCs is not due to false negative results, many conclusions from this study cannot be accepted as valid.

      Advance:

      As mentioned above, the study has a potential to advance our understanding of how fasting affects neurogenesis and fills the knowledge gap of how fasting specifically affects the stem cells. However, unless the study addresses its limitations, its conclusions are not convincing.

      Audience:

      This study would be particularly interesting for the niche readers from the neurogenesis field. However, the study can also be interesting for researchers in metabolomics and dietology.

      My expertise:

      adult neurogenesis, neural stem cells, dietology, metabolism

      We disagree with the reviewer and find our conclusions well balanced, as we acknowledge our results are to be compared only with similar IF protocols. We also do not believe our results can be attributed to a false negative, as we consistently observe the same with different strains and protocols, always with sufficient animals to make our counts conclusive.

      We nevertheless thank the reviewer for assessing our paper and for the advice to improve it. We hope that the reviewer will maintain the same level of scrutiny and scepticism with all IF-related papers.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Gabarro-Solanas et al. question the suitability of IF (Intermittent fasting - non-pharmacological strategy to counteract ageing, which has been previously shown to increase the number of adult-born neurons in the dentate gyrus of mice) as a pro-neurogenic intervention, since IF treatment did not stimulate adult hippocampal neurogenesis, neither at the stem cell level nor on immature and/or dividing neurons. The Authors used a tamoxifen inducible transgenic model (Glast-CreERT2;RYFP mice) to trace neural stem cell lineage and found that IF did not enhance neural stem cell proliferation, nor the abundance of immature, DCX+ neurons. Three-months of IF failed to increase the number of new adult-born neurons (NeuN+/YFP+), while one month of IF significantly reduced the number of new adult-born neurons.

      The study appears technically sound, including many different approaches in order to reach its conclusions.

      For instance, tamoxifen has been reported to impair various physiological processes, including neurogenesis (Smith et al., 2022), and most studies on adult hippocampal neurogenesis use the C57BL/6J strain of mice; hence, the use of Tamoxifen or that of the GlastCreERT2;RYFP model may have underscored these observations. However, to account for this potentially confounding factor, the Authors characterised the effect of their IF treatment in C57BL/6j mice, also reporting no evident effects of IF as a pro-neurogenic intervention.

      I think the study was carefully planned and the analyses well done. Several possible variables were considered, including sex, labelling method, strain, tamoxifen usage or diet length. Several controls were performed in other organs and tissues (liver, fat) to establish the fasting protocol and to check its effects.

      Data are presented in a clear way. Quality of images is high level.

      In general, it appears as a highly reliable paper reaching an authoritative conclusion for the absence of effect of IF on adult neurogenesis.

      Major comments:

      I think that the key conclusions are convincing and no further experiments are required.

      The methods are presented in such a way that they can be reproduced, and the experiments adequately replicated with proper statistical analysis.

      We thank the reviewer for the encouraging remarks and the appreciation of our efforts.

      Minor comments:

      Prior studies are referenced appropriately, both regarding the IF protocols and the adult neurogenesis modulation.

      Line 288 - a reference is incomplete (Dias); integrate with: (Dias et al., 2021)

      We will re-format the reference, thanks for spotting the mistake.

      There is one concept that is not expressed in the manuscript. Maybe it is not strictly necessary, but I think can be useful to mention it here. It is the fact that most information currently available strongly indicates that adult neurogenesis in humans is not present after adolescence. Of course the research described here is carried out on mice, and in the manuscript it is stated many times that adult hippocampal neurogenesis is strongly decreasing with age, also due to age-related stem cell depletion. Yet, it seems that in humans the exhaustion of such a process can start after adolescence. We know that a sort of controversy is currently present on this subjects, because DCX+ neurons can be detected in adult and old human hippocampi. Yet, it is also clear that there is no substantial cell division (stem cells are depleted) to sustain such hypothetical neurogenesis. Hence, it has been hypothesized that non-newlyborn, "immature" neurons can persist in the absence of cell division, as it has been well demonstrated in the cerebral cortex (see La Rosa et al., 2020 Front Neurosci; Rotheneichner et al., 2018, Cereb Cortex).

      This point can be important in the case someone want to use dietary approached such as IF (or any other pharmacological treatment) to stimulate neurogenesis in humans.

      We agree with the reviewer and also find this a very interesting and timely topic. However, we find it a bit far from our results and would prefer not to comment on it in the context of the current paper.

      Reviewer #2 (Significance (Required)):

      The significance of this study relies on the fact that adult neurogenesis field (AN) has been often damaged by the search of "positive" results, aiming at showing that AN does occur "always and everywhere" and that most internal/external stimuli do increase it. This attitude created a bias in the field, persuading many scientists that a result in AN is worthy of publication (or of high impact factor publication) only when a positive result is found.

      Personally, I found particularly meaninful the last sentences of the Discussion (reported below), which might seem "off topic" in a research paper, while - I think - underline the real significance of the manuscript:

      "In addition, publication bias might be playing a role in skewing the literature on fasting and neurogenesis towards reporting positive results.

      In some reviews, even studies reporting no effect are cited as evidence for improved neurogenesis upon IF. Reporting of negative results, especially those challenging accepted dogmas, and a careful and rigorous evaluation of the publications cited in reviews are crucial to avoid unnecessary waste of resources and to promote the advancement of science."

      Reviewer field of expertise - keywords: adult neurogenesis, brain structural plasticity, non-newly born immature neurons, comparative neuroplasticity.

      We are very happy that the reviewer shares our concern with the biased publication of positive results in the field. We hope our work (and that of Roberts et al., 2022) will encourage other labs to publish their negative results.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Gabarro-Solanas et al. investigate the effects of intermittent fasting (IF) on adult hippocampal neurogenesis in young adult mice. IF has been reported to increase the number of adult-born neuron in the hippocampus, a region that is important for learning and memory. However, it is not well understood what stages of adult neurogenesis are regulated by IF. To address this, the authors utilized lineage tracing and label retention assays in mice undergoing an IF diet. The authors used 2 months old Glast-CreERT2;RYFP mice in combination with Edu label retention to characterize adult NSCs and placed these mice on 1 and 3 months of IF. Despite seeing a decrease in neural stem cell proliferation with age, the authors did not observe a change due to diet. The authors then used immunohistochemistry to characterize changes in cell proliferation, neuroblasts, and new neurons following 1 month and 3 months of IF. Only 1 month of IF seemed to decrease the number of new neurons; however, by 3 months the neuronal output was the same. There were no differences in neuroblasts or cell proliferation due to diet. Gabarro-Solanas et al. conclude that IF transiently and mildly inhibits neurogenesis. Due to contradicting results, the authors then try to determine what variables (sex, labeling method, strain, tamoxifen usage, or diet length) could be affecting their data. The authors saw no substantial differences due to any of their variables.

      Major Points

      1. The authors analyze NSCs homeostasis and neurogenesis in young adult mice and do not observe any significant changes with their chosen alternate day intermittent fasting paradigm. However, a lot of the data and cell counts appears to be highly variable between animals in the same group. At times, there is an order of magnitude difference between the highest and lowest counts (e.g. Figure 2C,E). According to the method section, it appears that the authors predominantly analyzed a single DG (section?) for most immunostainings, which may explain the large variability in their data. If this is indeed the case, it is insufficient to quantify only a single section for each animal. The authors should quantify several DG sections for each mouse from a pre-defined range along the rostral-caudal axis of the hippocampus in accordance with a standard brain reference atlas. There are also several quantifications, especially of Ki67 where several individuals appear to have no Ki67+ (Figure 3B, 6D) NSCs. These findings are surprising given the still young age of these mice and may be another reflection of the limited brain sections that were analyzed.

      The counts are indeed very variable. The counts were made on 1 to 4 DG sections (counted in full), depending on the staining. We will more clearly disclose this information in the revised version. In addition, we will re-count the neuronal output after fasting using stereology. Regarding the very low number of Ki67+ aNSCs, our counts are lower than those in other publications because we are much more stringent with our aNSC identification. Instead of using merely Sox2 (which also labels IPCs), we rely on the presence of a radial GFAP+ process.

      There appear to be significant cutting or imaging artifacts across most fluorescent images further raising concerns regarding the accuracy of the quantifications (e.g. Figure 3D, 4C,E, 6B) and publication quality of the images and data. Importantly, uneven section thickness, either from cutting artifacts or imaging issues, may lead to inaccurate cell quantifications a could, possibly, account for the high variability. This issue would further exacerbate concerns regarding the quantification of a single DG section for each animal.

      We only processed those samples that passed our QC after sectioning, meaning any unevenly cut brains were never considered (or stained). The stitched images do show artifacts (lower signal in the image junctions), particularly in the NeuN staining. However, this did not affect quantifications, as the measured levels were always clearly above the threshold to consider a cell positive, regardless of the position within the image. The images were cropped to improve the visualisation of NSCs, and to avoid the display of empty tiles. A low magnification image will be provided in the revised version to show that there were no staining artifacts.

      It is unclear how NSCs were counted in the B6 mice (Fig 6D,E). The authors only provide a description for the Glast-CRE mice, where they used YFP labeling and GFAP. We assume they performed Sox2/GFAP or Nestin labeling, however, this is not clear at all. The authors should describe their methodology and provide representative images.

      We used GFAP, location and morphology to count aNSCs in non-YFP mice. We will make this clear in the text and will also add one more count using Sox2, GFAP and Nestin to identify aNSCs.

      NSC populations represent a heterogenous group of stem cells with different replicative properties. As such, the Glast-Cre approach used for the majority of this study may represent a specific subset of NSCs. In line with the previous point, we recommend the authors complement their NSC counts with Sox2/GFAP and Nestin immunostainings.

      aNSCs labelled with Glast-Cre are the great majority of aNSCs (>90%) in both ad libitum fed and fasted mice. The data will be included in the revised version. Nevertheless, we will add counts using Sox2, GFAP and Nestin for key experiments.

      Stress is a significant negative regulator of neurogenesis. Is it possible that the IF mice display higher stress level which could counteract any beneficial effects of the IF intervention. The authors should provide some measures of stress markers to rule out this potential confounding factor in their IF paradigm.

      This is a great suggestion. We will collect blood from control and fasted mice and measure the levels of stress factors (e.g. corticosterone). We will include the data in our revised version.

      Minor Point

      1. The authors state that "Experimental groups were formed by randomly assigning mice from different litters within each mouse strain and all experiments were conducted in male and female mice". Given that neurogenesis, especially at young ages, is highly sensitive to the exact age of the mice, the authors should provide a rationale why animals from different litters instead of littermate controls were used in these experiments.

      Littermate controls were always used in the experiments. But also, more than one litter was used for each experiment, since one litter was never generating enough mice for the experiments. We will clarify this point in text.

      Currently, the statistical tests are only described in the method section, however it would be helpful if this information to be integrated into the figure legend as well. Additionally, the authors provide individual data points for some but not all bar graphs (eg Figure 1D).

      We will consider including the statistical information in the figure legend, provided there is not a maximum length for figure legends. In the case of figure 1D, data points are not shown because of how the food intake was calculated: as an average per cage instead of per animal (included in the materials and methods). We therefore do not consider it useful to show the datapoints in the final version of the manuscript, but will provide them for the reviewer.

      Cell counts per AU is a rather unorthodox unit. With a representative selection of tissue for each animal, the authors could avoid the need to normalize to the DG length and may be able to extrapolate an estimate of cell counts for the entire DG instead.

      Thanks for the suggestion. Our arbitrary units (AU) were in fact already equivalent to cells per mm of DG, and we have updated our graphs to reflect this.

      In Figure 4D, the authors highlight a few NSC with arrowheads. At a quick glance this is rather confusing as it appears that the authors only counted 3 NSCs in each picture. It may be a better option to show a zoomed in picture to highlight an example of a representative NSC.

      Examples of representative NSCs are already shown in Fig 2. With this image, we intended to show a larger number of NSCs. We realise the arrows only pointed to some of them, making the message confusing. We will consider removing them from the figure in the revised version.

      In Supplementary Figure S6, the authors should complement the quantification of the nuclei with representative images.

      We will include representative images in Figure S6.

      For the daytime IF, did the authors assess weights, food intake, RER as well liver/fat measurements similar to night-time IF? If so, this data should be provided in the supplement.

      We do have data for the daytime IF in the metabolic cages, which was taken from mice housed in groups (during the preliminary phase of our study). We also have the weight and data on neurogenesis, which we will show as a supplement.

      Reviewer #3 (Significance (Required)):

      The authors are commended for compiling a manuscript on what is commonly considered 'negative data', that, at the same time, are also contradicting independent reports on the effects of IF on neurogenesis. The studies outlined in this manuscript are comprehensive and mostly well designed. Given the broad, growing interest in dietary restriction as an aging intervention the study is timely.

      We thank the reviewer for the positive assessment of the significance of our work.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, Gabarró-Solanas et al. tested the effect of intermitted fasting (IF, every-other-day fasting) on adult neural stem cells and neurogenesis. They demonstrate that the paradigm they have used does not affect NSC activation or maintenance, and does also not promote neurogenesis. As previous reports showed increased neurogenesis with IF, the authors controlled for various parameters such as mouse strain, sex, and diet length. They also used different methods of identification of newborn neurons, such as tamoxifen-induced lineage-tracing versus birth-dating with thymidine-analogues to substantiate their findings.

      Major comments:

      This study is very well done with carefully designed and controlled experiments. The manuscript reads nicely and the data are presented in a clear way, making it easy to follow. The authors have done a "tour-de force" to rule out confounding factors that might influence their findings that IF does not affect NSCs nor neurogenesis.

      The claims and conclusions are supported by the data. The methods are clearly described and should allow to reproduce the data independently. The number of replicates (i.e. the number of mice analyzed) is impressive and statistical analysis is adequate.

      The major findings, namely that the chosen IF does not affect NSCs and neurogenesis is not in line with some previous studies. Despite a careful ruling out of potentially confounding factors (see also "significance" below), it remains unclear why other studies have found an increase in neurogenesis with IF. As each of these studies has some specific experimental design, it is difficult to judge these data in the context of previous data without going through all the details of the other studies. It would thus be a great help for the reader if the authors could provide a table or schematic, which lists the major parameters of each of these studies, such as detailed paradigm of IF, age of mice at start, sex, duration of the intervention, method of identification of NSCs and neurogenesis etc.

      This is a very good suggestion, and we had already created such a table. We, however, consider that it might be better suited for a review on the effects of IF on neurogenesis than for this work. We will include the table in our response to the reviewers together with our revised version.

      Two points that the authors have not discussed might also be worth mentioning in the discussion part:

      1.) The mice in the night-time IF were single caged, could there be a potential negative effect on neurogenesis that would mask the presumably beneficial effect of IF? Although the controls were also single caged, the stress of social isolation might play a role?

      The mice were only single caged for the metabolic phenotyping, but not for the neurogenic counts. We will make this clearer in the text. In any case, we do agree that stress might play a role and we will measure stress levels in the control and fasted mice and will include this data in the revised version.

      2.) The IF mice gained the same weight over time (Fig. S2), but had a ~20% reduction in overall calory intake. This would be explainable by a reduction in energy expenditure, but the overall activity was also not significantly changed (Fig. S1). Can the authors speculate why they reach the same weight with less calories?

      We also found this surprising and were expecting a reduction in the overall activity of the fasted mice. We do not have an explanation for this discrepancy, but perhaps stress levels might explain part of it (we will check stress levels in the revised version). We will also look at whether energy expenditure and activity levels changed over time.

      Minor comments:

      1.) It would be nice to replace the arbitrary units (AU) in the graphs were this is used (e.g. Fig. 2F, 3C, 4B, D and F etc) to the actual number of cells per a certain µm DG, so that the number of cells can be put in context and compared between the figures.

      Yes, our AU already corresponded to mm and we will update our figures accordingly.

      2.) Fig 3 D: can the authors also show the Ki67 channel to illustrate how it looks after a 3 month IF?

      We find it does not help much, as Ki67+ cells are mostly IPCs and that data is already shown in Fig. 4A. We will nevertheless include the image in our response to the reviewers together with our revised version.

      3.) Fig.4E: the NeuN staining looks strangely interrupted, this might be due to tile-stitching? In that case, it would be better to either only show one segment or to try to get a better stitching algorhythm.

      It is indeed because of the tile-stitching and uneven illumination. However, this did not affect the counts, as already discussed in the response to reviewer #3 (major point #2).

      4.) Fig.6 D shows a minus axis in Y-axis, this should only been shown from 0 to positive values, as it is a percentage of cells and cannot be negative.

      True, thanks for spotting this. We will correct the graphs in the revised version.

      5.) Fig.6 B: the same problem with the NeuN staining as mentioned under point 3. This should be improved.

      As with point 3, the stitching did not affect the quantification. We find it more accurate to show the image with the stitching, as that was the one used for quantification. We will provide a new picture with lower magnification to better show the quality of the staining.

      6.) Fig. S6B: maybe add a comment in the result part or in the figure legend that a 10 day chase after an EdU pulse is not the classical protocol to look at mature NeuN positive neurons. But apparently enough newborn neurons were already NeuN positive for this quantification.

      We fully agree 10 days is not the standard for neuronal identification. We did find neurons after the 10-day chase but in low numbers. We will add a comment in the text of the revised version to clarify this.

      7.) The authors refer to personal communications with M. Mattson and S. Thuret to underline that circadian disruption is not enough to explain the differences (line 367 onwards). Can they refer the reader to published data instead?

      While the results are published in their papers, the methods did not specify the time at which the food was added/removed for the IF protocol. That is why we refer to personal communication.

      Further showing that disruption of circadian rhythms is not enough to explain the difference in outcome of the IF protocol, we will show the data for the 1-month daytime IF, which again does not increase adult neurogenesis (reviewer #3, minor point #6).

      Reviewer #4 (Significance (Required)):

      Given the great interest in the seemingly positive effects on health of IF in general, and also for increasing neurogenesis, it is important to better understand the mechanism of this intervention. The study by Gabarró-Solanas et al. clearly demonstrates that IF is not a universal, "works all the time" way of increasing neurogenesis. The study is very well done, with well controlled and measured parameters. It shows that a physiological interference such as IF might depend on many factors and might be less robust across laboratories than anticipated. This study is a very good example that all the details of the experimental settings need to be taken into consideration and are ideally reported with every IF study. It is also a good example how to follow up "no effect" data in a way that they are conclusive.

      The significance of this study is to point out that IF as a strategy to increase neurogenesis needs to be reconsidered. It raises the questions how IF can be beneficial in some studies and not in others, asking for more experiments to better understand the detailed mechanisms of IF action. In a systematic approach, this study rules out some of the potentially confounding factors and shows that at least with the chosen IF paradigm, these factors are not the reason for not seeing increased neurogenesis. The study is thus of clear interest for the neurogenesis field and will also need to be considered by the broader field of IF research, although it speaks against the beneficial effects of IF. It might have the potential to bring together the different study authors who did or did not see increased neurogenesis with IF and discuss together the non-published details of their study design to advance the field.

      We thank the reviewer for the positive assessment of our work and for acknowledging its importance for the broader field of IF research.

      List of references used in the response to reviewers:

      Anson, R. M. et al. Intermittent fasting dissociates beneficial effects of dietary restriction on glucose metabolism and neuronal resistance to injury from calorie intake. Proceedings of the National Academy of Sciences 100, 6216–6220 (2003).

      Bok, E. et al. Dietary Restriction and Neuroinflammation: A Potential Mechanistic Link. International Journal of Molecular Sciences 20, 464 (2019).

      Cignarella, F. et al. Intermittent Fasting Confers Protection in CNS Autoimmunity by Altering the Gut Microbiota. Cell Metabolism 27, 1222-1235.e6 (2018).

      Dai, S. et al. Intermittent fasting reduces neuroinflammation in intracerebral hemorrhage through the Sirt3/Nrf2/HO-1 pathway. Journal of Neuroinflammation 19, 122 (2022).

      Dias, G. P. et al. Intermittent fasting enhances long-term memory consolidation, adult hippocampal neurogenesis, and expression of longevity gene Klotho. Mol Psychiatry 1–15 (2021).

      Goodrick, C. L., Ingram, D. K., Reynolds, M. A., Freeman, J. R. & Cider, N. Effects of intermittent feeding upon body weight and lifespan in inbred mice: interaction of genotype and age. Mechanisms of Ageing and Development 55, 69–87 (1990).

      Gudden, J., Arias Vasquez, A. & Bloemendaal, M. The Effects of Intermittent Fasting on Brain and Cognitive Function. Nutrients 13, 3166 (2021).

      Lee, J., Seroogy, K. B. & Mattson, M. P. Dietary restriction enhances neurotrophin expression and neurogenesis in the hippocampus of adult mice. Journal of Neurochemistry 80, 539–547 (2002).

      Rangan, P. et al. Fasting-mimicking diet cycles reduce neuroinflammation to attenuate cognitive decline in Alzheimer’s models. Cell Reports 40, 111417 (2022).

      Roberts, L. D. et al. The 5:2 diet does not increase adult hippocampal neurogenesis or enhance spatial memory in mice. 2022.10.03.510613 BioRxiv Preprint (2022).

      Song, M.-Y. et al. Energy restriction induced SIRT6 inhibits microglia activation and promotes angiogenesis in cerebral ischemia via transcriptional inhibition of TXNIP. Cell Death Dis 13, 449 (2022).

      Urbán, N. et al. Return to quiescence of mouse neural stem cells by degradation of a proactivation protein. Science 353, 292–295 (2016).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Gabarro-Solanas et al. question the suitability of IF (Intermittent fasting - non-pharmacological strategy to counteract ageing, which has been previously shown to increase the number of adult-born neurons in the dentate gyrus of mice) as a pro-neurogenic intervention, since IF treatment did not stimulate adult hippocampal neurogenesis, neither at the stem cell level nor on immature and/or dividing neurons. The Authors used a tamoxifen inducible transgenic model (Glast-CreERT2;RYFP mice) to trace neural stem cell lineage and found that IF did not enhance neural stem cell proliferation, nor the abundance of immature, DCX+ neurons. Three-months of IF failed to increase the number of new adult-born neurons (NeuN+/YFP+), while one month of IF significantly reduced the number of new adult-born neurons.

      The study appers technically sound, including many different approaches in order to reach its conclusions. For instance, tamoxifen has been reported to impair various physiological processes, including neurogenesis (Smith et al., 2022), and most studies on adult hippocampal neurogenesis use the C57BL/6J strain of mice; hence, the use of Tamoxifen or that of the GlastCreERT2;RYFP model may have underscored these observations. However, to account for this potentially confounding factor, the Authors characterised the effect of their IF treatment in C57BL/6j mice, also reporting no evident effects of IF as a pro-neurogenic intervention. I think the study was carefully planned and the analyses well done. Several possible variables were considered, including sex, labelling method, strain, tamoxifen usage or diet length. Several controls were performed in other organs and tissues (liver, fat) to establish the fasting protocol and to check its effects. Data are presented in a clear way. Quality of images is high level. In general, it appears as a highly reliable paper reaching an authoritative conclusion for the absence of effect of IF on adult neurogenesis.

      Major comments:

      I think that the key conclusions are convincing and no further experiments are required. The methods are presented in such a way that they can be reproduced, and the experiments adequately replicated with proper statistical analysis.

      Minor comments:

      Prior studies are referenced appropriately, both regarding the IF protocols and the adult neurogenesis modulation. Line 288 - a reference is incomplete (Dias); integrate with: (Dias et al., 2021) There is one concept that is not expressed in the manuscript. Maybe it is not strictly necessary, but I think can be useful to mention it here. It is the fact that most information currently available strongly indicates that adult neurogenesis in humans is not present after adolescence. Of course the research described here is carried out on mice, and in the manuscript it is stated many times that adult hippocampal neurogenesis is strongly decreasing with age, also due to age-related stem cell depletion. Yet, it seems that in humans the exhaustion of such a process can start after adolescence. We know that a sort of controversy is currently present on this subjects, because DCX+ neurons can be detected in adult and old human hippocampi. Yet, it is also clear that there is no substantial cell division (stem cells are depleted) to sustain such hypothetical neurogenesis. Hence, it has been hypothesized that non-newlyborn, "immature" neurons can persist in the absence of cell division, as it has been well demonstrated in the cerebral cortex (see La Rosa et al., 2020 Front Neurosci; Rotheneichner et al., 2018, Cereb Cortex). This point can be important in the case someone want to use dietary approached such as IF (or any other pharmacological treatment) to stimulate neurogenesis in humans.

      Significance

      The significance of this study relies on the fact that adult neurogenesis field (AN) has been often damaged by the search of "positive" results, aiming at showing that AN does occur "always and everywhere" and that most internal/external stimuli do increase it. This attitude created a bias in the field, persuading many scientists that a result in AN is worthy of publication (or of high impact factor publication) only when a positive result is found.

      Personally, I found particularly meaninful the last sentences of the Discussion (reported below), which might seem "off topic" in a research paper, while - I think - underline the real significance of the manuscript: "In addition, publication bias might be playing a role in skewing the literature on fasting and neurogenesis towards reporting positive results.

      In some reviews, even studies reporting no effect are cited as evidence for improved neurogenesis upon IF. Reporting of negative results, especially those challenging accepted dogmas, and a careful and rigorous evaluation of the publications cited in reviews are crucial to avoid unnecessary waste of resources and to promote the advancement of science."

      Reviewer field of expertise - keywords: adult neurogenesis, brain structural plasticity, non-newly born immature neurons, comparative neuroplasticity.

    1. we must acknowledge that our styles of teaching may need to change. Let's face it: most of us were taught in classrooms where styles of teachings reflected the hotion of a single norm of thought and experience, which we were encouraged to believe was universal

      This quote addresses our society's ever-changing nature, and the need for our teaching methods to change as a result. The way that our current educational institutions are structured are a sort of "one-size-fits all". However, it only fits a select group and does not take into account that people think and learn differently.

    2. we must acknowledge that our styles of teaching may need to change. Let's face it: most of us were taught in classrooms where styles of teachings reflected the hotion of a single norm of thought and experience, which we were encouraged to believe was universal. This has been just as true for nonwhite teachers as for white teachers. Most of us learned to teach emulating this model.

      I agree with the author that if the next generation can be made to understand and change perceptions, it will require teachers and parents to show respect for different groups in society by example, and I think it is beneficial for children to learn about groups that are different from their own social class. Changing teaching styles is also an important point, as mentioned in the article that there needs to be different learning styles for students, so teachers need to learn more diverse teaching styles to improve the quality of teaching.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Marta Sanvicente-García et al and colleague developed a comprehensive and versatile genome editing web application tool and a nextflow pipeline to give support to gene editing experimental design and analysis.

      The manuscript is well written and all data are clearly shown.

      While I did not tested extensively, the software seems to work well and I have no reason to doubt the authors' claims.

      I usually prefer ready to use web applications like outknocker, they are in general easier to use for rookies (it would be good if the author could cite it, since it is very well implemented) but the nextflow implementation is anyway well suited.

      We have been able to analyze the testing dataset that they provide, but we have tried to run it with our dataset and we have not been able to obtain results. We have also tried to run it with the testing dataset of CRISPRnano and CRISPResso2 without obtaining results. The error message has been in all the cases: “No reads mapping to the reference sequence were found.”

      Few minor points:

      Regarding the methods to assess whether the genome editing is working or not, I would definitely include High Resolution Melt Analysis, which is by far the fastest and probably more sensitive amongst the others.

      Following the Reviewer 1 suggestion, we have added this technique in the introduction: “Another genotyping method that has been successfully used to evaluate genome editing is high-resolution melting analysis (HRMA) [REFERENCE]. This is a simple and efficient real-time polymerase chain reaction-based technique.”

      Another point that would important to taclke is that often these pipelines do nto define the system they are working with (eg diploid, aploid vs etc). This will change the number of reads needed ato unambigously call the genotypes detected and to perform the downstream analysis (the CRISPRnano authors mentioned this point).

      In the introduction, it is already said: " it is capable of analyzing edited bulk cell populations as well as individual clones". In addition, following this suggestion we have added in the help page of CRISPR-A web application and in the documentation of the nextflow pipeline a recommended sample coverage to orient the users on that.

      I am also wondering whether the name CRISPR-A is appropriate since someone could confuse it with CRISPRa.

      CRISPR-A is an abbreviation for CRISPR-Analytics. Even if it is true that it can be pronounced in the same way that CRISPRa screening libraries, it is spelled differently and would be easily differentiated by context.

      CROSS-CONSULTATION COMMENTS

      Reviewer 2 made an excellent work and raised important concerns about the software they need to be addressed carefully.

      In the meantime we had more time to test the software and can confirm some of the findings of Reviewer 1:

      1) We spent hours running (unsuccessfully) CRISPR A on Nextflow. The software does not seem to run properly.

      2) No manual or instruction can be found on both their repositories (https://bitbucket.org/synbiolab/crispr-a_nextflow/

      https://bitbucket.org/synbiolab/crispr-a_figures/)

      We have added a readme.md file to both repositories and we hope that with the new documentation the software can be downloaded and run easily. We have also added an example test in CRISPR-A nextflow pipeline to facilitate the testing of the software. Currently, the software is implemented in DLS1 instead of DLS2, making it impossible to be run with the latest version of nextflow. We are planning to make the update soon, but we want to do it while moving the pipeline to crisprseq nf-core pipeline to follow better standards and make it fully reproducible and reusable.

      Few more points to be considered:

      • UMI clustering is not proper terminology. Barcode multiplexing/demultiplexing (SQK-LSK109 from Oxford Nanopore).

      We have added more details in the methods section “Library prep and Illumina sequencing with Unique Molecular Identifiers (UMIs)” to clarify the process and used terminology: “Uni-Molecular Identifiers are added through a 2 cycles PCR, called UMI tagging, to ensure that each identifier comes just from one molecule. Barcodes to demultiplex by sample are added later, after the UMI tagging, in the early and late PCR.”

      We had already explained the computational pipeline through which these UMIs are clustered together to obtain a consensus of the amplified sequences in “CRISPR-A gene editing analysis pipeline” section in methods:

      “An adapted version of extract_umis.py script from pipeline_umi_amplicon pipeline (distributed by ONT https://github.com/nanoporetech/ pipeline-umi-amplicon) is used to get UMI sequences from the reads, when the three PCRs experimental protocol is applied. Then vsearch⁴⁸ is used to cluster UMI sequences. UMIs are polished using minimap2³² and racon⁴⁹ and consensus sequences are obtained using minialign (https://github.com/ocxtal/minialign) and medaka (https://github.com/nanoporetech/medaka).”

      We also have added the following in “CRISPR-A gene editing analysis pipeline” methods section to help to understand differences between the barcodes that can be used: “In case of working with pooled samples, the demultiplexing of the samples has to be done before running CRISPR-A analysis pipeline using the proper software in function of the sequencing used platform. The resulting FASTQ files are the main input of the pipeline.”

      Then, SQK-LSK109 from Oxford Nanopore is followed through the steps specified in methods: “The Custom PCR UMI (with SQK-LSK109), version CPU_9107_v109_revA_09Oct2020 (Nanopore Protocol) was followed from UMI tagging step to the late PCR and clean-up step.”

      Finally, we want to highlight that, as can be seen in methods as well as in discussion, UMIs are used to group sequences that have been amplified from the same genome and not to identify different samples: “Precision has been enhanced in CRISPR-A through three different approaches. [...] We also removed indels in noisy positions when the consensus of clusterized sequences by UMI are used after filtering by UBS.” As well as in results (Fig. 5C).

      • Text in Figure 5 is hard to read.

      We have increased the letter size of Figure 5.

      • They should test the software based on the ground truth data

      We have added a human classified dataset to do the final benchmarking. And we can see that for all examined samples CRISPR-A has an accuracy higher than 0.9. As has been shown in the figure with manual curated data, CRISPR-A shows good results in noisy samples using the empiric noise removal algorithm, without need of filtering by edition windows.

      • The alignment algorithm is not the best one, I think minimap2 would be better for general purpose (at least it work better for ONT).

      As can be seen in figure 2A, minimap is one of the alignment methods that gives better results for the aim of the pipeline. In addition, we have tuned the parameters (Figure 2B) for a better detection of CRISPR-based long deletions, which can be more difficult to report in a single open gap of the alignment.

      • The minimum configuration for installation was not mentioned (for their Docker/next flow pipeline).

      Proper documentation to indicate the configuration requirements for installation has been added to the readme.md of the repository·

      • Fig 2: why do they use PC4/PC1?

      Principal Component Analysis is used to reduce the number of dimensions in a dataset and help to understand the effect of the explainable variables, detect trends or samples that are labeled in incorrect groups, simplify data visualization… Even PC4 explains less variability than PC2 or PC3, this helps us to understand and better decipher the effect of the 4 different analyzed parameters even if the differences are not big. We have decided to include as a supplementary figure other PCs to show these.

      • There are still typos and unclear statements thorughout the whole manuscript.

      One more drawback is that the software seems to only support single FASTQ uploading (or we cannot see the option to add more FASTQ).

      In the case of paired-end reads instead of single-end reads, in the web application, these can be selected at the beginning answering the question “How should we analyze your reads? Type of Analysis: Single-end Reads; Paired-end Reads”. In the case of the pipeline, now it is explained in the documentation how to mark if the data is paired-end or single-end. It has to be indicated in “input” and “r2file” configuration variables.

      In the case of multiple samples, and for that reason multiple FASTQ files, there is the button to add more samples in the web application. In the pipeline, multiple samples can be analyzed in a single run by putting all together in a folder and indicating it with variable “input”.

      Since usually people analyze more than one clone at the time (we usually analyze 96 clones together) this would mean that I have to upload manually each one of them.

      All files can be added in the same folder and analyzed in a single run using the nextflow pipeline. Web application has a limit of ten samples that can be added clicking the button “Add more”.

      Also, the software (the webserver, the docker does not work) works with Illumina data in our hands but not with ONT.

      This should be clarified in the manuscript.

      If a fastq is uploaded to CRISPR-A, the analysis can be done even if we haven't specifically optimized the tool for long reads sequencing platforms. We have checked the performance of CRISPR-A with CRISPRnano nanopore testing dataset and we have succeeded in the analysis. See results here: https://synbio.upf.edu/crispr-a/RUNS/tmp_1118819937/.

      Summary of the results:

      Sample

      CRISPRnano

      CRISPR-A

      'rep_3_test_800'

      42.60 % (-1del); 12.72 % (-10del)

      71% (-1del);

      16% (-10del)

      – 36 (logo)

      'rep_3_test_400'

      37.50 % (-1del); 15.63 % (-10 del)

      65% (-1del);

      28% (-10del)

      – 38 (logo)

      'rep_1_test_200'

      39.29 % (-1del); 8.33 % (-17del)

      10del; 17del; 1del

      'rep_1_test_400'

      80.11 % (-17 del)

      del17; del20; del18; del16;del 16

      'rep_0_test_400'

      80.11% (-17 del)

      del17; del20; del 18; del16; del16

      'rep_0_test_200'

      71.91% (-17 del)

      del17; del18

      As we can see from these exemple, CRISPR-A reports all indels in general without classifying them as edits or noise. Since nanopore data has a high number of indels as sequencing errors the percentages of CRISPR-A are not accurate. Eventhat, CRISPR-A reports more diverse outcomes, which are probably edits, than CRISPRnano.

      Therefore, we have added the following text in results:

      “Even single-molecule sequencing (eg. PacBio, Nanopore..) can be analyzed by CRISPR-A, targeted sequencing by synthesis data is required for precise quantification.”

      Reviewer #1 (Significance (Required)):

      As I mentioned above I think this could be a useful software for those people that are screening genome editing cells. Since CRISPR is widely used i assume that the audience is broad.

      There are many other software that perform similarly to CRISPR-A but it seems that this software adds few more things and seems to be more precise. It is hard to understand if everything the author claims is accurate since it requires a lot of testing and time and the reviewing time is of just two weeks. But 1) I have no reason to doubt the authors and 2) the software works

      Broad audience (people using CRISPR)

      Genetics, Genome Engineering, software development (we develop a very similar software), genetic compensation, stem cell biology

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      CRISPR-Analytics, abbreviated as CRISPR-A, is a web application implementing a tool for analyzing editing experiments. The tool can analyze various experiment types - single cleavage experiments, base editing, prime editing, and HDR. The required data for the analysis consists of NGS raw data or simulated data, in fastq, protospacer sequence and cut site. Amplicon sequence is also needed in cases where the amplified genome is absent from the genome reference list. The tool pipeline is implemented in NextFlow and has an interactive web application for visualizing the results of the analysis, including embedding the results into an IGV browser.

      The authors developed a gene editing simulation mechanism that enables the user to assess an experiment design and to predict expected outcomes. Simulated data was generated by SimGE over primary T-cells. The parameters and distributions were also fitted for 3 cell lines to make it more generalized (Hek293, K562, and HCT116). The process simulated CRISPR-CAS9 activity and the resulting insertions, deletions, and substitutions. The simulation results are then compared to the experimental results. The authors report the Jensen-Shannon (JS) divergence between the results. The exact distributions that served as input to the JS are not well defined in the manuscript (see below).

      To clarify the used distributions in the JS divergence calculation, we have changed the following piece of text in section “Simulations evaluation” of methods:

      “ Afterward, we tested the performance on the fifth fold, generating the simulated sequences with the same target and gRNA as the samples that belong to the fifth fold, in order to calculate the distance between these. The final validation, with the mean parameters of the different training interactions, was performed on a testing data set that was not used in the training. Validation was done with samples that had never taken place in the training process. Jensen distance is used to compare the characterization of real samples and simulated samples since this is the explored distance that differentiates better replicates among samples. In order to obtain the different distributions, the T cell data, including 1.521 unique cut sites, was split into different datasets based on the different classes: deletions, insertions and substitutions. For each of these classes, giving as input the datasets with only that class, we obtained the distribution for size and then for position of indels. The same was done for the other three cell lines: K562, HEK293 and HCT116, which included 96 unique cut sites, with three replicates each. The whole datasets (with 1521 and 96 unique cut sites) were split into five-folds (4 for training and one for test) and validation, in order to train and validate the simulator. Using the parameters obtained during the training-test iterations (the average value of the 5 iterations), we generate simulated sequences with the same target and gRNA as the samples that are assigned to the test subset to calculate the Jensen-Shannon (JS) divergence between the simulated and real samples of that subset. Finally, the same was performed for validation. The input for the distance calculations were the generated simulated subset and its real equivalent (same target and gRNA) distributions of the classes. ”

      The authors also report an investigation of different alignment approaches and how they may affect the resulting characterization of editing activity.

      The authors examine three different approaches to increase what they call "edit quantification accuracy" (aka, in a different place - "precise allele counts determination" - what is this???): (1) spike-in controls (2) UMI's and (3) using mock to denoise the results. See below for our comments about these approaches.

      Moreover, the authors developed an empirical model to reduce noise in the detection of editing activity. This is done by using mock (control), and by normalization and alignment of reads with indels, with the notion and observation that indels that are far from the cut site tend to classify as noise.

      The authors then perform a comparison between 6 different tools, in the context of determining and quantifying editing activities. One important comparison approach uses manually curated data. However - the description of how this dataset was created is far from being sufficiently clear. The comparison is also performed for HDR experiment type, which can be compared only to 2 other tools.

      We have changed alleles by editing outcomes in the title section “Three different approaches to increase precise editing outcomes counts determination” trying to be more clear.

      There is already a section in methods “Manual curation of 30 edited samples” explaining how the manual curation was done.

      We see the potential contribution aspects of the paper to be the following:

      1. NextFlow pipeline implementation is an important engineering contribution. Same is true for the interactive web application
      2. The option to simulate an experiment to assess it is a nice feature and can help experiment design
      3. Identification of amplicons when not provided as input
      4. CRISPR-A seeks substitutions along the entire amplicon sequence and is less dependent on the quantification window and on the putative cutsite
      5. Analysis of the difference, in edit activity, comparing different cell lines
      6. CRISPR-A supports the use of UMIs
      7. Interesting sequence pattern insights - like "...found certain patterns associated with low diversity outcomes: free thymine or adenine at the 3' nucleotide upstream of the cut site that leads to insertions of the same nucleotide, a free cytosine at the same place that leads to its loss, and strong micro-homology patterns that lead to long deletions " We further comment on the soundness of these contributions in our comments below and on their significance in our comments related to the general potential significance of the paper.

      Major comments:

      • Upon attempting to run an analysis from the web interface (https://synbio.upf.edu/crispr-a) and using: fastq of Tx and mock (control), the human genome and the gRNA sequence provided as input for the protospacer field, our run was not successful. In fact the site crashed with no interpretable error message from CRISPR-A. We have improved the error handling together with the explanations in the help page, where you will find a video. Hopefully these improvements will avoid unexpected crashings.

      • Moreover, there should be more clear context. There is no information regarding the type of experiments that can be analyzed with the tool. We figure it is multiplex PCR and NGS but can the tool also be used for GUIDESeq, Capture, CircleSeq etc.? Experiments that could be analyzed are specified in Results: “CRISPR-A analyzes a great variety of experiments with minimal input. Single cleavage experiments, base editing (BE), prime editing (PE), predicted off-target sites or homology directed repair (HDR) can be analyzed without the need of specifying the experimental approach.” We have also specified this in the nextflow pipeline documentation as well as in the web application help page.

      • No off target analysis. Only on-target The accuracy of the tool allows checking if edits in predicted off-target sites are produced, this being an off-target analysis with some restrictions, since just variants of the predicted off-target sites are assessed. Translocations or other structural off-targets will not be detected by CRISPR-A since the input data analyzed by this tool are demultiplexed amplicon or targeted sequencing samples.

      • No translocations and long/complex deletions The source of used data as input does not allow us to do this. There are other tools like CRISPECTOR available for this kind of analysis. We have added this to supplementary table 1.

      • We view the use of a mock experiment as control as a must for any sound attempt to measure edit activity. This is even more so when off-target events need to be assessed (any rigorous application of GE, certainly any application aiming for clinical or crop engineering purposes). We therefore think that all investigation of other approaches should be put in this context. We agree with the necessity of using negative controls to assess editing. For that reason we have included the possibility of using mocks in the quantification. In addition, there are few tools that include this functionality.

      • It's a nice feature to have simulated data, however, it is not a good approach to rely on it. As can be seen in the manuscript we highlight the support that simulations can give without pretending to substitute experimental data by just simulated data. Simulated data has been useful in the development and benchmarking of CRISPR-A, but we are aware of the limitations of simulations. Here some examples from the manuscripts explaining how we have used or can be used simulated data:

      “Analytical tools, and simulations are needed to help in the experimental design.”

      “simulations to help in design or benchmarking”

      “We developed CRISPR-A, a gene editing analyzer that can provide simulations to assess experimental design and outcomes prediction.”

      “Gene editing simulations obtained with SimGE were used to develop the edits calling algorithm as well as for benchmarking CRISPR-A with other tools that have similar applications.”

      Even simulated data has been useful for the development and benchmarking of CRISPR-A, we have also used real data and human validated data.

      • In p7 the authors indicate the implementation of three approaches to improve quantification. They should be clear as to the fact that many other tools and experimental protocols are also using these approaches. for example, ampliCan, CRipresso2 and CRISPECTOR all take into account a mock experiment run in parallel to the treatment. Even in page 7 (results) we don’t mention the other tools that also use mocks for noise correction, we detail this information in Supplementary Table 1. CRISPResso2 was not included since they can run mocks in parallel but only to compare results qualitatively, i.e. there is not noise reduction in their pipeline. It has been added to the table.

      • Figure1: ○ The figure certainly provides what seems to be a positive indication of the simulations approach being close to measured results. Much more details are needed, however, to fully understand the results.

      We have added more details.

      ○ Squema = scheme ??

      We have changed the word “schema” by diagram.

      ○ What was the clustering approach?

      As is said in the caption of Figure 1 the clustering is hierarchical: “hierarchical clustering of real samples and their simulations from validation data set.” And we have added that “The clustering distance used is the JS divergence between the two subsets.”

      ○ What is the input to the JS calculation? What is the dimension of the distributions compared? These details need to be precisely provided.

      The distribution has two dimensions, sizes and counts or positions and counts.

      As said before, to clarify the used distributions in the JS divergence calculation, we have changed the following piece of text in section “Simulations evaluation” of methods:

      “ Afterward, we tested the performance on the fifth fold, generating the simulated sequences with the same target and gRNA as the samples that belong to the fifth fold, in order to calculate the distance between these. The final validation, with the mean parameters of the different training interactions, was performed on a testing data set that was not used in the training. Validation was done with samples that had never taken place in the training process. Jensen distance is used to compare the characterization of real samples and simulated samples since this is the explored distance that differentiates better replicates among samples. In order to obtain the different distributions, the T cell data, including 1.521 unique cut sites, was split into different datasets based on the different classes: deletions, insertions and substitutions. For each of these classes, giving as input the datasets with only that class, we obtained the distribution for size and then for position of indels. The same was done for the other three cell lines: K562, HEK293 and HCT116, which included 96 unique cut sites, with three replicates each. The whole datasets (with 1521 and 96 unique cut sites) were split into five-folds (4 for training and one for test) and validation, in order to train and validate the simulator. Using the parameters obtained during the training-test iterations (the average value of the 5 iterations), we generate simulated sequences with the same target and gRNA as the samples that are assigned to the test subset to calculate the Jensen-Shannon (JS) divergence between the simulated and real samples of that subset. Finally, the same was performed for validation. The input for the distance calculations were the generated simulated subset and its real equivalent (same target and gRNA) distributions of the classes. ”

      ○ What clustering/aggregation approach did the authors use here (average dist, min dist, dist of centers?)

      Hierarchical clustering.

      ○ 5 pairs were selected out of how many? Call that number K.

      We have 100 samples in the validation set. Following the suggestion of indicating the total number of samples in the testing set, we have added this information to the figure caption.

      ○ What does the order of the samples in 1C mean? Is 98_real closer to 22_sim than to 98_sim? If so then state it. If not - what is the meaning of the order? Furthermore - how often, over K choose 2 pairs does this mis-matching occur for the CRISPR-A simulator??

      Exactly, it is a hierarchical clustering, where samples are sorted by JS divergence. It was already stated in Results: “In addition, on top of comparing the distance between the experimental sample and the simulated, we have included two experimental samples, SRR7737722 and SRR7737698, which are replicates. These two and their simulated samples show a low distance between them and a higher distance with other samples.” As well as in Figure 1 caption: “For instance, SRR7737722 and SRR7737698, which cluster together, are the real sample and its simulated sample for two replicates.” Then, since these samples are replicates, its simulations will come from the same input and is expectable to find low distance between these two real samples as well as between both of them and their simulation. We have stated it in the discussion.

      • "From the characterized data we obtained the probability distribution of each class" (page 3) - How is this done? how many guides? how many replicates? what is class? where do you elabore regarding it? how you obtain the distributions? More details of the methods need to be provided. Added in methods.

      • The 96 samples used for development here - where are they taken from? This should be indicated in the first time these samples are mentioned. Namely - bottom of P6 Added: “The 96 samples, from these cell lines, are obtained from a public dataset BioProject PRJNA326019.”

      • CRISPECTOR is not mentioned in the comparison in the section: "CRISPR-A effectively calls indels in simulated and edited samples" (Table S2). Is there a specific reason for having left it out? CRISPECTOR, as well as ampliCan, is not in Table S2, since in this table is shown detailed data from Figure 2. CRISPECTOR is compared with CRISPR-A in figure 5, where the different approaches to enhance precision, like using a negative control, are explored.

      • In the section "Improved discovery and characterization of template-based alleles or objective modifications" - part of the analysis was made over simulated data and then over real data. The authors state "it is difficult to explain the origin of these differences...". Thus, needs to be investigated in more detail ... :) (P5) Moreover - the performance over real data is, at the end of the day, the more interesting one for comparison purposes. We have added this sample to the human validated dataset to understand better what was happening in this case and the results and pertinent discussion have been added in the manuscript: “CRISPResso2 is detecting a 2% more of reads classified as WT. These 2% correspond with the percentage classified as indels by CRISPR-A. In total, the percentage difference between CRISPResso2 and CRISPR-A template-based class is 0.6%, higher in CRISPR-A. CRISPR-A percentage is closer to the ground truth data than CRISPResso2.”

      • We found no explanation of "spike-in"/"spike experimental data" across the entire article. There is some general language about lengths but the scheme is still totally unclear. We have indicated in methods section when we were talking about the spike-in controls.

      • Description of the 96 gRNAs? Is this data from REF26? If so - where do you state this? If so - how do the methods described herein avoid the unique characteristics of the data of REF26? We have added the reference: “The 96 samples, from these cell lines, are obtained from a public dataset BioProject PRJNA326019.” In addition, there are other sources of data, simulations and now even human validated data.

      • "distance between the percentage of microhomology mediated end-joining deletions of samples with the same target was calculated and the mean of all these distances was used to reduce the information of the 96 different targets to a single one." (P6) What is the exact calculation used? which distance? How was clustering performed? What is the connection for gene expression? The used distance was euclidean distance and the clustering was performed using hierarchical clustering. We have added this information to the manuscript. Regarding the connection of gene expression, we are exploring the correlation of two phenotypes: the gene expression of the proteins differentially related with NHEJ and MMEJ pathways, and the gene editing landscape (indel patterns that are related with MMEJ and those that are more prone to be generated with NHEJ). We have tried to improve this explanation in the manuscript.

      • "we have fitted a linear model to transform the indels count depending on its difference in relation to the reference amplicon" (P7) - needs more explanation. Is this part of the pipeline? We have explained better how we have fitted the linear model in methods: “A linear regression model was fitted to obtain the parameters of Equation 1 using spike-in controls experimental data (original count, observed count and size of the change in the synthetic molecules). We have used the lm function from R. Parameter m in Equation 1 is equivalent to the obtained coefficient estimate of x which was 0.156 and n is the intercept (n=10). ”.

      The model is optionally used as part of the pipeline as explained at the end of section “CRISPR-A gene editing analysis pipeline” to correct amplification biases due to differences in amplicon size. Then, what is part of the pipeline is the use of this model to make the transformation of counts from the observed counts to the predicted original counts. This is done with Equation 1 and can be found in the pipeline (VC_parser-cigar.R).

      • What is it "...manually curated data set"? (page 8) This is explained in “Manual curation of 30 edited samples” in methods.

      • Section "CRISPR-A empiric model removes more noise than other approaches" - with what data were the comparisons performed? Moreover, how were the comparison criteria selected (efficiency and sensitivity)? The literature already used several approaches to compare data analysis tools for editing experiments. See for example ampliCan, Crispresso (1 and 2) and CRISPECTOR. Maybe the authors should follow similar lines. The data used in this comparison comes from the reference 26:“26. van Overbeek, M. et al. DNA Repair Profiling Reveals Nonrandom Outcomes at Cas9-Mediated Breaks. Mol. Cell 63, 633–646 (2016).We have added it to the manuscript.

      The values of efficiency and sensitivity were not used directly for the comparison. We wanted to firstly evaluate our own algorithm. For that we obtained the values of efficiency and sensitivity for the previous mentioned dataset. These values were chosen to firstly have an idea of firstly, how much noise the algorithm is able to detect, and secondly, how much of it is able to be reduced after the Tx vs M process. That established a framework of comparison in which we can then compare directly the reported percentage of edition of the different tools.

      Regarding the approaches used to compare data analysis tools for editing experiments, we are going to explain why we haven’t followed similar lines or how we have now included it:

      In the case of ampliCan, the comparison that they do is with a synthetic dataset with introduced errors:

      "synthetic benchmarking previously used to assess these tools (Lindsay et al. 2016), in which experiments were contaminated with simulated off-target reads that resemble the real on-target reads but have a mismatch rate of 30% per base pair".

      In CRISPResso2, they benchmarked the efficiency against an inhouse dataset but this dataset is not published. Finally, for the benchmarking of CRISPECTOR, a manual curated dataset is used as a standard: "Assessment of such classification requires the use of a gold standard dataset of validated editing rates. In this analysis, we define the validated percent indels as the value determined through a detailed human investigation of the individual raw alignment results". In this sense, we have added a human validated dataset to do something similar to complement the analysis that we had already done.

      In the end, we consider that simulated or synthetic datasets, as those used by ampliCan or CRISPResso2, does not capture the complete landscape of confounding events that can be detrimental to the analysis results. Similar limitations are found in the use of a gold standard dataset of validated editing rates, since the amount of reads or samples that can be validated by humans is not big since it is time consuming. In addition, humans can also make errors and have biases. Eventhogh, we have found very valuable talking into consideration adding a human validated dataset to complete our exploration.

      • In the section "CRISPR-A empiric model removes more noise than other approaches" the authors state, incorrectly, that CRISPECTOR only reports the percentage of editing activity per site (there is much more information reported in the HTML report, including the type of edit event detected - deletion, of various lengths, insertions, substitutions etc). (P8) We thank the reviewer for the observation, as indeed the state is incorrect. What we wanted to express is that with CRISPECTOR we cannot trace individually each of the called indels, as any sort of excel or file with this content is given in the output. Therefore we cannot investigate which events have been corrected. To be precise in our statement we changed this sentence to the following:

      “CRISPECTOR, although providing extensive information on the statistics and information about the indels, is not possible to track the reads along their pipeline, thus we cannot know which have been corrected and which have not.”

      • Section "CRISPR-A noise subtraction pipeline" describes a pretty naive method for noise subtraction (P12). Should be rigorously compared, for Tx vs Mock experiments, to CRISPECTOR and to CRISPResso2. In the section "CRISPR-A empiric model removes more noise than other approaches", we perform an exhaustive comparison with a dataset that contains 288 Mock Files vs 864 Tx files. This can be better appreciated in the, now included, figure Sup. 13A. CRISPResso2 was intentionally left out since their pipeline does not use a model to reduce noise but other approaches like reducing the quantification window.

      • "recalculated using a size bias correction model based on spike-in controls empiric data.." (P14). Where is the formula? The formula comes from Equation 1. Now it is correctly referenced.

      • Section "Noise subtraction comparison with ampliCan and CRISPECTOR" - fake mock was generated for comparison. We consider the avoidance of a Mock control in experiments designed to measure editing activity to not be best practice. It is OK to support this approach in CRISPR-A. However - the comparison to tools that predominantly work using a Mock control (including ampliCan and CRISPECTOR) should be done with actual Mock. Not with fake Mock .... (P15) We understand the claims of the reviewer for this point as the use of a “fake” mock may not be the best practice for general comparisons. Nevertheless here what we wanted to compare is the difference in the edition percentages using mock and not using it. Since to make a run for on-target data CRISPECTOR requires a mock, the only way to replicate the conditions of “no mock” was to use a synthetic file with the same characteristics of the treated files in terms of depth, but with no edition/noise events to avoid any correction outside this framework. The other run was made with the 288 real Mocks. This was a solution ad Hoc for CRISPECTOR, with ampliCan we used only real mock since they allow to make runs without a mock for on-target.

      We changed the word fake for synthetic in the Noise subtraction comparison with ampliCan and CRISPECTOR section:

      “As for CRISPECTOR, since it requires a mock file to perform on-target analysis, synthetic mock files were generated”.

      Minor comments:

      • "Also, most of these tools lack important functionalities like reference identification, clustering, or noise subtraction" - bold part incorrect for CRISPECTOR, although it is not aiming only for CRISPECTOR In supplementary table 1, it is already elucidated which are the functionalities that each tool has. We have also added more context to that statement to highlight the differences between different tools:

      “Even not all of them have the same missing functionalities, as can be seen in the Supplementary table 1, CRISPR-A is the only tool that can identifies the amplicon reference from in a reference genome, correct errors through UMI clustering and sequence consensus, correct quantification errors due to differences in amplicon size, and includes interactive plots and a genome browser representation of the alignment.”

      • "Same parameters and probability distributions were fitted for three other cell lines: Hek293, K562, and HCT11626, to make SimGE more generalizable and increase its applicability" (page 3) - how was fitted? It was fitted in the same way as the t-cell samples as specified in methods. We have detailed more methods explaining how SimGE is built.

      • What is the "nature of modification"? (P5) We have changed nature by type for a better understanding.

      • In the section "CRISPR-A effectively calls indels in simulated and edited samples" (P5) towards the end, the authors write that the CRISPR-A algorithm did not give good results for a few examples. They then state that this was corrected and then yielded good results. There is no explanation of what correction was done, if it was implemented in the code and how to avoid/detect it in further cases. The problem was that the used reference sequence was too short. There is no modification in CRISPR-A code, we have just used the whole amplicon reference sequence obtained with the amplicon reference identification functionality of CRISPR-A. We have tried to explain it better in the manuscript: “Once the reference sequence is corrected used is the one corresponding to the whole reference amplicon, obtained with CRISPR-A amplicon sequence discovery function, CRISPR-A shows a perfect edition profile”

      • Cell culture, transfection, and electroporation - explanation only for HEK293, what about the others? (P15) We already had explained it for HEK293 and for C2C12, that are the experiments done by use. In the case of the analysis of the three cell lines and 96 targets we reference the source of the data as this data was not produced in our lab.

      • Typos and unclear wording: ○ "obtention" (P8) → changed by obtaining

      ○ "mico" >> micro (P 7,10) → changed

      ○ "Squema" >> scheme (Fig.1) → changed

      ○ "decombuled" (P10) → changed by separated

      ○ "empiric" >> empirical (P8 and other places) → changed

      ○ "Delins" (P14) → this is not a typo, it is used to indicate that a deletion and insertion has take place (http://varnomen.hgvs.org/recommendations/DNA/variant/delins/)

      ○ "performancer" (P9) → Change to performance

      ○ Change word across all article - "edition" to "editing" → changed. In the case of edition windows it has been changed by quantification windows.

      ○ "...has enough precision to find" (P6) not related to "results" section → We have moved to discussion.

      • Comments on figures: ○ Fig. 2C:

      ■ No CRISPECTOR in the analysis

      It is not included because for on-target analysis this tool requires a mock control sample. For this reason, it is compared in Figure 5D, where samples using negative controls are compared, and in Figure 5E where all tools and their different analysis options are compared.

      ■ It is simulated data only

      Yes, it is. Comparison with real data is done in Figure 2D and 2E. And now we also have added a ground truth data in our comparisons obtained from human validation of the classification of more than 3,000 different reads.

      ■ It is not violin plot as mentioned in the description

      It is a violin plot, but in general there is not much dispersion of the data points making the density curves flat.

      ○ Fig 3A - Is it significant? Yes, it is. We have added this information in the caption of the figure.

      ○ Fig. 4:

      ■ A

      • Each row/column is a vector of 96 guides? No, as it is said in the caption of the figure, it is the “mean between the distances calculated for each of the 96 different targets.”

      • How is the replicate number decided? Is it a different experiment by date? What is separating between experiments? Rep numbers? All this information should be found in the referenced paper from which this dataset comes from as already referenced.

      ■ B - Differential expression:

      We have realized that the caption was not correct, missing the explanations for Fig. 4B and all the following ones moved to a previous letter.

      • How? did you measure RNA? It is already stated in methods that RNAseq data was obtained from SRA database and the analysis was done using nf-core/rnaseq pipeline: “RNAseq differential expression analysis of samples from BioProject PRJNA208620 and PRJNA304717 was performed using nf-core/rnaseq pipeline⁵².”

      • Is the observed data in the figure sufficiently strong in terms of P-value? Yes, at is it is highlighted in the plot with ** and ***. We have also added the p-value in the cation of the figure.

      • Where is the third cell-line? As mentioned in the text, we have just chosen the cell lines that show us higher differences in the the percentage of MMEJ: “HCT116 than in K562, which are the cell lines with the major and minor ratios of MMEJ compared with NHEJ, respectively”.

      ○ Fig.13 - There is no A and B as mentioned in the text

      We thank the reviewer for the observation as we mistakenly uploaded the wrong figure. We corrected it.

      Reviewer #2 (Significance (Required)):

      We repeat the aspects of contribution, as listed in the first part of the review, and comment about significance:

      • NextFlow pipeline implementation is an important engineering contribution. Same is true for the interactive web application

        Significant engineering contribution. Nonetheless, we were not able to run the analysis. So - needs to be checked.

      Hopefully now that the documentation is properly added to the repository it will be easier to run analysis.

      • The option to simulate an experiment to assess it is a nice feature and can help experiment design

        An important methodology contribution

      • Identification of amplicons when not provided as input

        Not important in the context of multiplex PCR and NGS measurement assays, as amplicons will be known. Not clear what other contexts the authors were aiming at.

      It is useful to save time, no need to look for the sequence of each amplicon and add it as input. Also, it can help to detect unspecific amplification, since all amplicons of the same genome can be retrieved from the discovery amplicon process. In addition, we have already found one example where this avoids getting incorrect results: “Once the reference sequence used is the one corresponding to the whole reference amplicon, obtained with CRISPR-A amplicon sequence discovery function, CRISPR-A shows a perfect edition profile”. We have added this to the discussion of the manuscript.

      • CRISPR-A seeks substitutions along the entire amplicon sequence and is less dependent on the quantification window and on the putative cutsite

        Importance/significance needs to be demonstrated

      In figure 3 are shown the results of template-based and substitutions detection. CRISPR-A is a versatile and agnostic tool for gene editing analysis. This means that it can be prepared for the analysis of gene editing of future tools, since the cut site or other elements of experiment design are not required. In addition, it has been shown that when a mock is used its performance is comparable to filtering by edition windows, avoiding the loss of edits when the cut site is slided.

      • Analysis of the difference, in edit activity, comparing different cell lines

        Significant contribution. However - the methods need to be much better explained and the results better described in order for this to be useful to the community.

      We have made an effort to try to be more clear in the description of the results.

      • CRISPR-A supports the use of UMIs

        Mildly significant technical contribution. However - only addresses on-target. Also addressing off-target would have been significant.

      The use of UMIs is something that has never been done before in this context. Sequencing biases are not taken into account and editing percentages are reported as observed. Being able to differentiate between different molecules at the beginning of the amplification sequence, allows a higher precision avoiding under or overestimation of each of the species in a bulk of cells.

      In the case of off-targets, can be for sure done using sequencing the predicted off-target sites. In addition, there are other methods, like GuideSeq that can be used to discover off-targets, but this kind of data is out of the scope of CRISPR-A. Even that, we are aware of the importance of being able to analyse off-targets when in a context of a broad analysis platform and we will take these into consideration when participating in the building of crisprseq pipeline from nf-core.

      • Interesting sequence pattern insights - like "...found certain patterns associated with low diversity outcomes: free thymine or adenine at the 3' nucleotide upstream of the cut site that leads to insertions of the same nucleotide, a free cytosine at the same place that leads to its loss, and strong micro-homology patterns that lead to long deletions "

        As stated - interesting.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      CRISPR-Analytics, abbreviated as CRISPR-A, is a web application implementing a tool for analyzing editing experiments. The tool can analyze various experiment types - single cleavage experiments, base editing, prime editing, and HDR. The required data for the analysis consists of NGS raw data or simulated data, in fastq, protospacer sequence and cut site. Amplicon sequence is also needed in cases where the amplified genome is absent from the genome reference list. The tool pipeline is implemented in NextFlow and has an interactive web application for visualizing the results of the analysis, including embedding the results into an IGV browser. The authors developed a gene editing simulation mechanism that enables the user to assess an experiment design and to predict expected outcomes. Simulated data was generated by SimGE over primary T-cells. The parameters and distributions were also fitted for 3 cell lines to make it more generalized (Hek293, K562, and HCT116). The process simulated CRISPR-CAS9 activity and the resulting insertions, deletions, and substitutions. The simulation results are then compared to the experimental results. The authors report the Jensen-Shannon (JS) divergence between the results. The exact distributions that served as input to the JS are not well defined in the manuscript (see below).

      The authors also report an investigation of different alignment approaches and how they may affect the resulting characterization of editing activity. The authors examine three different approaches to increase what they call "edit quantification accuracy" (aka, in a different place - "precise allele counts determination" - what is this???): (1) spike-in controls (2) UMI's and (3) using mock to denoise the results. See below for our comments about these approaches. Moreover, the authors developed an empirical model to reduce noise in the detection of editing activity. This is done by using mock (control), and by normalization and alignment of reads with indels, with the notion and observation that indels that are far from the cut site tend to classify as noise. The authors then perform a comparison between 6 different tools, in the context of determining and quantifying editing activities. One important comparison approach uses manually curated data. However - the description of how this dataset was created is far from being sufficiently clear. The comparison is also performed for HDR experiment type, which can be compared only to 2 other tools. We see the potential contribution aspects of the paper to be the following:

      1. NextFlow pipeline implementation is an important engineering contribution. Same is true for the interactive web application
      2. The option to simulate an experiment to assess it is a nice feature and can help experiment design
      3. Identification of amplicons when not provided as input
      4. CRISPR-A seeks substitutions along the entire amplicon sequence and is less dependent on the quantification window and on the putative cutsite
      5. Analysis of the difference, in edit activity, comparing different cell lines
      6. CRISPR-A supports the use of UMIs
      7. Interesting sequence pattern insights - like "...found certain patterns associated with low diversity outcomes: free thymine or adenine at the 3' nucleotide upstream of the cut site that leads to insertions of the same nucleotide, a free cytosine at the same place that leads to its loss, and strong micro-homology patterns that lead to long deletions " We further comment on the soundness of these contributions in our comments below and on their significance in our comments related to the general potential significance of the paper.

      Major comments:

      • Upon attempting to run an analysis from the web interface (https://synbio.upf.edu/crispr-a) and using: fastq of Tx and mock (control), the human genome and the gRNA sequence provided as input for the protospacer field, our run was not successful. In fact the site crashed with no interpretable error message from CRISPR-A.
      • Moreover, there should be more clear context. There is no information regarding the type of experiments that can be analyzed with the tool. We figure it is multiplex PCR and NGS but can the tool also be used for GUIDESeq, Capture, CircleSeq etc.?
      • No off target analysis. Only on-target
      • No translocations and long/complex deletions
      • We view the use of a mock experiment as control as a must for any sound attempt to measure edit activity. This is even more so when off-target events need to be assessed (any rigorous application of GE, certainly any application aiming for clinical or crop engineering purposes). We therefore think that all investigation of other approaches should be put in this context.
      • It's a nice feature to have simulated data, however, it is not a good approach to rely on it.
      • In p7 the authors indicate the implementation of three approaches to improve quantification. They should be clear as to the fact that many other tools and experimental protocols are also using these approaches. for example, ampliCan, CRipresso2 and CRISPECTOR all take into account a mock experiment run in parallel to the treatment.
      • Figure1:
        • The figure certainly provides what seems to be a positive indication of the simulations approach being close to measured results. Much more details are needed, however, to fully understand the results.
        • Squema = scheme ??
        • What was the clustering approach?
        • What is the input to the JS calculation? What is the dimension of the distributions compared? These details need to be precisely provided.
        • What clustering/aggregation approach did the authors use here (average dist, min dist, dist of centers?)
        • 5 pairs were selected out of how many? Call that number K.
        • What does the order of the samples in 1C mean? Is 98_real closer to 22_sim than to 98_sim? If so then state it. If not - what is the meaning of the order? Furthermore - how often, over K choose 2 pairs does this mis-matching occur for the CRISPR-A simulator??
      • "From the characterized data we obtained the probability distribution of each class" (page 3) - How is this done? how many guides? how many replicates? what is class? where do you elabore regarding it? how you obtain the distributions? More details of the methods need to be provided.
      • The 96 samples used for development here - where are they taken from? This should be indicated in the first time these samples are mentioned. Namely - bottom of P6
      • CRISPECTOR is not mentioned in the comparison in the section: "CRISPR-A effectively calls indels in simulated and edited samples" (Table S2). Is there a specific reason for having left it out?
      • In the section "Improved discovery and characterization of template-based alleles or objective modifications" - part of the analysis was made over simulated data and then over real data. The authors state "it is difficult to explain the origin of these differences...". Thus, needs to be investigated in more detail ... :) (P5) Moreover - the performance over real data is, at the end of the day, the more interesting one for comparison purposes.
      • We found no explanation of "spike-in"/"spike experimental data" across the entire article. There is some general language about lengths but the scheme is still totally unclear.
      • Description of the 96 gRNAs? Is this data from REF26? If so - where do you state this? If so - how do the methods described herein avoid the unique characteristics of the data of REF26?
      • "distance between the percentage of microhomology mediated end-joining deletions of samples with the same target was calculated and the mean of all these distances was used to reduce the information of the 96 different targets to a single one." (P6) What is the exact calculation used? which distance? How was clustering performed? What is the connection for gene expression?
      • "we have fitted a linear model to transform the indels count depending on its difference in relation to the reference amplicon" (P7) - needs more explanation. Is this part of the pipeline?
      • What is it "...manually curated data set"? (page 8)
      • Section "CRISPR-A empiric model removes more noise than other approaches" - with what data were the comparisons performed? Moreover, how were the comparison criteria selected (efficiency and sensitivity)? The literature already used several approaches to compare data analysis tools for editing experiments. See for example ampliCan, Crispresso (1 and 2) and CRISPECTOR. Maybe the authors should follow similar lines.
      • In the section "CRISPR-A empiric model removes more noise than other approaches" the authors state, incorrectly, that CRISPECTOR only reports the percentage of editing activity per site (there is much more information reported in the HTML report, including the type of edit event detected - deletion, of various lengths, insertions, substitutions etc). (P8)
      • Section "CRISPR-A noise subtraction pipeline" describes a pretty naive method for noise subtraction (P12). Should be rigorously compared, for Tx vs Mock experiments, to CRISPECTOR and to CRISPResso2.
      • "recalculated using a size bias correction model based on spike-in controls empiric data.." (P14). Where is the formula?
      • Section "Noise subtraction comparison with ampliCan and CRISPECTOR" - fake mock was generated for comparison. We consider the avoidance of a Mock control in experiments designed to measure editing activity to not be best practice. It is OK to support this approach in CRISPR-A. However - the comparison to tools that predominantly work using a Mock control (including ampliCan and CRISPECTOR) should be done with actual Mock. Not with fake Mock .... (P15)

      Minor comments:

      • "Also, most of these tools lack important functionalities like reference identification, clustering, or noise subtraction" - bold part incorrect for CRISPECTOR, although it is not aiming only for CRISPECTOR
      • "Same parameters and probability distributions were fitted for three other cell lines: Hek293, K562, and HCT11626, to make SimGE more generalizable and increase its applicability" (page 3) - how was fitted?
      • What is the "nature of modification"? (P5)
      • In the section "CRISPR-A effectively calls indels in simulated and edited samples" (P5) towards the end, the authors write that the CRISPR-A algorithm did not give good results for a few examples. They then state that this was corrected and then yielded good results. There is no explanation of what correction was done, if it was implemented in the code and how to avoid/detect it in further cases.
      • Cell culture, transfection, and electroporation - explanation only for HEK293, what about the others? (P15)
      • Typos and unclear wording:
        • "obtention" (P8)
        • "mico" >> micro (P 7,10)
        • "Squema" >> scheme (Fig.1)
        • "decombuled" (P10)
        • "empiric" >> empirical (P8 and other places)
        • "Delins" (P14)
        • "performancer" (P9)
        • Change word across all article - "edition" to "editing"
        • "...has enough precision to find" (P6) not related to "results" section
      • Comments on figures:
        • Fig. 2C:
      • No CRISPECTOR in the analysis
      • It is simulated data only
      • It is not violin plot as mentioned in the description
        • Fig 3A - Is it significant?
        • Fig. 4:
      • A
      • Each row/column is a vector of 96 guides?
      • How is the replicate number decided? Is it a different experiment by date? What is separating between experiments? Rep numbers?
      • B - Differential expression:
      • How? did you measure RNA?
      • Is the observed data in the figure sufficiently strong in terms of P-value?
      • Where is the third cell-line?
        • Fig.13 - There is no A and B as mentioned in the text

      Significance

      We repeat the aspects of contribution, as listed in the first part of the review, and comment about significance:

      • NextFlow pipeline implementation is an important engineering contribution. Same is true for the interactive web application
        • Significant engineering contribution. Nonetheless, we were not able to run the analysis. So - needs to be checked.
      • The option to simulate an experiment to assess it is a nice feature and can help experiment design
        • An important methodology contribution
      • Identification of amplicons when not provided as input
        • Not important in the context of multiplex PCR and NGS measurement assays, as amplicons will be known. Not clear what other contexts the authors were aiming at.
      • CRISPR-A seeks substitutions along the entire amplicon sequence and is less dependent on the quantification window and on the putative cutsite
        • Importance/significance needs to be demonstrated
      • Analysis of the difference, in edit activity, comparing different cell lines
        • Significant contribution. However - the methods need to be much better explained and the results better described in order for this to be useful to the community.
      • CRISPR-A supports the use of UMIs
        • Mildly significant technical contribution. However - only addresses on-target. Also addressing off-target would have been significant.
      • Interesting sequence pattern insights - like "...found certain patterns associated with low diversity outcomes: free thymine or adenine at the 3' nucleotide upstream of the cut site that leads to insertions of the same nucleotide, a free cytosine at the same place that leads to its loss, and strong micro-homology patterns that lead to long deletions "
        • As stated - interesting.
  4. docdrop.org docdrop.org
    1. Many issues in education policy have therefore come down to an apparent choice between the individual success of comparatively privileged students and the collective good of all students or the nation as a whole.

      I have always believed that individualism is a very selfish behavior pattern. I agree that individualism can make one's life better, or let one have a higher quality of life. But people live in a collective. In a collective, too strong individualism may harm the interests of the collective. There is an old Chinese saying that when there is no lip, the teeth will be very cold. It means that the collective is the umbrella of the individual. Individualism can exist, but I think we should put collective interests first.

    1. so you’ve ignored the potential benefits to your current situation as university students.

      I think another relevant thing here is what the student's goal is. We talked a bit about this in class, but some students just want to get the paper/assignment done in the shortest time and with the least effort. They probably won't use this process because it's not a requirement like it was in high school. They may not even consider that it's worth writing the best paper they can and considering strategies to help them do so.

    1. Reviewer #2 (Public Review):

      By now, the public is aware of the peculiarities underlying the omicron variants emergence and dissemination globally. This study investigates the mutational biography underlying how mutation effects and epistasis manifest in binding to therapeutic receptors.

      The study highlights how epistasis and other mutation effect measurements manifest in phenotypes associated with antibody binding with respect to spike protein in the omicron variant. It rigorously tests a large suite of mutations in the omicron receptor binding domain, highlighting differences in how mutation effects affect binding to certain therapeutic antibodies.

      Interestingly, mutations of large effect drive escape from binding to certain antibodies, but not others (S309). The difference in the mutational signature is the most interesting finding, and in particular, the signature of how higher-order epistasis manifests in the partial escape in S309, but less so in the full escape of other antibodies.

      The results are timely, the scope enormous, and the analyses responsible.

      My only main criticisms walk the stylistic/scientific line: many of the others have pioneered discussions and methods relating to the measurement of epistasis in proteins and other biomolecules. While I recognize that the purpose of this study is focused on the public health implications, I would have appreciated more of a dive into the peculiarity of the finding with respect to epistasis. I think the authors could achieve this by doing the following:

      a) Reconciling discussions around the mutation effects in light of contemporary discussions of global epistasis "vs" idiosyncratic epistasis, etc. Several of the authors of the manuscript have written other leading manuscripts of the topic. I would appreciate it if the authors couched the findings within other studies in this arena.

      B)While the methods used to detect epistasis in the manuscript make sense, the authors surely realize that methods used to measure is a contentious dimension of the field. I'd appreciate an appeal/explanation as to why their methods were used relative to others. For example, the Lasso correction makes sense, but there are other such methods. Citations and some explanation would be great.

      Lastly (somewhat relatedly), I found myself wanting the discussion to be bolder and more ambitious. The summary, as I read it, is on the nose and very direct (which is appropriate), but I want more: What do the findings say for greater discussions surrounding evolution in sequence space? For discussions of epistasis in proteins of a certain kind? In, my view, this data set offers fodder for fundamental discussion in evolutionary biology and evolutionary medicine. I recognize, however, the constraints: such topics may not be within the scope of a single paper, and such discussions may distract from the biomedical applications, which are more relevant for human health.

      But I might say something similar about the biomedical implications: the authors do a good job outlining exactly what happened, but what does this say about patterns (the role of mutations of large effect vs. higher-order epistasis) in some traits vs others? Why might we expect certain patterns of epistasis with respect to antibody binding relative to other pathogenic virus phenotypes?

      In summary: rigorous and important work, and I congratulate the authors.

    1. Author Response

      Reviewer #3 (Public Review):

      In this manuscript, Kim et al. use a deep generative model (a Variational Auto Encoder previously applied to adult data) to characterize neonatal-fetal functional brain development. The authors suggest that this approach is suitable given the rapid non-linear development taking place in the human brain across this period. Using two large neonatal and one fetal datasets, they describe that the resultant latent variables can lead to improved characterization of prenatal-neonatal development patterns, stable age prediction and that the decoder can reveal resting state networks. The study uses already accessible public datasets and the methods have been also made available.

      The manuscript is clearly written, the figures excellent and the application in this group novel. The methods are generally appropriate although there are some methodological concerns which I think would be important to address. Although the authors demonstrate that the methods are broadly generalisable across study populations - however, I am unsure about the general interest of the work beyond application of their previously described VAE approach to a new population and what new insight this offers to understanding how the human brain develops. This is a particular consideration given that the major results are age prediction (which is easily done with various imaging measures including something as simple as whole brain volume) and recapitulation of known patterns of functional activity in neonates. As such, the work will be of interest to researchers working in fMRI analysis methods and deep learning, but perhaps less so to a wider neuroscience/clinical readership.

      Specific comments:

      1) (M1) If I understand correctly, the method takes the functional data after volume registration into template space and then projects this data onto the surface. Given the complexities of changing morphology of the development brain. would it not be preferable to have the data in surface space for standard space alignment (rather than this being done later?). This would certainly help with one of the concerns expressed by the authors of "smoothing" in the youngest fetuses leading to a negative relationship between age and performance.

      While projecting onto the cortical surface has its advantages, as suggested here18, several studies have also shown that with careful registration, such as in the current study, volumetric registration can yield comparable performance19. Regardless, we did attempt to directly generate cortical surfaces for our fetuses. We refer the reviewer to our response to the RE-M2 [page 9].

      Regarding the “smoothing” effect in the youngest fetuses, we want to clarify that the smoothing effect in the scans of young fetuses is not unique to the choice of registration method. In other words, the same smoothing effect must be seen with cortical registration as well. Regarding this perspective, we kindly refer the reviewer to our response to RE-M1 [page 7]. Regarding the specific change made in the revised manuscript, we kindly refer to our response to R1-m5 [p21] or [page 9 line 191-213] in the main manuscript.

      2) (M2) A key limitation which I feel is important to consider if the method is aiming to be used for fetuses is the effects of the analysis being limited only to the cortical surface - and therefore the role of subcortical tissue (such as developmental layers in the immature white matter and key structures like the thalami) cannot be included. This is important, as in the fetal (and preterm neonatal) brain, the cortex is still developing and so not only might there be not the same kind of organisation to the activity, but also there is likely an evolving relationship with activity in the transient developmental layers (like the subplate) and inputs from the thalamus.

      The reviewer raises an important point. We agree with the reviewer that the subcortical region plays a critical role in fetal and newborn neurodevelopment. Unfortunately, our current VAE model cannot utilize such information without a major change in the model structure. We added this as a limitation of our study and discussed why our VAE model, in its current form, did not include subcortical areas. Please see our detailed response to RE-M1 [page 4] or [page 25 line 558-570] in the main manuscript.

      3) (M3) As the authors correctly describe, brain development and specifically functional relationships are likely evolving across the study time window. Beyond predicting age and a different way of estimating resting state networks using the decoding step, it is not clear to me what new insight the work is adding to the existing literature - or how the method has been specifically adapted for working with this kind of data. Whilst I agree that these developmental processes are indeed likely non-linear, to put the work in context, I think the manuscript would benefit from explaining how (or if) the method has been adapted and explicitly mentioning what additional neuroscientific/biological gains there are from this method.

      We appreciate the reviewer’s critical insights. In the revised paper, we included additional results that, we hope, can address the reviewer’s concerns. We believe that the strength of the VAE model is that, relative to linear models, it can be more generalizable across different datasets and ages (adult vs. full-term babies vs. preterm babies vs. fetuses). In the original manuscript, this was supported by the superior age prediction performance of the VAE over linear models when applied to different datasets covering the fetal to neonatal periods. Age prediction could also be done using other imaging modalities, as the reviewer pointed out. However, we do not think this undermines the potential impact of having the ability to accurately estimate age based on functional connectivity patterns. Brain function-structure relationships may not exactly be one-to-one20. It is entirely possible that for one disease, brain functional connectivity alterations precede structural changes such that delayed growth trajectories will first manifest in the functional space. There are also certain aspects of brain function that cannot be mapped directly to its structural characteristics (i.e., structural connectivity patterns). For example, brain changes its functional connectivity patterns dynamically over different brain states (resting vs. task-engaging)21, mental disorders (depression22, anxiety23, Schizophrenia24), cognitive traits25, 26, and individual uniqueness25, etc. Therefore, we believe that estimating the functional age of fetuses and neonates given their functional connectivity profiles may provide a biomarker for tracking neurodevelopment trajectories, allowing clinicians to identify deviations early and intervene in a timely manner if necessary. For these reasons, we believe that superior age prediction performance of the VAE model compared to linear models is scientifically significant.

      The value of the VAE lies in its ability to capture FC features that are otherwise not modeled by linear strategies. For example, here, we showed that only the VAE model can extract latent variables representing brain networks that are similar across different datasets. In contrast, linear models, showed higher network pattern similarity between full-term and preterm infants within the dHCP dataset. This suggests that the VAE model can be a very useful tool for capturing common brain networks in datasets acquired using different recording parameters and preprocessing steps. Moreover, the VAE representations predicted age with higher accuracy compared to linear representations. Together, these findings show that the methodology is effective in extracting functionally relevant features of the brain. Please see RE-M1 [page 3] and R1-m13 regarding the specific changes made in the revised manuscript.

      4) (M4) The unavoidable smoothing effect of VAE is very noticeable in the figures - does this suggest that the method will be relatively insensitive to the fine granularity which is important to understand brain development and the establishment of networks (such as the evolving boundaries between functional regions with age) - reducing inference to only the large primary sensory and associative networks? This will also be important to consider for the individual "reconstruction degree" - (which it would likely then overstate - and would need careful intersubject comparison also) if it was to be used as a biomarker or predictor of cognition as suggested by the authors.

      Regarding the first concern, yes. Greater smoothing will tend to yield less granular network patterns; this is true for all representational models (not only VAE, but also models like ICA or PCA). This effect becomes ever more pronounced when representations consist of fewer components (e.g., IC50); the smoothing effect becomes stronger, leading to coarser brain patterns (see Fig. 3 in the revised manuscript). In this regard, higher number of components is desired, but on the flipside, IC maps with higher components are generally less interpretable. In short, there will always be trade-offs between interpretability and spatial resolution. Also, higher components tend to cause over-fitting issue, as shown in our age prediction performance across different datasets (worse performance in the IC300 vs. IC50). In this sense, what matters for the representations is how informative each latent variable (or component) is. In the revised Fig. 2, we showed that latent variables from the VAE model were more informative in representing rsfMRI than linear representations. It is also noteworthy that the smoothing effect of the VAE is comparable to IC300 (similar effect to manual smoothing at the level of FWHM=5mm; revised Fig. 3). Given above results, we believe the VAE model may be more suitable for investigating finer scale of brain networks, than linear models. The above perspective was updated in the revised manuscript as [page 23 line 506-511]:

      "Another interesting observation was that the smoothing effect of the VAE is comparable to IC300 (similar effect to manual smoothing at the level of FWHM=5mm; Fig. 3). Given the above, we believe the VAE model may be more suitable for investigating finer scale of brain networks, than linear models. Perhaps, the VAE model with a greater number of latent variables (e.g., 512 or 1024 instead of 256 in the current VAE) can be utilized to find brain networks at finer scale."

      On top of the points raised above, network mapping with linear models is limited when it comes to mapping the spatial evolution of brain networks over aging due to their linear nature. This limitation can be observed in the ICA study with dHCP dataset (Fig. 4 in 7). On the other hand, thanks to its nonlinearity nature, the VAE model may have a potential to observe the spatial gradient of brain network over aging, while this expectation needs confirmation. To that end, we revised our discussion to reflect our perspective. We refer the full change made in the revised manuscript to our response to R1-m13.

    1. Author Response

      Reviewer #1 (Public Review):

      The manuscript by Shaikh and Sunagar addresses the question of the origin of spider venom proteins. It has been known for many years that an important component of spider venoms is a diverse group of small proteins known as disulfide-rich peptides (DRPs). However, it has not been clear whether this group of proteins has a common origin or evolved convergently in different lineages. The authors collected sequences of the genes encoding these proteins from publicly available genomes of spiders from a range of families. They aligned the sequences using the structural cysteines as guides and carried out a phylogenetic analysis of the different sequences, ultimately classifying the different proteins into over 50 super-families. One thing that is not clear from the text or from the references cited (I am not an expert on spider venom) is how many of these superfamilies were known before and how many are novel. There is also no clear indication of what criteria were used to define a subset of sequences as a superfamily. Nonetheless, the authors show that all these superfamilies have a single common ancestor, predating the divergence of araneomorphs and mygalomorphs and that the DRPs underwent independent diversification in each of these two lineages.

      We have identified 78 novel superfamilies in this study and 33 were previously identified (Pineda et al. 2020 PNAS). We had previously described information in lines 90, 101 and 106 regarding the description of novel superfamilies from previous studies and the ones described in this study.

      Line 90 “Recently, using a similar approach, 33 novel spider toxin superfamilies have been identified from the venom of the Australian funnel-web spider, Hadronyche infensa (9).”

      Line 101 “This approach enabled the identification of 33 novel toxin superfamilies along the breadth of Mygalomorphae (Figures S1 and S2).”

      Line 106 “Moreover, analyses of Araneomorphae toxin sequences using the strategy above resulted in the identification of 45 novel toxin superfamilies from Araneomorphae, all of which but one (SF109) belonged to the DRP class of toxins (Figures S3 and S4).”

      Spider toxin superfamilies have been named after gods/deities of death, destruction and the underworld based on nomenclature introduced by Pineda et al. (2014 BMC genomics). We have now included this explanation in the manuscript under the methods and results sections. We have also provided additional details pertaining to this nomenclature in Table S1.

      The authors also looked at selective forces acting on the sequences using dN/dS analyses. They reach the conclusion that there are different modes of selection acting on different sequences based on their role - defensive or predatory venoms - building on previous work by the lead author on venom sequence evolution in diverse animals.

      All in all, this is an admirable piece of molecular evolution work, providing new data on the evolution of spider venom proteins. There are some confusions in terminology that need to be cleared up, and somewhat more context needs to be given for non-specialists as detailed in the points below:

      We thank the reviewer for their constructive and critical suggestions, as well as the kind words of encouragement. Their suggestions have helped us in significantly improving the quality of our work.

      Suggestion 1) Common names of the main spider infraorders should be given.

      We thank the reviewer for their helpful input. We have now introduced spider infraorders with well-known spiders and their common names under the introduction section. Furthermore, we have also included a schematic representation of the spider phylogeny, and highlighted lineages under investigation as Figure 1.

      Suggestion 2) Opisthothelae is not the common ancestor of Mygalomorphae and Araneamorphae, but the clade that encompasses those two clades. This incorrect statement appears in several places. Further on, it is stated that Opisthothelae is the common ancestor of all extant spiders. This is wrong both from a terminological point of view (a clade cannot be ancestral to another clade) and from a factual point of view, since there are extant spiders not included in Opisthothelae.

      We thank the reviewer for pointing out this oversight. We have now corrected it to suborder Opisthothelae as the clade encompassing Mygalomorphae and Araneomorphae spiders.

      Suggestion 3) Several proteins and proteins families are mentioned without being introduced, e.g. knottin. Please provide short descriptions.

      We have now provided a short introduction to terms such as Knottin.

      Reviewer #2 (Public Review):

      This interesting study looks into the evolution of putative spider venom toxins, specifically disulfide-rich peptides (DRPs). The authors use published sequence data to gain new insights into the evolution of DRPs, which are the major component of most spider venoms. Through a series of sequence comparisons and phylogenetic analyses they identify a substantial number of new spider toxin superfamilies with distinct cysteine scaffolds, and they trace these back to a primitive scaffold that must have been present in the last common ancestor of mygalomorph and araneomorph spiders. Looking at the taxonomic distribution of these putative venom DRPs, they conclude that mygalomorph and araneomorph DRPs have evolved in different ways, with the former being recruited into venom at the level of genera, and the latter at the level of families. In addition, they perform selection analyses on the DRP superfamilies to uncover the surprising result that mygalomorph and araneomorph DRPs have evolved under different selective regimes, with the evolution of the former being characterised by positive selection, and the latter by purifying (negative) selection.

      However, I don't think that in the current state of the manuscript these conclusions are robustly supported for several reasons. First, it seems that not all previously published data were included in the phylogenetic analyses that were used to identify new superfamilies of DRPs.

      We have, indeed, analysed all spider toxin sequences available to date. We have relied on the signal and propeptide regions for identifying novel superfamilies, which is an accepted convention: Pineda et al. (2014 BMC Genomics); Pineda et al. (2020 PNAS).

      Although many additional superfamilies can be identified, we have only retained those sequences for which there were at least 5 representatives for the identification of toxin superfamilies, and 15 representatives for selection analyses to ensure robustness. This filtering step ensured that the generated alignments, phylogenetic trees, and evolutionary assessments were robust and devoid of noise that stems from single-representative groups. Adding in those sequences would have enabled us to identify many more superfamilies, solely based on the signal and propeptide examination, but it wouldn’t have been possible to support them with other lines of evidence that were provided for all other superfamilies in this study, jeopardising the overall quality of the manuscript. Nonetheless, there is strong evidence that the left-out sequences are also related to the ones analysed in this study (Figure S10). In future, when more transcriptomes are sequenced, it would be possible to designate these newer toxin superfamilies with much stronger support.

      Second, much of the data were obtained from whole-body transcriptome data, which leaves a degree of uncertainty that these data indeed derive from the venom glands that produce the toxins.

      We respectfully disagree with the reviewer that ‘much of the data’ are from the whole-body transcriptomes. Nearly all sequences in our study are sourced from Pineda et al. (2014 BMC Genomics and 2020 PNAS), Sunagar et al (2013 Toxins), Cole and Brewer (2020 bioRxiv) and transcriptome sequence assembly data from established online repositories NCBI (NR and TSA) and ENA. All the above-mentioned studies (KS is a part of many of these) under their methods section clearly state that the transcriptomes were generated using mRNA isolated from venom gland tissue (BioProject accessions: PRJEB14734; PRJEB6062; PRJNA189679, PRJNA587301 and PRJNA189679, where source tissue type is designated as venom gland).

      We would like to direct the reviewer’s attention to the following excerpts from reference papers from which data for this study has been sourced:

      1. Pineda S et al. (2020 PNAS): “Three days later, they were anesthetized, and their venom glands were dissected and placed in TRIzol reagent (Life Technologies). Total RNA from pooled venom glands was extracted following the standard TRIzol protocol.”
      2. Sunagar et al (2013 Toxins): “Paired venom glands were dissected out and pooled from nine mature females on the fourth day after venom depletion by electrostimulation. Total RNA was extracted using the standard TRIzol Plus method ...”
      3. Cole and Brewer (2020 bioRxiv): “... the venom glands of each ctenid were dissected out, whole RNA was isolated from the venom glands …”

      We would also like to point out that hexatoxins are widely studied and are some of the most well-understood spider venom toxins. Many representatives have been functionally characterised and shown to be potent in affecting prey and predatory species [Sunagar et al (2013 Toxins); Pineda et al. (2014 BMC Genomics and 2020 PNAS); Volker, et al. (2020 PNAS) - KS is a part of most of these studies as well]. However, the current technologies do not permit the high-throughput screening of the enormous diversity of toxins in spiders, which is why not every toxin sequence identified from the venom gland is functionally characterised. Nonetheless, venom researchers will not contest the role of these highly expressed venom gland proteins in envenoming, especially given that they share significant sequence identities with toxins that are functionally well-characterised.

      The only exception to the above is non-ctenid araneomorph toxin superfamily sequences, which are retrieved from whole-body transcriptomes (Cole and Brewer; 2020 bioRxiv). The authors of the paper indicated these as putative toxins. As explained above, homologs of these peptides are well-characterised to be venom toxins. Additionally, in our phylogenetic trees (Figures 3, 4, S6 and S9), they are nested within the toxin clades, reaffirming their identity.

      Third, the taxonomic representation of mygalomorph and araneomorph diversity in this study is so sparse that it becomes impossible to distinguish whether toxin recruitments have happened at the level of genera, families, or even higher-level taxa.

      We respectfully disagree with this suggestion. The taxonomic breadth investigated in this study isn’t sparse. Analysed sequences belong to groups across the breadth of the spider phylogeny. To address this criticism, we are now including a schematic representation of spider phylogeny, where lineages under investigation are highlighted (Figure 1A). Given this broader taxonomic breadth, all of our interpretations are parsimoniously extendable to their common ancestors. For instance, we establish the common origin of all DRPs in the members of these widespread spider families. Therefore, not including sequences from other sister groups will not invalidate this hypothesis, and the most parsimonious explanation will be that the missing members too are likely to have DRPs in their venom (which is also a common understanding of the spider venom research). Whether DRPs dominate the venoms of these missing groups will only come to light upon investigation, but their presence in the venom is highly likely. Moreover, please do note that we have analysed nearly all sequences available in the literature to date.

      As for the recruitment of the toxin superfamily at the taxon level, we would like to point out the phylogenies in Figures 2 and 3 that clearly show the differential recruitment events. We would also like to point out lines 120 and 136 state that this may not only be a result of recruitment and could arise from differential rates of diversification (also evident in other analyses presented in Figures 5 and Tables S2 and S3).

      Line 120 “Interestingly, the plesiotypic DRP scaffold seems to have undergone lineage-specific diversification in Mygalomorphae, where the selective diversification of the scaffold has led to the origination of novel toxin superfamilies corresponding to each genus (Figure 2).”

      Line 136 “However, we also documented a large number of DRP toxins (n=32) that were found to have diversified in a family-specific manner, wherein, a toxin scaffold seems to be recruited at the level of the spider family, rather than the genus. As a result, and in contrast to mygalomorph DRPs, araneomorph toxin superfamilies were found to be scattered across spider lineages (Figure 3; Figure S6; node support: ML: >90/100; BI: >0.95).”

      Adding any number of missing lineages will neither change the fact that araneomorphs ‘appear’ to have recruited these superfamilies at the genera level, nor the family-level recruitment of toxin superfamilies in a large number of examined mygalomorphs.

      We have now introduced a new figure (Figure 7) that highlights the different scenarios that explain the observed differences in the evolution of mygalomorph and araneomorph spider toxins. We have also included additional text in the manuscript to explain this better.

      Fourth, only a selection of DRP superfamilies was used for natural selection analyses, without the authors explaining how this selection was made. Yet, they attempted to draw general conclusions about toxin evolution in mygalomorphs and araneomorphs, even though most of the striking differences they found were restricted to just two mygalomorph genera, and one family of araneomorphs.

      From our experience and previous reports [Sunagar and Moran (2015, PLoS genetics); Sunagar, et al. (2012, MBE); Yang, Z. (2007, MBE)], the unavailability of enough sequences from datasets results in inaccurate estimation of omega values. For instance, if there are only a couple of sequences in a superfamily, both of which are slightly different from one another, then even these minor differences in them would be exaggerated. Hence, we have resorted to performing selection analysis on datasets for which there are at least 15 sequences. No doubt that this conservative approach reduces the number of datasets analysed, but it also ensures that our findings are well-supported. We have now clarified this in our manuscript under the methods section.

      However, we did previously include sequences from all toxin superfamilies described to date in our alignment figure (Fig S10) and analysed their signal and propeptide regions. They were only excluded from selection analyses. It can be seen that they too are DRPs, but they belong to distinct superfamilies from the ones being described here.

      If these concerns are addressed this study can shed important new light on venom toxin evolution in one of the most diverse venomous taxa on Earth.

      We thank the reviewer for their constructive inputs and suggestions which have enabled us to make this manuscript more accessible to a wider audience.

      Reviewer #3 (Public Review):

      This work aims to elucidate the evolutionary origins of disulfide-rich spider toxin superfamilies and to determine the modes of natural selection and associated ecological pressures acting upon them. The authors provide a compelling line of evidence for a single evolutionary origin and differing factors (e.g., prey capture strategies and methods of anti-predator defense) that have shaped the evolution of these toxins. Additionally, the two major spider infraorders are claimed to have experienced differing selective pressures regarding these toxins.

      The results presented here are novel and generally well-presented. The evidence for a single origin of DRP toxins in spiders is exciting and changes the paradigm of spider venom evolution.

      The data are well analyzed, but the methods lack enough detail to reproduce the results. More information regarding the parameters passed to each software package, version numbers of all software employed, and models of molecular evolution employed in phylogenetic analyses are among the necessary missing information.

      We thank the reviewer for their kind words and constructive and critical suggestions. Their suggestions have contributed towards improving the quality of our work. Upon their suggestion, we have now expanded the methods section to include more details.

      The differences in the evolutionary pressures between mygalomorphs and RTA-clade spider DRP toxins are clear, but expanding RTA results to all araneomorphs may be overreaching. Additional araneomorph sequence data is available, despite the claims within this manuscript (e.g., see Jiang et al.. 2013 Toxins; He et al.. 2013 PLoS ONE; and Zobel-Thropp et al.. 2017 PEERJ). These papers include cDNA sequences of spider venom glands and contain representatives of inhibitory cysteine knot toxins, which are DRP toxins. These data would greatly enhance the strengths of the results presented herein.

      In response to the expansion of RTA results to araneomorphs, we would like to point out that RTA comprises about 50% of the diversity recorded in Araneomorphae. The araneomorph data analysed in our study covers a range of araneomorph family divergence time Agelenidae (<70 MYA), Pisauridae (<50 MYA) and Theridiidae (~200 MYA, Magalhaes 2020, Biological Reviews 95.1). We report a strong signature of purifying selection influencing the evolution of araneomorph toxin SFs, despite the long evolutionary time separating them (50 - 200 MYA). We firmly believe that further addition of toxin sequence data from other groups will not deviate from the general trend of molecular evolution observed in both these lineages across such large period of time; barring certain certain exceptions (such as SF13 a defensive toxin identified from Hadronyche experiencing purifying selection; Volker, et al. 2020 PNAS).

      We had initially excluded non-ctenid datasets from our analyses on account of poor sequence annotation and lack of representative sequence data. However, we have now incorporated Dolomedes mizhoanus (DRP) (Jiang et al. 2013 Toxins) and Latrodectus tredecimguttatus (non-DRP) (He et al. 2013 PLoS ONE) toxin dataset into our analyses, following reviewer’s suggestion. This has led to identification of 5 novel superfamilies, providing additional support to our spider venom evolution hypothesis.

    1. "Individuals are social animals who are influenced by social preferences, social networks, social identities, and social norms: most people care about what those around them are doing and how they fit into their groups..."

      https://www.theguardian.com/global/2019/nov/24/fear-of-missing-out-fomo-making-decision-biology-fobo-christmas-turkey#:~:text=Indecision%20when%20the%20decision%20is,or%20fear%20of%20missing%20out.

      This idea is one which we are all familiar with: the fear of missing out, more commonly known as FOMO. The referenced article extends this idea beyond to FOBO, or the fear of better options. This is a great example exhibiting the rationale behind why many of us tend to think socially, as we are afraid of missing out on the better choices that may be presented to us. Consequentially, in decision-making, we may rely on others' decisions, as we are all aware, whether consciously or subconsciously, that collective thinking is "smarter".

    1. can dive deeper into how you how you practice a spirituality that promotes both an individual well-being and the 00:09:02 health of our society and our environment like well it's interesting you ask that question because at the root of it or you could say the the presumption of that is is the kind 00:09:14 of duality or separation between the two right yeah exactly i mean i'm i'm reminded of something joanna macy uh said um the world has a role to play in our awakening 00:09:28 um i think many of us still have a kind of romanticized idea about the path even the bodhisattva path the idea that somehow you might go off to a cave and meditate really hard or something and 00:09:40 then when you're deeply enlightened then you return to the world and become engaged you know returning to the marketplace and i think frankly that's a bit simplistic if not if not naive it's like 00:09:54 the two go hand in hand uh because they reinforce each other you know um i think that when we start buddhist practice perhaps inevitably there there's a kind 00:10:07 of self-preoccupation because what brings us to it i mean there's some some suffering some dissatisfaction in our own lives why else would we spend so much time energy and money you know 00:10:20 making sore legs and backs for ourselves um but as we progress you know as as we get more insight into what's going on then if things are going well we eventually 00:10:33 begin to realize that at the root of our dissatisfaction is the delusion of separation yes from from other people and from the rest of the world

      !- integrating : individual and collective wellbeing - David Loy offers a clear explanation of the entangled nature of self-and-other - we begin the journey of self improvement due to problems in our personal lives, that is the motivation - but as we continue the journey, we may discover that it is our separation from others and from nature herself that is the cause of our dis-satisfaction - David quotes Joanna Macy, who said that "the world has a role to play in our awakening"

    1. you  see a lot of third world debts that uh if the   third world better countries have to pay uh their  foreign debts under as the world economy slows   down they're going to be subject to austerity to  the world banks and the imf's austerity programs   00:35:01 and they're going to be kept in poverty uh is it  really right that they should be kept in poverty   just to enrich the bondholders of the one percent  the one percent will say yes that's why we're   the one percent so that we can impoverish  other people that's our liberty our liberty   is the right to impoverish other people and reduce  them to dependency uh that will happen if you do   not write down the debts uh it's already happening  in the united states to the student debt uh crisis   00:35:30 where students uh have to pay so much money uh as  they fall behind on their student debts that they   can't afford to take out mortgages to buy homes  uh and you're having the home ownership rates   plunge in the united states that's the result of  leaving the debts in place uh the mortgage steps   uh uh are causing shrinkage so there is no way  to get out of this economic polarization without   00:35:54 a debt write down and that's something that  is too radical and uh uh when we talked about   when i was referring to what china's doing i'm  referring to what it's doing today and tomorrow   about uh the uh real estate company evergreen  uh uh china has a choice is it going to leave   evergreen's real estate debts in place and every  grand uh as a real estate company is two to three   00:36:21 percent of the entire chinese economy if it  pays the foreign creditors and the domestic   one percent of china it's going to impoverish the  uh the employees of evergrand it's going to make   housing prices more and more expensive in china  china has had a debt finance housing boom uh   if you leave the debts in place then uh you're  you're going to impoverish china and obviously   00:36:47 china is going to say i'm we're not going to put  the creditors first we're not going to do what   the west does and say the sanctity of debt service  debts are uh that you owe or sacred uh it's worth   sacrificing the economy it's worth plunging the  economy into poverty just to preserve the wealth   of the one percent i think china's uh is going to  make the opposite decision and say we're not going   00:37:12 to commit political suicide we're going to operate  for it's a socialist economy and when it comes to   debt and credit thank god we have our banking in  the public domain and since the public domain the   people's bank of china is the creditor they can  afford to write down the debt without having any   political backlash because it's cancelling that  so do itself uh which is a great advantage uh and   00:37:38 it's also uh as for the private bond holders uh  it's going to say well sorry bondholders you made   loans to a company that was way over leveraged  uh already uh the american bond rating companies   have reduced their bond rating to chunk so you  knew what you were buying if you continued to hold   bonds that uh fitch and other bond raiders moody's  all say or junk and you lose your money well   00:38:03 you took the risk you got a high rate of  interest now you're you're paying the price   that's how markets work uh and uh that really  uh is the argument and i think uh you have to   uh obviously what i'm suggesting is a radical  step just as you're suggesting of taxing wealth   would require the radical step of closing down  offshore banking centers of simply negating uh if   00:38:28 banks would simply erase all of the deposits  they have from the offshore banking centers from   the cayman islands from from panama from uh from  liberia to all the places that began by to be set   up by the mining companies the oil companies  and then were set up beginning in the 1960s   essentially by the cia to finance  the vietnam war by making america   like england the home for criminal capital  for flight capital all this uh all this flight   00:38:57 capital and the kleptocracy that you mentioned  in russia all this really should be wiped out   and if you leave this capital if you leave this  one percent in place the economy is going to be   sacrificed and shrinking is it worth shrinking  the economy just to leave the one percent in place   and if you challenge them that's pretty radical  that's really what i think marx would say today

      !- Micheal Hudson : debt writedown - At a certain point, Governments of 3rd world countries who are so debt trapped may simply decide to write down the debts and start over - They may reach a point where instead of servicing the debt of the 1%, they decide its not worth it and save their own economies, freeing themselves from World Bank and IMF debt conditions - It's just as radical a move as your suggestion to stop tax evasion by closing down all offshore banks

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Dear Editor,

      Please find below our detailed responses (in black font) to the Reviewer's comments (in blue). In addition, and to the request of Reviewer #1, we added a PDF file called “Reply to the reviewers MS data” that shows MS/MS and quantification information of representative peptides which were selected based on their (different) caspase/control abundance ratios. We thank the reviewers for their time and valuable comments.

      NOTE: our original reply includes several tables and graphs that were not incorporated into our reply shown below

      Reviewer #1

      Page 4 - In contrast to the hindrance of N-terminal amine ionization by Nt-acetyl groups concluded by the authors, previous studies reported an improved MS-scoring if α-amino-acetylated (tryptic) peptides by the higher numbers of b and y fragment ions observed as compared to α-amino-free (tryptic) peptides (e.g. (Staes et al., 2008)). It is rather the lack of any N-/C-terminal charged residue in case of Lys-N type N-termini which makes LATE less suitable for studying N-terminal protein acetylation.

      We thank the reviewer for this comment. In the HYTANE and LATE workflows, only peptides with modified N-termini (by dimethylation or acetylation) are observed after negative selection, hence we argue that the important comparison here is between Nt-acetylated peptides and Nt-dimethylated peptides with (as in HYTANE) or without basic residue (as in LATE). While we are aware of the study by Staes et al 2008 (PMID: 18318009), we do not believe it contradicts our claim as it discusses the difference between Nt-acetylated peptides and peptides with free N-termini.

      As we indicated in the manuscript (page 5 in the last sentence of 1st paragraph), we observed less overall peptide identifications in LATE, which was expected due the lack of basic C-term residue. The reduction of identification was more pronounced for Nt-acetylated peptides. However, this still does not exclude LATE as a useful tool for identification of such peptides.

      Of note, the overall fragmentation coverage we obtained by LATE and HYTANE for Nt-acetylated and Nt-dimethylated peptides was similar. See the figure below.

      Hence, following Cho et al 2016 (PMID: 26889926), we suggest that the difference in ionization of Nt-dimethylated peptides vs Nt-acetylated peptides is the more dominant factor in peptide identifications.

      Figure 1:relative Ion coverage for modified peptides in LATE and HYTANE

      Page 4 - Besides indication the retained N-termini with high relative caspase-3/control abundance ratio's as putative caspase-3 proteolytic products, also indicate that unique peptides were retained, as many such singletons were reported in previous (caspase-focussed) degradomics studies making use of differential proteomics (e.g. (Van Damme et al., 2005)). Therefore the cut-off ratio of 2 rather seems unsubstantiated, unless the cellular proteomes of so-called control cells were affected by caspase activation. As such, showing some representative MS-spectra of neo-N-termini would be informative.

      We thank the reviewer for this comment. We agree that caspase-3 cleavage generates many singletons. This is indeed what we observed in the in vitro experiment as shown in Figure 2B by the long straight lines at Log2(caspase-3/control) >10. We also add here histograms of the obtained ratios that we hope will make this clearer. We chose a cut-off of 2 due to the basal activity of proteases (including caspase-3) as we did not add caspase-3 inhibitors to the cell lysate. In addition, peptides derived from the putative caspase-3 cleavages in the in vitro experiment were required to be identified only in the caspase-3-treated samples (i.e. to appear only with the heavy labeling). Minor changes to Figure 3 legend have been introduced accordingly. As can be seen in the table below, with a cut-off ratio of 2 (Log2=1) and selection of cleavage sites after D or E we identified >98% of the cleavage sites that were identified only in the caspase-3 treated samples (column text in blue). This rate did not change when the cut-off was set to 8 (Log2=3). Therefore, we have chosen to maintain our selection criteria and cut-off ratio as used before for both experiments.

      Figure 2: Histograms of Log2(Caspase/control) ratios indicating the large number of singleton peptides (marked with arrows)

      Table 1: In vitro experiment selection ratio

      Method

      Cutoff

      Time

      Sites

      Sites identified only caspase-3 treated samples

      % of caspase-treated only sites (singleton)

      Sites D/E with light

      Sites after D/E no light

      % of singleton

      LATE

      Log2=1

      18H

      936

      906

      96.8%

      798

      786

      98.5%

      LATE

      Log2=2

      18H

      884

      866

      98.0%

      767

      759

      99.0%

      LATE

      Log2=3

      18H

      819

      810

      98.9%

      722

      716

      99.2%

      HYTANE

      Log2=1

      18H

      1186

      1159

      97.7%

      1037

      1032

      99.5%

      HYTANE

      Log2=2

      18H

      1128

      1110

      98.4%

      998

      993

      99.5%

      HYTANE

      Log2=3

      18H

      1035

      1025

      99.0%

      924

      919

      99.5%

      LATE

      Log2=1

      6H

      755

      732

      97.0%

      656

      645

      98.3%

      LATE

      Log2=2

      6H

      711

      700

      98.5%

      630

      623

      98.9%

      LATE

      Log2=3

      6H

      671

      666

      99.3%

      601

      597

      99.3%

      HYTANE

      Log2=1

      6H

      1028

      988

      96.1%

      899

      890

      99.0%

      HYTANE

      Log2=2

      6H

      955

      931

      97.5%

      851

      844

      99.2%

      HYTANE

      Log2=3

      6H

      882

      871

      98.8%

      795

      791

      99.5%

      LATE

      Log2=1

      1H

      445

      423

      95.1%

      380

      372

      97.9%

      LATE

      Log2=2

      1H

      411

      402

      97.8%

      361

      355

      98.3%

      LATE

      Log2=3

      1H

      386

      380

      98.4%

      344

      338

      98.3%

      HYTANE

      Log2=1

      1H

      593

      559

      94.3%

      513

      506

      98.6%

      HYTANE

      Log2=2

      1H

      544

      532

      97.8%

      488

      482

      98.8%

      HYTANE

      Log2=3

      1H

      508

      498

      98.0%

      461

      455

      98.7%

      In the cell-based experiments of caspase-3, we induced apoptosis on both cell types (over-expressing caspase-3 and the control). Therefore, in this case, as the reviewer has also mentioned, a cut-off of 2 is appropriate because the control cells are also affected by caspase activation. Following the reviewer’s request we have added (in a separate PDF file) several representative MS/MS spectra of neo-N-term peptides and their corresponding quantification data.

      Page 4 - replace 'without labelling of lysine residues (epsilon-amines)' to 'without notable labelling of lysine residues (epsilon-amines)', as residual labelling of lysine side-chains was observed. Also in case of the latter, do note that reduced MS-ionization potential might impact labelling efficiency calculation, and chromatographic detection of labelling efficiency should be considered to conclusify this finding.

      We thank the reviewer for this comment. We have changed the sentence as requested (Page 4 marked in red). Regarding the labeling efficiency calculations, it is true that ionization potential might affect them. We used a common way to test this aspect (see e.g. Hurtado Silva et al 2019 (PMID: 30934878)) and we are not aware of any reduction in ionization potential following lysine dimethylation. Although we did not study this aspect thoroughly, we frequently observe the opposite: that dimethylation improves MS detections.

      Page 6 - The experimental setup comparing caspase-3 overexpressing and ABT-199 induced versus ABT-199 induced cells will be highly biased as it will not be able to detect efficient caspase-3 cleavages (Plasman et al., 2011), as such cleavage events are complete and thus do not require any additional overexpressed capase-3. I see this as an important flaw and the authors should demonstrate that the list also includes efficient caspase-3 cleavages.

      We thank the reviewer for highlighting this important aspect. We agree that with our setup, we can miss some efficient cleavages of caspases-3. We acknowledged this caveat in the original text (page 6), but chose to perform our experiments this way in order to highlight cleavages that are affected by caspase-3 expression. To address the reviewer’s comment we have added new experiment and data on caspase cleavages that occur following ABT-199 treatment in HCT116 cells without overexpression of caspase-3. The focus of this experiment was on the relatively short time points following the ABT-199 treatment when no cell death is observed based on XTT assay (see Supplement Figure 6B). This experiment was used to prove that neo-Nt-acetylation of NACA is an early event in apoptosis (Figure 5 E-F page 12). We also used this experiment as an indication of the appearance of efficient cleavages. As can be seen from Supplement Table S10, if we consider all 3 time points of the ABT-199 treatment, we quantified 106 cleavages with free neo-Nt that were cleavages after D and were identified only in the ABT-treated samples. We refer to such cleavages, which appeared prior to noticeable cell death, as "efficient cleavages". Out of these efficient cleavages, 82 were also identified and quantified in the cell-based experiment with overexpression of caspase-3. Twenty efficient cleavages show a high ratio (≥2) in both experiments. Fifty six efficient cleavages had a high ratio in the new experiment and ratio below 2 in the cell-based experiment with overexpression of caspase-3. This supports our original claim regarding efficient cleavages and addresses the reviewer’s concern regarding our ability to identify efficient caspase-3 cleavages with the experimental setup of HCT116 cells overexpressing caspase-3.

      Page 12 - The setup doesn't permit ORF N-terminal stability per se, rather the cleavage susceptibly of N-termini holding (a) putative caspase-3 cleavage site(s). Please adjust accordingly. Again since the setup might have missed efficient cleavages, the assessment might be biased.

      Thanks for the comment. As requested, the word “stability” has been deleted. As discussed above, we demonstrate that our setup does allow the identification of efficient cleavages and hence our basis for believing that the assessment is not biased. Please also refer to our reply to the next comment.

      The claim that Nt-acetylation is protective for caspase-3 cleavage should be validated by monitoring cleavage efficiency of an Nt-acetylated versus an Nt-free variant (e.g. by introducing a Pro residue at AA position 2, or comparing cleavage efficiencies in corresponding NAT knockdown versus control cells) of an identified caspase substrate (i.e. a substrate holding a caspase-3 cleavage site in its N-terminal sequence) versus its Nt-free counterpart

      Thanks for raising this point. The reviewer's suggestions have some caveats: a mutation at a protein’s N-terminus in order to generate an Nt-free variant can alter its stability or function and NAT knockdown might have a profound biological impact on the cells. Therefore we chose a different way to study this aspect by selecting from our data ORF N-terminal peptides that were identified with both free N-termini and acetylated N-termini (i.e. the same peptide was identified in some PSMs as acetylated and in other as dimethylated). We managed to find 136 ORF N-terminal peptides that were quantified in both forms, and out of these, 122 contained Asp or Glu residues (the putative caspase cleavage motifs). We added the comparison of the abundance ratios of these peptides in Figure 4C (see also below). It shows a remarkable difference between the groups when the Nt-acetylated peptides ratios did not change as a result of caspase-3 overexpression while the peptides with free Nt were identified mostly in the control cells (negative Log2(caspase-3/control)). Comparison of the 14 ORF Nt-peptides that do not have Glu or Asp in their sequence shows no difference (see below).

      Figure 3: Abundance ratio distributions of the ORF Nt peptides identified with both Nt-acetylated and free Nt in HCT116 cells overexpressing caspase-3 and in the control. A. Comparison of peptides that contain putative caspase cleavage in their sequence (D or E) B. comparison of peptides without putative caspase cleavage

      These results provide additional support for the notion of the protective or shielding effect of Nt-acetylation against caspase-3 cleavage.

      Page 12 - Since post-translational Nt-acetylation of neo-N-termini could be reproduced in vitro in the non-dialyzed sample, enzymatic over chemical Nt-acetylation should be demonstrated (e.g. by the use of a (bisubstrate) NAT inhibitor).

      We think this is an interesting idea for future work. Yet, in our opinion, the fact that only very few neo-Nt-acetylated peptides were affected in vitro and that a similar trend of few selected neo-Nt-acetylation targets was shown in the cell-based experiments indicates that this process is enzymatic and not chemical in nature.

      Other concerns:

      Abstract - The abstracts holds complex/incorrect sentence constructions (e.g. simply indicate 'Protein N-termini', '... undergo ... processing by proteases' (currently: 'not be processed by proteases').

      Thanks for pointing this out. We have changed the abstract accordingly.

      Abstract - 'To expand the coverage of the N-terminome' only applies when this is used in conjunction with other negative enrichment strategies as by itself, LATE doesn't intrinsically provide a better coverage of the N-terminome (this is also noted at page 2).

      We thank the reviewer for pointing this out. We have changed the abstract accordingly.

      Change 'that cannot be identified by other methods' to 'that cannot be identified by other negative selection methods'

      Thanks for pointing this out. We believe that our description here is appropriate as we explicitly state “some of which cannot be identified by other methods”.

      Page 1 - Suggestion to change 'Proteases are typically described as degradative enzymes' to 'Proteases used to be described as degradative enzymes'

      Changed as suggested.

      Page 1 - Not really correct how written; 'N-terminomics methods highlight the N-terminal fragment of every protein (N-terminome)'

      Changed as suggested.

      Page 2 - Positive selection techniques .... Enrichment of unblocked (or Nt-free) N-termini

      We are not sure what the reviewer had in mind here but have added the text in the brackets

      Page 2 - Besides altering charge, Nt-acetylation also alters hydrophobicity ...

      Changed as suggested.

      Page 2 - remove 'to better chart'

      Changed as suggested.

      Page 2 etc. - Do note that caspase-3 can potentially activate downstream caspases in vitro

      Following this comment, we have added a sentence on Page 5 with this reservation

      Page 3 - functional crosstalk between proteolysis and neo-Nt-acetylation has already been demonstrated in the case of co-translational acting methionine aminopeptidases and chloroplast N-terminal acetyltransferases. Adjust accordingly.

      We thank the reviewer for highlighting this aspect, although we used the term “neo-Nt-acetylation” which we used to mark that this is not the common (co-translational) acetylation. To assure that this is more clear we have added the words “post-translational” to better define the novelty of our findings.

      Page 3 - when discussing the identification of ORF N-termini, note that some of the strategies of which note when used to enrich for in vivo blocked N-termini, can also be used without blocking/labelling of Lys residues, and thus trypsin will also result in Lys-ending peptides. This is important to consider in this context.

      Following the reviewer's remark we have changed the sentence so it now states: “Many of these N-terminomics methods……”

      Page 3 - remove the following sentence part; '... or run individually which can be useful for quantifying naturally modified N-termini.', since also a differential/labelled proteomics setup enables such assessment. Related to this, the authors should comment on the observation that much fewer (i.e. less than 40%) Nt-acetylated N-termini were identified by LATE as compared to HYTANE. How is this reflected in the number of PSMs? Probably these difference are further intensified when considering PSMs.

      We have changed the sentence as suggested.

      Regarding the reduction of Nt-acetylation, we thank the reviewer for this question as it led us to find typos in the numbers in Figure 1E which are now corrected. These typos did not change the overall observation that with LATE we identify fewer Nt-acetylated peptides than Nt-free (dimethylated) peptides. As the reviewer anticipated (see below), the reduction in LATE-based “contribution” to the identification of Nt-acetylated peptides as opposed to the identification of dimethylated peptides, is pronounced when considering PSMs but this is not much different than the peptide-based data. Therefore, we prefer to keep the current presentation of Figure 1E.

      Figure 4: Comparison of HACAT cells N-terminal peptides identification with LATE and HYTANE when considering peptide sequences and PSMs. Peptides identified with both methods are in green and those that are unique to one method are in blue. Shared peptides were determined based on the sequence of the first 7 amino acids of the identified peptides. A. comparison for peptides with dimethylated N-terminal (free Nt) B. comparison for Nt-acetylated peptides.

      Page 6 - Informative to indicate how many of the in silico predicted putative DEVD P4-P1 cleavages were actually present in the list of 2049 putative cleavages identified.

      In our dataset, we identified 17 cleavages after DEVD motif. 11 were identified only with HYTANE, 3 were identified by both methods, and 3 more were identified only with LATE. Of note, it seems that in large-scale proteomic studies of apoptosis, the number of caspase cleavages after DEVD motif is quite low. For example, in the CASBAH database (PMID: 17273173__) __there are 10 reports of such cleavage out of 391 reported sites, and in DegraBase (PMID: 23264352) that combined many different apoptotic experiments there are 64 reported DEVD sites out of a total of 6896 P1-Asp sites.

      Page 6 - Unclear if any of the of 2049 putative cleavages, included non-canonical P1 cleavages besides the P1 Asp and Glu cleavages identified.

      These are 2049 putative cleavage sites with P1 Asp or Glu. We have changed the text to make it clearer.

      Page 6 - Were the 'regular' cells mock transfected?

      No. The control cells used for the cell-based experiments were the non-transfected cells from the same culture of HCT166. We chose this option to guarantee that exactly the same cells that were grown in the same dish went through the same FACS sorting as a control.

      Page 6 -Important to note that an ORF can have multiple N-termini besides neo-N-termini (e.g. in the case of alternative translation initiation)

      Thanks for the great point. We have added an indication if the neo-N-termini site has been reported as an alternative translation initiation site to all of the results of the cell-based experiments (Supplementary Tables S4, S5, S6, S9). We also changed the Figures and text accordingly. Our analysis of reported/unreported neo-N-temini is based on the TopFind database which includes information about alternative translation initiation sites from TISdb. Of note, since our focus is on caspase cleavages and we further select putative cleavages based on D/E in P1 and fold change, out of 973 peptides that we reported as putative caspase cleavage (Table S6) only one is in the vicinity of an alternative initiation site.

      Page 6 - The authors should be more careful with generalization when comparing LATE and HYTANE (and other degradomics approaches) as in this study LATE was only applied for the identification of caspase-3 neo-N-termini, which by its extended substrate specificity might hold specific features enabling the preferred detection by one technique over the other. Also note that as compared to less recent studies, evidently the MS instrument used is a key factor in the increase in cleavages reported in the current study.

      It is conceivable that caspase cleavage may differ from other proteases and thus theoretically work better with LATE, but we fail to see why this would also be the case for other N-terminomics method (like TAILS, Subtiligase, CoFRADIC, ChaFRADIC etc). We showed that LATE provides additional ORF Nt peptides identifications and demonstrated its effectiveness in E. coli (Supplement Figure S2) also, which has a proteome with a different amino acid composition to the human proteome. Furthermore, using LATE in the cell-based experiment led to the identification of many neo-Nt-peptides that do not match caspase cleavage patterns (as indicated for both HYATNE and LATE in Figures 3E and 3F). We reviewed the text again, and believe that we have used a fair description of the results especially when we compared them to previous studies.

      Page 9 - The authors should provide some info/supporting statistics in the text regarding the new putative substrates showing GO-enrichments (compared to which control?) similar to previously reported caspase-3 substrates.

      The results of the GO enrichment analysis are presented in Fig. S8 and details about how the test was performed are provided in the Materials & Methods. In the revised version, we are including the numerical data that include results of the statistical tests per GO term as Table S12. The enrichment analysis was performed with respect to the whole human proteome.

      Page 11 - Indicate that the 11 neo-N-terminal peptides of which note are the neo-Nt-peptides matching (signal peptide) cleavages indicated in the Uniprot database. Were any corresponding di-methylated neo-N-termini of these cleavages identified? In case of the 'other' proteolytic cleavages of which note, refer to these as not-annotated in UniProt.

      We thank the reviewer for pointing this out. We have added an indication that this analysis is based on UniProt annotations. Yes, all of the reported 11 neo-Nt-Acet peptides shown in Figure 4 were also found as neo-Nt-DiMet peptides.

      Page 11 - post-translational Nt-acetylation is abundant in plant and the responsible NAT has been identified, please reference these studies as well.

      We thank the reviewer for pointing this out regarding page 11. A relevant reference has been added in Page 11. In the discussion, we already referenced Nt-acetylation in plants in the discussion as well (see page 14).

      Page 12 - Define 'undoubtedly dependent on caspase-3 cleavage'

      We thank the reviewer for pointing this out. The word ‘undoubtedly’ has been deleted.

      Page 14 - The NAA30 discussion is not really relevant for the discussion of the post-translational Nt-acetylation of mitochondrial neo-N-termini.

      We thank the reviewer for pointing this out. This sentence has been deleted.

      Viewing the harsh in vitro caspase-3 cleavage condition used, namely 1 µg caspase 3 over 20 µg protein, the P1 specificities of all identified neo-N-termini should clearly be shown.

      The P1 specificities of all neo-N-termini found in the in vitro experiment are listed in the supplementary tables S2 and S3. For the reviewer’s convenience, we are providing the table with the P1 specificities below:

      Since acetylation of serine and threonine residues are reported forms of post-translational modification, and many so-called past-translational Nt-acetylated neo-N-termini harbour such AA residues in their N-terminal sequence, b-ion coverage for these neo-N-termini should be provided/inspected.

      We are not sure that we understand this comment. O-Acetylation of amino acids refers to their side chain. Since we are using Di-methylation labeling in both HYTANE and LATE, if we have a peptide with O-acetylated Ser or Thr at its first position, it is possible to distinguish it from the same peptide with Nt-acetylation by MS1 as illustrated in the following table for a hypothetical peptide SAAANPELKR (mass is MH+1)

      Regardless we include in the manuscript MS/MS spectra of NACA Neo-Nt-acetylated peptide by HYTANE and LATE

      Reviewer #2

      Major suggestions:

      • The LATE method relies on digestion with LysN. Can the authors comment on the digestion efficiency of the samples where the LATE workflow was applied?

      The LysN digestion details that we used were based on vendor (Promega) instructions combined with details from the Nature Protocol paper by Giansanti et al 2016 (PMID: 27123950__)__ that describes optimized digestion protocol for LysN. We tested LysN efficiency in terms of the identification of missed cleavage and found that it performed very well with a missed-cleavage rate of

      • The authors state that the number of peptides with acetylated N-termini was lower compared with HYTANE. Yet, the Nt-acetylation can occur co-translationally in approximately 85% of human proteins.

      Did the authors consider optimizing the method (e.g. by fractionating the sample) for better identification of such peptides?

      We thank the reviewer for this important comment. We are certain that it is possible to improve the output of LATE by fractionation and/or optimization by changes to the LC gradient as it is well established for most, if not all, bottom-up proteomics methods. In this work, we concentrated more on the proof of concept of the methodology and hence chose to work without fractionation. We performed one attempt to optimize the LC gradient but found that the results were not significantly different, and we thus used the same LC-MS methods that have been optimized for trypsin.

      Regarding the reduced identification of Nt-acetylated peptides, as we state in the manuscript following Cho et al 2016 (PMID: 26889926), we believe that this is mainly due to the reduced ionization efficiency of Nt-acetylated peptides compared to Nt-dimethylated peptides which is more pronounced when a C-terminal positive charge is missing (due to LysN digestion).

      Also, were the results of the study compared with searches done using other proteomic pipelines (e.g. FragPipe)?

      Unfortunately, when we started this project, MS-Fragger did not support LysN as the digesting enzyme. At the time TPP also provided better visualization and quantification support than FragPipe. Recently, we found that MSFragger is faster while providing similar identifications but we are not convinced of the quantification output via FragPipe. In addition, we performed comparisons of Comet to X!Tandem and while the searches took longer than with Comet, the total number of IDs did not improve significantly.

      Can the authors provide details on the settings used for searches done in COMET, especially for the samples treated with LysN?

      The settings are provided in Table S10 in the supplementary information (Page 14 of the PDF file).

      "Fractions containing relatively pure caspase-3 were pooled together and dialyzed against 20 mM HEPES 7.5, and 80 mM NaCl. Aliquots of the protein were stored at -80{degree sign}C"

      o What exactly is meant by 'relatively pure'?

      We apologize for the inaccurate description. The relevant text has been updated (Page 17) and now explains that this was based on Coommasse stain SDS-PAGE.

      Minor suggestions:

      • Please check the link for the Github as this reviewer could not open it.

      We thank the reviewer for pointing this out. We corrected the link. In any case, the relevant scripts can be found here: https://github.com/OKLAB2016

      • Please correct the spelling.

      The manuscript was proofread.

      Comments regarding figures:

      • Figure 2:

      o All figures comparing LATE and HYTANE utilize color green for LATE. Yet, in figure 2G, HYTANE is depicted in green-like color. Please consider staying consistent with the color scheme.

      We thank the reviewer for this comment. Done as suggested.

      Reviewer #2 (Significance (Required)):

      Significance:

      • The LATE method provides an excellent way to study proteases in vitro or in cell-based experiments. It enables deep investigation of N-terminome based on a simple and cost-effective workflow that utilizes digestion with LysN followed by chemical derivatization of α-amines. This approach allows for the identification of N-terminal peptides that may escape detection by other N-terminomics methods. With LATE, proteases' cleavage sites that might not so far be reporter due to technical limitations, can be studied and characterized. Hence, LATE is a useful addition to the N-terminomic toolbox.

      We thank the reviewer for the positive comments and general assessment of LATE.

      Reviewer #3

      In this manuscript, Hanna et al. report LATE, an N terminomics method similar to N-TAILS and HYTANE, with modifications that enhance or change coverages of the N-terminal proteome in proteomics datasets. LATE relies on selective N-terminal modification of protease-treated, LysN digested samples, enabling internal peptides to be depleted based on the presence of the unblocked lysine epsilon amine. Using LATE in comparison with HYTANE, the authors identified a large number of both known and unknown caspase-3 cleavage sites, both in vitro and in vivo. Because LATE enables identification of both proteolytic neo-N termini and natively blocked N termini such as those that are acetylated, the authors were able to discover a number of post-translationally acetylated proteolytic neo-N termini. This finding points to potential functional cross talk between apoptotic proteolysis and Nt-acetylation. Overall, this is a very nice manuscript that adds a valuable new tool to the N-terminal proteomics toolbox. However, the manuscript could be improved by addressing the following questions and comments.

      We thank the reviewer for this assessment.

      1. One of the benchmark points used to describe the need for a new technology such as LATE is the idea that there are 134 putative caspase-3 substrates in the human proteome, of which only about half can be identified based on ArgC cleavages. However, the 134 substrates seem to include only those that have the exact canonical DEVD motif. Many more substrates than this are already known for caspase-3. For example, >900 caspase-3 substrates were identified by Araya et al. alone. It might make more sense to apply a position-specific scoring matric to the human proteome to predict a maximum number of possible caspase-2 cleavage sites and how many would be expected to be identified using other technologies. Otherwise, please provide a rationale for why these 134 putative caspase-3 sites are representative.

      The reviewer is correct. Indeed, most of the identified caspase-3 cleavage are not exact matches to the DEVD motif. We used the DEVD as an example to illustrate the added value of using lysine-based digestion together with ArgC. We obtained a similar trend with some variations when we tested the feasibility of the identification of the human ORF Nt-peptides, E. coli ORF Nt-peptides and more. We are quite confident that any prediction will show a relatively similar distribution. To demonstrate this, we show here the relative contribution of each method for the identification of any peptide that begins after Asp in the human proteome.

      While the distributions are not identical, they are very similar, and the specific additions from LATE (LysN) are between 20% to 22% out of the total and it can help to expand the coverage by 42% to 45%.

      It is definitely plausible and have been previously demonstrated that selective N-terminal demethylation can be achieved under the right reaction conditions, and I do not doubt that it has been achieved here. However, I do not understand how the authors were able to conclude that alpha-amines are blocked with 95% efficiency and lysines are blocked at

      This is a very good point. The reviewer is correct and indeed we don’t have a way to establish if the dimethylation is on the side chain amine of lysine or on its N-terminal amine. A partial support for our claim is from labeling experiments that we (and others) conducted with tryptic and LysC peptides that clearly demonstrate that under the specified labeling conditions, 95% of the N-terminal amines are labeled and not the lysine side chain amines. However, at the end of the day, this does not change the outcome of LATE.

      Related to the above comment, Table S10 seems to indicate that MS/MS data from LATE were searched with dimethylation as a fixed modification at the N terminus. Were LATE samples searched with different parameters to generate Figure 1C? Are the dimethylated Ks identified mostly from missed cleavages and therefore not at the N terminus?

      We thank the reviewer for pointing this out. The search parameters used for the generation of Figure 1C have been added to Table S10. The reviewer is correct, the few dimethylated Ks identified in the search used for Figure 1C are mostly from missed cleavages.

      For both the in vitro and in vivo experiments, how many of the new caspase-3 cleavage sites occurred in proteins that were not previously known to be caspase substrates?

      In the in vitro experiments, we identified cleavages of 372 proteins that were not reported as caspase-3 substrates based on the databases we used as references. A line specifying this number was also added to text on page 7. In the cell-based experiment, we identified putative caspase-3 cleavages of 67 proteins that were not reported so far as caspase-3 substrates. This information has been added to the main text on page 10. We have added columns indicating the known/unreported protein substrates to Tables S2, S3, S4, S5, and S6.

      For the experiment in cells, can the authors explain the rationale for comparing cells in which apoptosis is induced with ABT-199 to ABT-199-treated cells with caspase-3 overexpression? What is the advantage over comparing ABT-199 treated cells to untreated cells

      Great question. An N-terminomics study of “common” apoptosis would lead mainly to the identification of effector caspases (caspase-3 and -7) substrates. Our aim was to focus mostly on the caspase-3 cleavages that occur in the cell during apoptosis. In choosing this gain-of-function approach we were motivated by the idea that it couldprovide new insights that would otherwise go undetected when using knockout or other loss-of-function approaches. The advantage of this system over comparing ABT-199 treated to non-treated cells (which we have now added as well) is that it can enhance the identification of caspase-3 specific cleavages.

      Can the authors discuss the timescale of cell death in ABT-199 treated cells vs. ABT-199 treated caspase-3-overexpressing cells. Ideally, data showing cell viability over time (e.g. Cell Titer Glo or MTT assays) would be presented, but if the authors could at least describe whether apoptosis is accelerated in the caspase-3 overexpressing cells, it would be helpful.

      Great suggestion. Following the reviewer’s suggestion we have characterized the effect of caspase-3 overexpression of the cells by XTT assay, and indeed caspase-3 overexpressing cells do show accelerated cell-death in response to ABT199 compared to non-transfected cells. These results are now presented as Supplement Figure S6B and are mentioned in the results section.

      The authors say that in their experimental design, they expect to see no difference between ABT-199 only and ABT-199/caspase-3 overexpression for substrates that are cleaved efficiently by endogenous caspases. If the new caspase-3 substrates are not cleaved efficiently by endogenous caspase-3, this seems to call into question their physiological relevance. Can the authors explain more thoroughly how these new substrates fit into the apoptotic program?

      We thank the reviewer for raising this issue. We are aware that our original cell-based experimental design may have some limitations, yet we chose this gain-of-function setup in order to identify caspase-3 substrates in a cell-based system. We believe that this setup does allow identification of substrates that are efficiently cleaved by endogenous caspase-3, such as cleavage and acetylation of NACA at Ser34 (and neo-Nt-acetylation after caspase-3 cleavage in general). To study the physiological relevance of the neo-Nt-acetylation, we have added to the revised manuscript a time-course N-terminomics characterization of early apoptosis events conducted in HCT116 cells (without caspase-3 overexpression). The results of these experiments are now shown in Figure 5C and also in the Supplementary Table

      The authors convincingly show that cleaved NACA is a neo-substrate for Nt-acetylation, suggesting functional crosstalk between proteolysis and acetylation. However, it is not clear if this acetylation event has a functional consequence, so it seems inaccurate to say at the top of page 3 that "This is the first demonstration of functional crosstalk between neo-Nt-acetylation and proteolytic pathways."

      The author is correct. We changed the text accordingly.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01588

      Corresponding author(s): Erh-Min, LAI

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      The authors thank the reviewers for the positive and valuable comments, which have helped us to improve the quality of this work. We have addressed all comments by providing additional data and/or explanation with a detailed point-by-point response. The revised manuscript included new data: 1) viable cell counts of growth inhibition assay (Fig. 2A), 2) Quantitative data of microscope data (Fig. 2C, Fig. 4), 3) quantitative data of interabacterial competition (Fig. 5A, 5B), western blotting data of growth inhibition (Fig. S1A and S1B), secretion assay of single glycine-zipper mutants (Fig. 5C), and inclusion of full gel of western blot results (Fig. S3 and S5). By integrating these new results, we have substantially strengthened the findings that a glycine zipper motif of a type VI secretion effector T6SS Tde1contributes to its translocation across the cytoplasmic membrane of target cells.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: In this manuscript, Ali et al. propose that a glycine zipper motif located at the N-terminus of the Agrobacterium tumefaciens T6SS DNase effector, Tde1, can transport the toxin across the cytoplasmic membrane and into the cytoplasm, where its target is found. To support these claims, they perform a series of secretion, competition, toxicity, and fluorescence microscopy assays showing that a mutation in two glycine residues affects toxicity of the effector during competition and its ability to enter a target cell, but not its secretion through the T6SS or its binding to the adaptor protein Tap1. The concept brought forth in this study is quite interesting and important - the notion that T6SS effectors have domains that aid in their transport into the cytoplasm of the target cell. This is similar to a recent finding that a domain common to bacterial pyocins and T6SS effectors can mediate DNase toxin transport through the target cell's cytoplasmic membrane (Atanaskovic et al., mBio, 2022); the authors should mention and discuss this recent work. Nevertheless, it is my impression that the results do not fully support the conclusions and proposed mechanism, even though the general idea seems correct.

      Ans: We thank this reviewer found this work interesting and important. We hope the revised manuscript including the new data and careful interpretation have substantialized the conclusions and proposed mechanisms. We also included the excellent work by Atanaskovic et al., 2022 and discussed the findings in the revision (see lines 344-349).

      Major comments:

      • An experiment that directly demonstrates the ability of the glycine zipper to mediate transport of a toxin across a membrane would greatly support and solidify the conclusions of this work. For example, showing the ability of a purified protein to enter spheroplasts or liposomes in a glycine zipper dependent fashion. Currently, the authors perform experiments that can only indirectly support the proposed function of the glycine zipper to enable the effector to cross the membrane, and as detailed below, some of these experiments are over-interpreted in my opinion. Ans: We agree that the direct evidence for the ability of the glycine zipper to mediate Tde1 transport across target cell membrane is to perform the in vitro translocation assay. Unfortunately, the attempts to purify sufficeint amounts of full-length or N-termial version of Tde1 have not been successful. Therefore, we are unable to perform this experiment. Accoringly, we have tried our best to carefully interpret the data and rephrase the statements accordingly.

      • Lines 153-159: It is not clear how much these results are relevant to the activity of the glycine zipper motif during effector delivery by the T6SS. If I understand correctly, the described experiments are of over-expression of the proteins in the E. coli cytoplasm, where glycine zipper-dependent membrane permeability and toxicity are detected. However, one would expect that if the effector is to be transported from the periplasm to the cytoplasm during T6SS delivery, then the glycine zipper should function from the periplasmic face of the cytoplasmic membrane, and not from its cytoplasmic face, as is the case in these experiments. Is it possible that the observed toxicity and membrane permeability be the result of over-expression in the "wrong place"? Ans: The reviewer is right that Tde1 should permealize cytoplasmic membrane from periplasmic side upon injection from the attacker based on our proposed model. The purpose of ectopic expression of Tde1 and its variant in E. coli is to dissect the region and motif of Tde1 DNase-independent toxicity and the ability in enhancing membrane permeability regardless of which sides of cytoplasmic membrane the Tde1 mediates toxicity and permeability. The results of glycine zipper-dependent toxicity and membrane permeability provide a ground work for the experiments of secretion and interabacterial compeittion in the context of active T6SS action to determine the role of glycine zipper in Tde1 export and translocation.

      • Fig. 4B: This figure appears to be very important, and the authors base a large part of their main conclusion regarding the role of the glycine zipper in membrane crossing on it. However, some controls are missing and part of the results observed in the figure do not match their description in the text. • Lines 233-237 - While the authors state in the text that GFP and mCherry signals did not overlap in E. coli cells co-cultured with Agrobacterium cells expressing Tde1(M)-GLGL, I see many double-colored cells in this sample (bottom panels in Fig. 4B). Actually, all cells appear to have both green and blue colors, except for a few cells that are only green but that also seem to be dead judging by their ghostly appearance in the phase contrast channel.

      Ans: We thank the reviewer pointed this out. By looking at this particular image more carefully, it is striking that the majority of cells seem to emit both green and blue colors from this Tde1(M)GLGL sample. We have performed a total of three indepenent experiments for this translocation assay and all results except this particular sample in this particular experiment are consistent in all three independent experiments. Honestly, we could not explain this result and a possibility is this sample might be accidentally mixed with another sample. Because this is the only sample with inconsistent result with another two independent experiments, we decided NOT to use the results from this independent experiment and instead performed another independent experiment. We now have included the quantitative data from three effective independent experiments and show the representative images in Figure 4.

      How is it that all cells in the bottom panels are blue (indicating that they are E. coli target cells)? Shouldn't a large portion of the cells be Agrobacterium cells that should not be blue, since these are added at the beginning of the competition assay at a 10:1 ratio in their favor? Ans: As explained above, we have no defined answer and decided to perform additional repeats, which are consistent with results of another two independent experiments.

      It is quite remarkable that so much GFP signal is transported into the E. coli target cells so that it is so clearly visible under the microscope. How do the authors know that the GFP signal overlapping with the mCherry is really inside the cell and not outside (for example, proteins secreted to the media that attach to the cell envelope)? Will the GFP signal remain if trypsin is added to the media before visualization under the microscope? Ans: Indeed, our quantitative data show there are ~50% cells have GFP overlapping with mCherry in the translocation positive samples. The signals should be inside the cells because no overlay signals were observed from N-Tde1GLGL or Tde1(M)GLGLeven though they are secreted.

      Can the authors quantify the ratio of E. coli cells that have overlapping green and blue colors over several experiments for each sample, to show that this phenomenon repeats and is statistically significant? Ans: Yes, see quantitative data in Figure 4.

      Can the authors explain why at least some of these E. coli cells should not be dead due to the toxicity mediated by the third effector of the Agrobacterium T6SS, Tae? Ans: In Agrobacterium tumefaciens C58, Tde1/2 are the major effectors contributing to antibacterial activity. Tae effector has little impact on interbacterial competition outcome (see previous publications Ma et al., 2014 doi: 10.1016/j.chom.2014.06.002.; Yu et al., 2020 doi.org/10.1128/JB.00490-20)

      Why were the microscopy competitions performed differently than the regular competition assays? Why wasn't AK media used in these competitions? How active is the T6SS under these conditions compared to the AK media? Ans: We have tried to use AK medium for the translocation assay but only very weak fluorescent signals can be observed likely due to the low expression when grown on this nutrient poor medium. In order to correlate the results of the compeittion assay with translcoation experiment, we have performed E. coli killing assay using LB medium that is used for translocation experiment now. For the interbacterial competition against agrobacterial siblings, we still used AK medium for competition because no detectable interbacterial compettion activity could be observed between two A. tumefaciens strains on LB agar. As reported earlier, stronger interbacterial competition outcome was detected from co-culture on AK than other nutrient rich medium while the secretion activity grown in AK medium is lower (Yu et al., 2020 doi.org/10.1128/JB.00490-20). These results indicate that the factors other than secretion activity also impacted recipient cell susceptibity, which however is not the main focous of this work.

      In the N-Tde1 sample, many Agrobacterium cells appear to have the GFP signal in foci rather than distributed throughout the cell (as it is in other samples), while the E. coli cells have a uniform and strong GFP signal. Can the authors comment on that? Ans: Thanks the reviewer for raising this question.We are also curious about the Tde1 glycine zipper-dependent GFP foci and now include this potential explanation in the Discussion of revised manuscript (line 387-406). To this end, we do not have an answer for it. Because glycine zipper repeats are known to interact with membrane, it is possible that Tde1 proteins may preferntially bind to microdomain of cytoplasmic membrane, which was recently found in A. tumefaciens (Czolkoss et al., 2021). We also found that Tde1 proteins (either tagged with HA or GFP) are proned for truncation when they are ectopically expressed in E. coli or when Tdi1 is absent or not equivalent. Thus, it is possible that Tde1-GFP proteins are truncated after translocation into E. coli cells, in which most GFP signals are emitted from free GFP instead of Tde1-GFP. The stability of free GFP derived from translocated Tde1-GFP may also explain the high percentage of E. coli cells exhibiting overlayed GFP/mCherry signals.

      It might be easier for readers to visualize this figure and see the signal distribution in the different cells if the authors show a zoomed in version in the main text, and provide the wide field images as a supplementary figure. Ans: We have tried to include zoom-in images but the resolution is not good. We have improved the quality of images in the Figure 4 and believe the images are clear to see individual and overlayed fluorescence signals.

      • Fig. 5C-D: The reduced expression and secretion of the GLGL mutant is considerable. How can the authors rule out that this reduction was the cause for the reduced observed toxicity of the mutant in 5A-B? Moreover, the results show that the GLGL double mutant is hampered in expression, secretion, and DNase activity, and it negatively affects overall T6SS activity. Since this mutant was used throughout the paper, and in the absence of a direct assay showing membrane transport mediated by the glycine zipper motif, the claim of the role of this motif in membrane crossing is not well substantiated by the results. If the authors were to show that the single glycine mutants used in Fig. 5D, which are stable and have an intact DNase activity, behave as claimed in the final conclusion sentence (lines 279-283), then the conclusions will be better substantiated by the results. Ans: Thank you very much for suggesting this important experiment. We have now constructed the single G39L and G43L variants expressed together with Tdi1 in A. tumefaciens tdei mutant for both secretion and interbacterial competition assays (see description in lines 259-280 and Fig. 5). As shown in Figure 5, both G39L and G43L variants are expressed and secreted at similar or even higher levels than wild type Tde1 but have no detectable antibacterial activity against either E. coli or A. tumefaciens 1D1609. This result substantializes the role of this glycine zipper motif in translocation.

      Minor comments:

      • Line 93: I am not sure that Ntox15 should still be referred to as a "novel" domain.

      Ans: despite the evidence of this domain as DNase, the name of Ntox15 is used. We think to keep this nomenclatture as it will be easier to be ditinquished from other nuclease or toxin domain.

      • Line 105: The section's heading does not actually describe its content. The results here only show toxicity upon over-expression of the effector or its mutant forms in E. coli. Therefore, this cannot be referred to as a "prey cell" since the effector was not transported into it during competition. Moreover, the results in Fig. 5A do not support DNase-independent toxicity during competition. Ans: The heading is changed to “Tde1 exhibits DNase-independent growth inhibition in E. coli” (line 115).

      • Please consider making all of the symbols in the growth assays semi-transparent. It is impossible to discern between the different, overlapping curves. Ans: The growth curve results are improved by changing line colors and reducing size bars (Fig. 1B, 1C; Fig. 2A, 2D)

      • Please consider making the size bars in all microscopy images more pronounced. They are barely visible in their current form. Also, it would be better to show images of the same magnification/zoom for the different samples, since the current presentation shows cells from different samples at different sizes, and it can be confusing to the readers. Ans: Amended (Fig. 2C; Fig. 4).

      • In Fig. 1B and in Fig. 2A the authors show that expression of Tde1(M) in cells is toxic, yet in Fig. 2D they see no toxicity. Can the authors please comment on this discrepancy? Ans: Fig. 2D showed the viability of E. coli cells after Tde1 variants were induced for 1 hr before ONPG uptake assay to indicate the increased membrane permeability is not due to cell death. In Fig. 1B, the growth inhibition of Tde1(M) is also not evident at 1 hr. So, the results are consistent.

      • I am not convinced that the assay in Fig. 2E can be used to determine bacteriostatic/bacteriolytic effect. It is not clear how such a distinction can be made from OD measurements, since an increase in OD can result from the entire population growing after removal of the stressor, or just part of the population that did not lyse/die. To make such a claim, the authors can spot bacteria on repressing media at different timepoints after protein induction, and then determine CFU.

      Ans: The increased OD600 value during recovery could be caused by either resumed cell division or cell elongation. Based on the newly added growth inhibition assay of all Tde1 variants which we showed nice correlation between CFU counting and OD600value (Fig. 2A, S2) and no increased cell size/length of E. coli cells expressing N-Tde1 or Tde1(M), we think the recovered OD600 value is supportive of N-Tde1 or Tde1(M) exhibiting bacteriostatic toxicity. In addition to that, our interbacterial competition data showed that Tde1(M)-Tdi1 which is still having intact glycine zipper doesn’t show significant detectable killing, supporting the bacteriostatic function of Tde1 glycine zippers. In fact, we performed this experiment based on Mariano et al.(Nat. Commun. 2019 doi: 10.1038/s41467-019-13439-0), which showed the recovery of OD600 value after removal of inducer as the evidence that the Ssp6 toxin is not bacteriolytic.

      • Fig. 3A: A control is missing. To verify that the N-terminal part of Tde1 is not promiscuously interacting with proteins, the authors should include a control sample testing its inability to precipitate a protein other than Tap1 in the same experiment. Ans: Our previous study has showed that Tde1 can co-immunprecipiate Tap1 but not a non-T6SS protein RpoA (Bondage et al., 2016 doi:10.1073/pnas.1600428113), indicating that Tde1 is not promiscuously interacting with proteins. Considering the tight biochemical interaction between Tap1 with N-Tde1 but not C-Tde1 that correlate with their ability for secretion upon loading onto VgrG1, N-Tde1 is unlikely to bind proteins non-specifically. This is also supported by the non-specific protein bands from cellular fractions recognized by anti-Tap1 are not co-immunoprecipitated by any of Tde1 variants (Fig. S3). We could repeat the experiments to include additional proteins as negative controls but we chose to use time for other more critical experiments during the limited revision time.

      • Fig. 3B: the blots are very "dirty". It is not clear how the authors were able to determine expression and precipitation of some truncations (for example, C2-Tde1 in the E. coli IP panel looks like a background band found in other lanes too).

      Ans: We agree that western blots of co-IP experiments in E. coli are not very clear due to the weak signals of some Tde1 variants and background. As pointed out by the reviewer 3, this result is not conclusive and rovide little additional information other than the co-IP results from A. tumefaciens. Because the interaction between Tde1 variants and Tap1 when expressed in E. coli are not physiologically relevant and not the main focus of this work, we have removed the E. coli co-IP results from this manuscript as suggested by the reviewer 3.

      • Lines 222-225 (Fig. 4A): I can't see C-1-Tde1(M)-sfGFP in the cellular blot. All the bands in this lane look like background bands that are also present in all other lanes. Therefore, I am not sure how the conclusion regarding this truncation's ability to be secreted was reached. Ans: We agree that C1-Tde1(M)-sfGFP is barely detectable due to its weak signal overlapping with cross-reacted bands. Since several attemps to improve the western blot quality by changing antibody and pre-blocking with protein lysages of vector control strain did not produce convincing results for detection of C1-Tde1(M)-sfGFP, we have rephrased the description of this result as “However, C1-Tde1(M)-sfGFP protein signal could not be unambiguously determined in the cellular fraction due to the overlapping of its predicted protein band with cross-reacted proteins, and no corresponding C1-Tde1(M)-sfGFP band was detected in the extracellular fraction.” (line 234-237).

      • Fig. 4A: the protein names above the lanes should include the sfGFP that is fused to them. Ans: Amended.

      • It would be preferable to show quantitative competition assays with statistics rather than pictures of a plate showing a single competition result, if conclusions or observations on minor differences in toxicity are made (for example, line 253: "The killing activity of Δtdei(Tde1GLGL-Tdi1) was largely compromised"). Since the authors performed each competition assay more than once, these data should be available to them. Ans: Amended. We have repeated the interbacterial competition experiments including single G39L and G43L variants for multiple biological repeats (see detailed in legends of Fig. 5A, 5B). The quantitative data with statistical analysis were added, which show no statistical difference of any glycine zipper mutants as comapred to Tde1(M) or when expressed in the T6SS mutant. Thus, there are no detectable antibacterial activity of glycine zipper mutants against either E. coli or A. tumefaciens siblings.

      • Fig. 5A: The author claim at the beginning of the manuscript (first results section heading: "Tde1 can cause DNase-independent growth inhibition of prey cells") that the N-terminal region of Tde1 is toxic on its own in the prey cell, yet in this competition assay Tdi1(M) shows no toxicity against the E. coli target cells. In the microscopy assay (Fig. 4B), it appears that a lot of Tdi1(M) enters the prey cell, since we can visualize it under the microscope. Can the authors clarify this discrepancy and explain why they do not expect to see target killing by this mutant even though they claimed it is toxic earlier? Ans: As describbed in earlier response, N-Tde1 amd Tde1(M) toxicity can exhibit toxicity by ectopic expression in E. coli. We mainly used this ectopic expression assay to dissect the region and motif contributing the toxicity. Compared to the interbacterial competiton process where Tde1(M) may only transiently permealze cytoplasmic membrane transiently as the final destination is cytoplasm where wild type Tde1 but not Tde1(M) exerts DNase toxicity. Thus, the toxicity of N-Tde1 and Tde1(M) can be only observed when the proteins are continuously produced in the cytoplasm. The role of N-Tde1, specifically the glycine zipper motifs, is to mediate Tde1 translocation across inner membrane, instead of exerting toxicity during the context of interbacterial competition.

      • Fig. 5B: the GLGL mutant seems to have some residual toxicity, not dissimilar to what is shown in 5A. Why are these similar results interpreted differently (in 5A they are "largely compromised", while in 5B "killing activity... was not detectable")? Also, why was Tde(M)1-Tdi1 used in Fig. 5A but Tdi1(M) without the immunity gene used in Fig. 5B? Ans: As described above, to better quantify the interbacterial competition outcomes, we have repeated the interbacterial competition experiments and used Tde(M)1-Tdi1 instead of Tdi1(M) for at least three biological replicates. The quantitative data with statistical analysis were carried out to clarify this ambiguity (Fig. 5A, 5B).

      • Fig. 5: Does the remaining third effector, Tae, not play a role in these competition assays? If, as shown in Fig. 5C, the entire T6SS is less active when a GLGL mutant is expressed, couldn't the different in toxicity shown in Figs. 5A-B be the result of lack of Tae secretion and toxicity?

      Ans: As decribed above, Tae effector has little impact on interbacterial competition outcome. The quantatitive interbacterial competition results (Fig. 5A, 5B) also clarify the ambiguity because single G39L and G43L variants are expressed and secreted at similar or even higher levels than wild type Tde1 but have no detectable antibacterial activity against either E. coli or A. tumefaciens 1D1609.

      • Lines 359-362: T6SS effectors that bind the inner Hcp tube were suggested to be only partially folded. Ans: Amended.

      Reviewer #1 (Significance (Required)):

      The concept of T6SS effectors providing their own mechanism of transport from the cytoplasm to the periplasm is very interesting. It will appeal to audience in a wide range of microbiology disciplines, including those interested in toxins, membrane transport, and even translational applications. A similar concept was recently proposed and demonstrated for a domain that is also found in T6SS effectors (Atanaskovic et al., mBio, 2022).

      Expertise: I have been studying the different aspects of T6SS for the past decade.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript is focused on understanding how the Agrobacterium tumefaciens T6SS effector, Tde1, is translocated across the cell envelope of target cells and how this effector binds to the adapter Tap1. The authors show that GxxxG motifs in the N terminal region of Tde1 are required for delivery into the cytoplasm of target cells and permeabilising the cytoplasmic membrane. Given that these GxxxG motifs resemble glycine zipper structures that are found in proteins involved in membrane channel formation, the authors propose that these Tde1 motifs are involved in channel formation in the target cell. The authors also show that the N terminal region of Tde1 binds to Tap1 to facilitate loading onto the T6SS machinery but that the GxxxG motifs are not involved in this binding. Overall the manuscript was easy to read and followed a logical presentation of the findings. There are a few major comments that this reviewer has below - addressing these would allow the authors' claims to be more robustly supported. Ans: Thank you very much for the positive comments and valuable suggestion. We hope the revised manuscript including the new data and careful interpretation have substantialized the conclusions and proposed mechanisms.

      Major comments:

      1. Fig 1B: Why is this such a short growth experiment (5 hrs total with 2 hr pre and 3 hrs post induction)? Reporting on a growth experiment would normally be at least until the cells reach stationary phase but here the cells are still clearly in exponential phase. This reviewer would query what happens to growth rate in later exponential growth and into stationary phase? Is the toxic effect lessened in later stages of growth? Ans: We have indeed performed the growth curve analysis with longer time period. However, we noted that the growth at later time points are not always consistent and our interpretation is that the continuous expression of toxins may lead to the selection of mutants. Since the 3 or 4 hr time period already showed the toxicity phenotype, we have focused on this time frame for the growh experiments.

      2. It is indeed surprising that C2-Tde1(WT) does not inhibit growth despite it having a functional DNase domain and being expressed in the cytoplasm. Did the authors confirm that this protein variant was expressed by Western blot or other means? This should be done to confirm that this variant is indeed not impacting upon growth instead of it not impacting growth simply because it is not being expressed.

      Ans: Amended. All Tde1 variants including C2-Tde1 are expressed (data included in Fig S1)

      1. The letters used to report significance are not clear to this reviewer. The authors say that "The significant differences were shown by the different letters (p value

      For all fluorescence microscopy experiments how many fields of view were imaged for each biological replicate? Were the fields selected at random or was the field selection biased to what was present in the field before taking the image? The answers to all of these questions should be stated in the methods. Also the microscopy data presented in the manuscript is not quantitative. Quantification of the number of cells with PI vs Hoechst signal (in Fig 2C) and mcherry vs gfp signal (in Fig 4B) for all fields of view and for all biological replicates would be very informative and convince the reader that the authors have not just "cherry picked" the images they are showing in the manuscript. This could be performed manually or the authors could use the freely available image analysis program Fiji (https://imagej.net/software/fiji/) to perform these analysis in a semi-automated manner.

      Ans: The number of images and experiments were now described in the figure legends and the quantititive data are included (Fig. 2C).

      1. For the co-IP experiments in Fig 3 where interaction between HA tagged Tde1 and Tap1 is demonstrated the authors should also show that Tap1 does not interact with a different HA-tagged protein i.e. that the interaction is specific to Tde1 and not the HA motif. Ans: All Tde1 variants were tagged with HA. As shown in Fig. 3A, Tap1was not co-precipitated by C2-Tde1 and C1-Tde1(M), indicating that Tap1 specifically interacts with N-terminal region of Tde1.

      For all Western blot images there should be at least 2 protein standard markers present in each individual blot - i.e. for Fig 3A and B the bottom panel showing Tap1 detection only has the 35 kDa marker, it should have at least one more marker in it. The same is true for other panels in Fig 2, 3 and 4. Having at least two molecular weight markers in a panel is now standard for most journals when presenting Western blot images. Ans: Amended. We have now included the full gel of western blot results in Fig. S3 and S5 of those shown in main figures.

      For the competition assay serial dilution images in Fig 5A-B the images are a nice way to visually represent the experimental outcome but they should accompany graphs showing the competitive index of CFU/ml of the input prey and attacker vs the output prey and attacker for all biological replicates. This will convince the reader that the authors had equivalent amounts of the prey and the attacker going into the experiment and also that all attackers grew at the same rate and so were equally able to target the prey cell. This quantification could also provide more convincing out competition of ID1609 prey by C58 attacker (Fig 5B). Ans: Amended. As indicated above, we have repeated the interbacterial competition experiments for at leaset three biological replicates and show that quantitative data with statistical analysis (Fig. 5A, 5B).

      Minor comments:

      Line 40: should read "...demonstrate that the effector itself..." Ans: The sentence has been rephrased (line 40) .

      Line 41: "...we propose..." instead of "...we proposed..." since present tense makes more sense for this statement.

      Ans: Amended (line 42).

      Line 51: "Each specialized protein secretion system" instead of "Each of...." Ans: Amended (line 52).

      Line 76: "A glycine zipper structure..."

      Ans: Amended (line 83).

      Line 79: "For example..."

      Ans: Amended (line 86).

      Lines 96-100: The present tense should be used here as the current usage of past tense implies that this has been done in previous work and not in the current study - eg "we revealed", "we showed" would be better as "we reveal", "we show".

      Ans: Thanks for the advice. We have made changes throughout the manuscript.

      Fig 5B - The competition assay serial dilution images look a bit blurry, are there images the authors could use that are not blurry?

      Ans: Amended. As indicated above, we now show quantitative data with statistical analysis (Fig. 5A, 5B).

      Reviewer #2 (Significance (Required)):

      This work is significant in as while there is a great deal known about how T6SS effectors cause toxicity there is less known about how these effectors are loaded onto the T6SS machinery and very little known about how T6SS effectors are able to translocate across the cytoplasmic membrane of target cells to reach a cellular component that is in the cytoplasm. This work would be of wide general interest to researchers in the T6SS field as well as those interested in bacterial secretion systems.

      Reviewer expertise key words: Molecular microbiology, T6SS, interbacterial competition

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      EVIDENCE, REPRODUCIBILITY AND CLARITY

      Summary:

      In this work, Ali et al. demonstrate that the N-terminal GxxxG motif of the T6SS DNase effector Tde1 of Agrobacterium tumefaciens is required for interbacterial intoxication. Using a combination of cell viability, reporter, and microscopy assays, the authors demonstrate that over-expression of the N-terminus of Tde1 results in inner membrane permeability. Moreover, the authors show that both the interaction between Tde1 and its adaptor Tap1 as well as the T6SS-mediated secretion of Tde1 are dependent on the N-terminus of Tde1. Finally, using a combination of in vitro and in vivo experiments, the authors determine that the N-terminal GxxxG motif is essential to Tde1-dependent interbacterial killing by enabling effector entry into competing bacterial cells.

      Major comments:

      If N-tde1 is 1-97 aa, the predicted size is 9 kDa, but it shows up as ~17 kDa? Can the authors comment on this? Does N-tde1 or tde1 dimerize? Ans: The theoretical Mw of N-Tde1-HA is 10.64 KDa, which indeed migrated at higer position ~17 kDa. It is notable that full-length Tde1 with theoretical 29.5-kDa migrated slower in SDS-PAGE with a observed size ~36 kDa as observed previously (Ma et al., 2014 doi:10.1016/j.chom.2014.06.002). Similarly, the full-length HA-tagged Tde1(M) with theoretical 30.89 kDa migrated at a position ~38 kDa. Since the protein samples analyzed by SDS-PAGE including reducing agent, we cannot exclude the possibility that Tde1 or N-Tde1 may form dimer or oligomer that was disrupted by SDS-PAGE but it appears not forming dimer on SDS-PAGE.

      I have many concerns with the data and conclusions drawn from the data in Fig. 3B. I recommend removing it since (1) the data are not accurately represented in the text and (2) it is difficult to ascertain whether biologically relevant conclusions can be drawn from what happens with Agrobacterium proteins in E. coli. Below is a summary of my concerns regarding this section: I disagree with the authors' statements in lines 191-198. Their pulldown with E. coli is not consistent with their pulldown in C58. In fact, given the expression problems of some of the constructs in E. coli, I believe the data shown in Fig. 3B is inconclusive. The amount of Tap1 that co-IP'ed with N-Tde1GLGL and Tde1(M) is very low even though the expression levels of N-Tde1GLGL and Tde1(M) were relatively strong. Therefore, I do not feel confident concluding that these proteins "interact". Secondly, Tde1(M)GLGL was not expressed in E. coli, so no conclusions can be drawn. Moreover, the C1 and C2 variants were also not expressed well, so I believe the authors' statement in line 191-192: "Similar to the results in A. tumefaciens, the N-Tde1 and Tde1(M) interacted with Tap1 but not the C-terminal variants", is unjustified. You cannot rule out that C1 and C2 do not interact with Tap1 because C1 and C2, like Tde1(M)GLGL, were not expressed well in E. coli. Ans: We agree with the reviewer that the E. coli co-IP result is not conclusive due to the low expression and instability of proteins mostly during the process of cell lysis and purification, and it provides little additional information other than data from co-IP in A. tumefaciens. Because the interaction between Tde1 variants and Tap1 when expressed in E. coli are not physiologically relevant and not the main focus of this work, we have removed the E. coli co-IP results from this manuscript.

      Lines 211-214: It looks like C1-Tde1(M) inhibits T6SS secretion. I am aware that in Agrobacterium, it has been shown that effector loading is essential for secretion, but then why does the pTrc200 secrete Hcp? Also, in Fig. 4B, a strain expressing C1-Tde1(M) now secretes Hcp. Ans: Thanks for noting our previous finding that Tde loading is critical for secretion. Our data are indeed supportive of the effector loading in activating T6SS as only very low levels of Hcp secretion could be detected from the strain containing vector only or C1-Tde1(M). In our previous paper (Wu et al., 2020 https://doi.org/10.15252/embr.201947961), there is either little or no detection of Hcp secretions when effectors are not loaded, indicating that effector loading is important but not essential for Hcp secretion. Because overexpression of VgrG can also activate T6SS secretion in the absence of effector loading (Bondage et al., 2016 doi:10.1073/pnas.1600428113), we think the low level secretion under certain conditions could be caused by some cells with higher levels of VgrG protein concentration but more work is required to elucidate the underlying mechanisms.

      Minor comments:

      Fig. 2B could benefit from better labeling to indicate that most strains lack lacY. Also, why is BW25113 WT showing such a low OD420 if it has LacY? Or is WT without lacZ? Please clarify.

      Ans: We apologize for not labeling clearly. The BW25113 strain lacks lacZ, therefore all the ∆lacY strains were complemented with a plasmid encoding lacZ (pYTA-lacZ). We have now added the labels to avoid confusion (Fig. 2B).

      Reviewer #3 (Significance (Required)):

      SIGNIFICANCE

      It has been known for over a decade that T6SS effectors have both periplasmic and cytosolic targets (e.g., cell wall and DNA). However, it remains unclear (1) where within the target cell are T6SS effectors are delivered and (2) once delivered, how do effectors reach their intracellular target site. In this work, Ali et al. demonstrate that for Tde1, the N-terminal GxxxG motif is essential for Tde1 to reach its target (DNA). The authors identified Tde1 homologs in several bacteria, suggesting that this model may be relevant across a wide range of bacteria. Additional research is needed to (1) determine whether Tde1 is originally secreted into the periplasm and (2) understand how non-Tde1/non-GxxxG effectors reach their target site.

    1. Author Response

      Reviewer #2 (Public Review):

      This is an interesting study investigating the effects of sensory conflict on rhythmic behaviour and gene expression in the sea anemone Nematostella vectensis. Sensory conflict can arise when two environmental inputs (Zeitgeber) that usually act cooperatively to synchronize circadian clocks and behaviour, are presented out of phase. The clock system then needs to somehow cope with this challenge, for example by prioritising one cue and ignoring the other. While the daily light dark cycle is usually considered the more reliable and potent Zeitgeber, under some conditions, daily temperature cycles appear to be more prominent, and a certain offset between light and temperature cycles can even lead to a breakdown of the circadian clock and normal daily behavioural rhythms. Understanding the weighting and integration of different environmental cues is important for proper synchronization to daily environmental cycles, because organisms need to distinguish between 'environmental noise' (e.g., cloudy weather and/or sudden, within day/night temperature changes) and regular daily changes of light and temperature. In this study, a systematic analysis of different offsets between light and temperature cycles on behavioural activity was conducted. The results indicated that several degrees of chronic offset results in the disruption of rhythmic behaviour. In the 2nd part of the study the authors determine the effect of sensory conflict (12 hr offset that leads to robust disruption of rhythmic behaviour) on overall gene expression rhythms. They observe substantial differences between aligned and offset conditions and conclude a major role for temperature cycles in setting transcriptional phase. While the study is thoroughly conducted and represents and impressive amount of experimental and analytical work, there are several issues, which I think question the main conclusions. The main issue being that temperature cycles by themselves do not seem to fulfil the criteria for being considered a true Zeitgeber for the circadian clock of Nematostella.

      Major points:

      Line 53: 'However, many of these studies did not compare more than two possible phase relationships.....'. Harper et al. (2016) did perform a comprehensive comparison of different phase relationships between light and temperature Zeitgebers (1 hr steps between 2 and 10 hr offsets), similar to the one conducted here. I think this previous study is highly relevant for the current manuscript and -- although cited -- should be discussed in more detail. For example, Harper et al. show that during smaller offsets temperature is the dominant Zeitgeber, and during larger sensory conflict light becomes the dominant Zeitgeber for behavioural synchronization. Only during a small offset window (5-7 hr) behavioural synchronization becomes highly aberrant, presumably because of a near breakdown of the molecular clock, caused by sensory conflict. Do the authors see something similar in Nematostella? Figure 3 suggests otherwise, at least under entrainment conditions, where behaviour becomes desynchronized only at 10 and 12 hr offset conditions. But in free-run conditions behaviour appears largely AR already at 6 hr offset, but not so much at 4 and 8 hr offsets (Table 2). So there seems to be at least some similarity to the situation in Drosophila during sensory conflict, which I think is worth mentioning and discussing.

      We have added a more detailed discussion of our results in the context of Harper et al. 2016 (L468-476).

      Line 111: The authors state that 14-26C temperature cycle is 'well within the daily temperature range experienced by the source population'. Too me this is surprising, as I was not expecting that water temperature changes that much on a daily basis. Is this because Nematostella live near the water surface, and/or do they show vertical daily migration? Also, I do not understand what is meant by '...range of in situ diel variation (of temperature)'. I think a few explanatory words would be helpful here for the reader not familiar with this organism.

      In fact, one of our motivations for studying temperature is that Nematostella naturally experience extreme temperature variation. The data we cite (Tarrant et al. 2019) are from in-situ water measurements. Nematostella live in extremely shallow water (in salt marshes), and the local population in Massachusetts experience wide swings in temperature due to the temperate latitude.

      We have added this information to the Introduction (L88-90), and we also added a discussion of Nematostella’s ecology in the Discussion section (L591-654).

      Lines 114-117: I was surprised that clock genes can basically not be synchronized by temperature cycles alone. Only cry2 cycled during temperature cycles but not in free-run, so the cry2 cycling during temperature cycles could just be masking (response to temperature). Later the authors show robust molecular cycling during combined LD and temperature cycles (both aligned and out of phase), indicating that LD cycles are required to synchronize the molecular clock. Moreover, a previous study has demonstrated that LD cycles alone (i.e., at constant temperature) are able to induce rhythmic molecular clock gene expression (Oren et al. 2015). Similarly, the free running behaviour after temperature cycles does not look rhythmic to me. In Figure 2A, 14-26C there is at best one peak visible on the first day of DD, and even that shows a ~6 phase delay compared to the entrained condition. After the larger amplitude temperature cycle (8:32C) behaviour looks completely AR and peak activity phases in free-run appear desynchronized as well (Fig. 2B). Overall, I think the authors present data demonstrating that temperature cycles alone are not sufficient to synchronize the circadian clock of Nematostella. One way to proof if the clock can be entrained is to perform T-cycle experiments, so changing the thermoperiod away from 24 hr (e.g., 10 h warm : 10 h cold). If in a series of different T-cycles the peak activity always matches the transition from warm to cold (as in 12:12 T-cycles shown in Fig. 1A) this would speak against entrainment and vice versa.

      Thank you for these thoughtful comments and constructive suggestions. We have conducted an additional experiment, which provides further evidence that temperature cycles can, in fact, synchronize the circadian clock. To do this, we measured the behavior of animals entrained in cycles with a short (12h) period, half the length of a circadian period. This takes advantage of a phenomenon called “frequency demultiplication”, in which organisms in 12h environmental cycles display both 12h and 24h components--essentially, the clock perceives every other cycle as a “day” (Bruce, 1960; Merrow et al., 1999). The important thing is that the 24h behavioral component can only occur if the signal is entraining a circadian clock—otherwise, we would only observe a directly-driven 12h behavior pattern.

      We first show that this phenomenon occurs with 6:6 LD cycles—which we expected, because we know light is a zeitgeber. We then show that animals entrained to a temperature cycle with a 12h period also display 24h behavioral rhythms—and in fact the 24h component is stronger than the 12h component. We believe this is strong evidence that temperature is a bona fide zeitgeber in this system. This experiment is now explained in the Results (L127-154) and in Figure 2–Figure supplement 1.

      In terms of our original data, the reviewer is correct that the statistically-detectable free-running rhythms were weak and not visually obvious). Our confidence in thermal entrainment came from the fact that some individual animals had 24h rhythmicity in free-run, even if the signal was weak in the mean time series—this suggested that temperature must be at least capable of synchronizing internal clocks. It is also important to note that even light-entrained rhythms are “noisy” in cnidarians, which is why we were not surprised that the signal was weak. We have added a discussion of this observation in L601-612.

      Lines 210-226: As mentioned above, I think it is not clear that temperature alone can synchronize the Nematostella clock and it is therefore problematic to call it a Zeitgeber. Nevertheless, Figure 3A, B, D show that certain offsets of the temperature cycle relative to the LD cycle do influence rhythmicity and phase in constant conditions. This is most likely due to a direct effect of temperature cycles on the endogenous circadian clock, which only becomes visible (measureable) when the animals are also exposed to certain offset LD cycles. My interpretation of the combined results would be that temperature cycles play only are very minor role in synchronizing the Nematostella clock (after all, LD and temperature cycles are not offset in nature), perhaps mainly supporting entrainment by the prominent LD cycles.

      With our new data (see previous point), we believe we can safely say that temperature is a zeitgeber. We are not totally clear on what is meant by “a direct effect of temperature cycles on the endogenous circadian clock.” We argue that, because we see changes in free-running behavior during certain offsets, the timing of temperature cycles must affect the internal clock in a way that persists during constant conditions—it can’t just be a direct (clock-independent) effect of temperature.

      Gene expression part: The authors performed an extensive temporal transcriptomic analysis and comparison of gene expression between animals kept in aligned LD and temperature cycles and those maintained in a 12 hr offset. While this was a tremendous amount of experimental work that was followed by sophisticated mathematical analysis, I think that the conclusions that can be drawn from the data are rather limited. First of all, it is known from other organisms that temperature cycles alone have drastic effects on overall gene expression and importantly in a clock independent manner (e.g., Boothroyd et al. 2007). Temperature therefore seems to have a substantially larger effect on gene expression levels compared to light (Boothroyd et al. 2007). In the current study, except for a few clock gene candidates (Figure 2C), the effects of temperature cycles alone on overall gene expression have not been determined. Instead the authors analysed gene expression during aligned and 12 h offset conditions making it difficult to judge which of the observed differences are due to clock independent and clock dependent temperature effects on gene expression. This is further complicated by the lack of expression data in constant conditions. I think the authors need to address these limitations of their study and tone down their interpretations of 'temperature being the most important driver of rhythmic gene expression' (e.g., line 401). At least they need to acknowledge that they cannot distinguish between clock independent, driven gene expression and potential influences of temperature on clock-dependent gene expression rhythms. Moreover, in their comparison between their own data and LD data obtained at constant temperature (taken from Oren et al. 2015), they show that temperature has only a very limited effect (if any) on core clock gene expression, further questioning the role of temperature cycles in synchronising the Nematostella clock. Nevertheless, I noted in Table 3 that there is a 1.5 to 3 hr delay when comparing the phase of eight potential key clock genes between the current study (temperature and LD cycles aligned) and LD constant temperature (determined by Oren et al.). To me, this is the strongest argument that temperature cycles at least affect the phase of clock gene expression, but the authors do not comment on this phase difference.

      We agree with these points about the limitations of our study, and have revised the manuscript to phrase our conclusions more carefully. We still think it is reasonable to observe that temperature was a stronger drive of gene expression than light in our study, but this may not be true in other contexts.

      In terms of the comparison with Oren et al. 2015, we didn’t want to over-interpret these results because there are other differences between the studies (L1181-1185), including the use of a different source population. In addition, we would prefer denser sampling (2h time points rather than 4h) and larger sample sizes to make claims about phase differences.

      Network analysis: This last section of the results was very difficult to read and follow (at least for me). For example, do the colours in Figure 6A correspond to those in Figure 6B, C? A legend for each colour, i.e., which GO terms are included in each colour would perhaps be helpful. As mentioned above, I also do not think we can learn a lot from this analysis, since we do not know the effects of temperature cycles alone and we have no free-run data to judge potential influence on clock controlled gene expression. Under aligned conditions genes are expressed at a certain phase during the daily cycle (either morning to midday, or evening to midnight), which interestingly, is very similar to temperature cycle-only driven genes in Drosophila (Boothroyd et al. 2007). Inverting the temperature cycle has drastic effects on the peak phases of gene expression, but not so much on overall rhythmicity. But since no free-run data are available, we do not know to what extend these (expected) phase changes reflect temperature-driven responses, or are a result of alterations in the endogenous circadian clock.

      We have revised and streamlined this section and Fig. 6, including removing panel 6C. The colors do correspond across panels in the figure. For space, GO terms of select modules are included in Fig. 6, and GO results for all modules are included in the Supplemental Data and discussed in the Results.

      It is true that we can’t distinguish temperature-driven versus clock effects here, and it does seem like many modules simply follow the temperature cycle (which we say in this section). The most interesting finding from this section is probably that the co-expression structure (correlations between rhythmic genes) are substantially weakened during SC, and we do discuss certain modules of genes that lose or gain rhythmicity. We have revised this section to focus on the main points and have cut several of the less pertinent results.

      Reviewer #3 (Public Review):

      This article reflects a significant effort by the authors and the results are interesting.

      For the third set of experiments, are temperature and light really out of synch? While peak in temperature no longer occurs along with lights on, we do still have two 24 hour cycles where changes in the environmental cues still occur simultaneously (lights on with peak in temperature, lights off with min in temperature). I wonder what would happen if light remained at a 24 hour cycle and temperature became either sporadic (randomly changing cycles) or was placed on a longer cycle altogether (temperature taking 20 hours to increase from min to max, and then another 20 hours to go from max to min).

      Thank you for your interesting suggestions for future experiments. This point is addressed in our revisions responding to Reviewer #1, who requested a discussion of the phrase “sensory conflict.” We agree that the binary “in-sync vs. out-of-sync” may be too simplistic. Our original conception of sensory conflict was a situation in which light and temperature provide different phase information, as informed by experiments with only light (prior literature) or only temperature (this work).

      In our revised manuscript, we discuss the idea that “sensory conflict” is not always a useful framework because there are many possible relationships between light and temperature. Although our 12h offset is certainly less “natural” than our aligned time series, it may be useful to think of them simply as 2 different possible light and temperature regimes in which the two signals interact, rather than abstract ideals of “aligned” or “misaligned.”

      An area that could significantly benefit a broader readership would be to improve overall clarity of figures and rethink if all the results are necessary to convert the key findings of the paper. As written, the results sections is somewhat confusing.

      We have revised Figs. 1 and 6 for clarity, and we have also shortened the network analysis portion of the Results.

    1. Evaluation 3


      Ratings and predictions

      Ratings (1-100)

      • Overall assessment: 65 Confidence: Medium
      • Advancing knowledge and practice: 70 Confidence: Medium
      • Methods: Justification, reasonableness, validity, robustness: Not qualified
      • Logic & communication: 80 Confidence: Medium-to-high
      • Open, collaborative, replicable: Not qualified
      • Relevance to global priorities: 80 Confidence: High

      Journal predictions (1-5)

      • What ‘quality journal’ do you expect this work will be published in? 3.5 Confidence: Medium
      • On a ‘scale of journals’, what tier journal should this be published in? 3.5 Confidence: Medium

      Written report

      I am a political scientist specializing in science policy (i.e., how expertise and knowledge production influences the policymaking process and vice-versa), with a focus on “decision making under conditions of uncertainty,” R&D prioritization, and the governance of systemic and catastrophic risk. With respect to the various categories of expertise highlighted by the authors, I can reasonably be considered a “policy analyst.”

      Potential conflict of interest/source of bias: one of the authors (Dr. Anders Sandberg) is a friend and former colleague. He was a member of my PhD dissertation committee.

      A quick further note on the potential conflict of interest/bias of the authors (three of the four are associated with ALLFED, which, as the authors note, could stand to benefit financially from the main implication of their analysis - that significant funding be allocated to resilient food research in the short-term). In my opinion, this type of “self-advocacy” is commonplace and, to some extent, unavoidable. Interest and curiosity (and by extension, expertise) on a particular topic motivates deep analysis of that topic. It’s unlikely that this kind of deep analysis (which may or may not yield these sorts of “self-confirming” conclusions/recommendations) would ever be carried out by individuals who are not experts on - and often financially implicated in - the topic. I think their flagging of the potential conflict of interest at the end of the paper is sufficient - and exercises like this Unjournal review further increase transparency and invite critical examinations of their findings and “positionality.”

      I am unqualified to provide a meaningful evaluation of several of the issues “flagged” by the authors and editorial team, including: the integration of the sub-models, sensitivity analysis, and alternative approaches to the structure of their Monte Carlo analysis. Therefore, I will focus on several other dimensions of the paper.

      Context and contribution

      This paper has two core goals: (1) to explore the value and limitations of relative long-term cost effectiveness analysis as a prioritization tool for disaster risk mitigation measures in order to improve decision making and (2) to use this prioritization tool to determine if resilient foods are more cost effective than AGI safety (which would make resilient food the highest priority area of GCR/X-risk mitigation research). As I am not qualified to directly weigh in on the extent to which the authors’ achieved either goal, I will reflect on the “worthiness” of this goal within the broader context of work going on in the fields of X-risk/GCR, long-termism, science policy, and public policy - and the extent to which the authors’ findings are effectively communicated to these audiences.

      Within this broader context, I believe that these are indeed worthy (and urgent) objectives. The effective prioritization of scarce resources to the myriad potential R&D projects that could (1) reduce key uncertainties, (2) improve political decision-making, and (3) provide solutions that decrease the impact and/or likelihood of civilization-ending risk events is a massive and urgent research challenge. Governments and granting agencies are desperate for rigorous, evidence-based guidance on how to allocate finite funding across candidate projects. Such prioritization is impeded by uncertainty about the potential benefits of various R&D activities (partially resulting from uncertainty about the likelihood and magnitude of the risk event itself - but also from uncertainty about the potential uncertainty-reducing and harm/likelihood-reducing “power” of the R&D). Therefore, the authors’ cost-effectiveness model, which attempts to decrease uncertainty about the potential uncertainty-reducing and harm/likelihood-reducing “power” of resilient food R&D and compare it to R&D on AGI safety, is an important contribution. It combines and applies a number of existing analytical tools in a novel way and proposes a tool for quantifying the relative value of (deeply uncertain) R&D projects competing for scarce resources.

      Overall, the authors are cautious and vigilant in qualifying their claims - which is essential when conducting analysis that relies on the quasi-quantiative aggregation of the (inter)subjective beliefs of experts and combines several models (each with their own assumptions).

      Theoretical/epistemic uncertainty

      I largely agree with the authors’ dismissal of theoretical/epistemic uncertainty (not that they dismiss its importance or relevance - simply that they believe there is essentially nothing that can be done about it in their analysis). Their suggestion that “results should be interpreted in an epistemically reserved manner” (essentially a plea for intellectual humility) should be a footnote in every scholarly publication - particularly those addressing the far future, X-risk, and value estimations of R&D.

      However, the authors could have bolstered this section of the paper by identifying some potential sources of epistemic uncertainty and suggesting some pathways for further research that might reduce it. I recognize that they are both referring to acknowledged epistemic uncertainties - which may or may not be reducible - as well as unknown epistemic uncertainties (i.e., ignorance - or what they refer to as “cluelessness”). It would have been useful to see a brief discussion of some of these acknowledged epistemic uncertainties (e.g., the impact of resilient foods on public health, immunology, and disease resistance) to emphasize that some epistemic uncertainty could be reduced by exactly the kind of resilient food R&D they are advocating for.

      Presentation of model outputs

      When effectively communicating uncertainties associated with research findings to multiple audiences, there is a fundamental tradeoff between the rigour demanded by other experts and the digestibility/usability demanded by decision makers and lay audiences. For example, this tradeoff has been well-documented in the literature on the IPCC’s uncertainty communication framework (e.g., Janzwood 2020). What fellow-modelers/analysts want/need is usually different from what policymakers want/need. The way that model outputs are communicated in this article (e.g. 84% confidence that the 100 millionth dollar is more cost-effective) leans towards rigour and away from digestibility/usability. A typical policymaker who is unfamiliar with the modeling tools used in this analysis may assume that an 84% probability value was derived from historical frequencies/trials in some sort of experiment - or that it simply reflects an intersubjective assessment of the evidence by the authors of the article. Since the actual story for how this value was calculated is rather complex (it emerges from a model derived from the aggregation of the outputs of two sub-models, which both aggregate various types of expert opinions and other forms of data) - it might be more useful to communicate the final output qualitatively.

      This strategy has been used by the IPCC to varying levels of success. These qualitative uncertainty terms can align with probability intervals. For example, 80-90% confidence could be communicated as “high confidence” or “very confident.” >90% could be communicated as “extremely confident.” There are all sorts of interpretation issues associated with qualitative uncertainty scales - and some scales are certainly more effective than others (again, see Janzwood 2020) but it is often useful to communicate findings in two “parallel tracks” - one for experts and one for a more lay/policy-focused audience.

      Placing the article’s findings within the broader context of global priorities and resource allocation

      Recognizing the hard constraints of word counts - and that a broader discussion of global priorities and resource allocation was likely “out of scope” - this article could be strengthened (or perhaps simply expanded upon in future work) by such a discussion. The critical piece of context is the scarcity of resources and attention within the institutions making funding decisions about civilization-saving R&D (governments, granting organizations, private foundations, etc.). There are two dimensions worth discussing here. First, R&D activities addressing risks that are generally considered low-probability/high-impact with relatively long timelines (although I don’t think the collapse of global agricultural would qualify as low-risk - nor is the likely timeline terribly long - but those are my priors) are competing for scarce funding/attention against R&D activities addressing lower-impact risks believed to be shorter-term and more probable (e.g., climate change, the next pandemic, etc.). I think most risk analysts - even hardcore “long-termists” - would agree that an ideal “R&D funding portfolio” be somewhat diversified across these categories of risk. It is important to acknowledge the complexity associated with resource allocation - not just between X-risks but between X-risks and other risks.

      Second, there is the issue of resource scarcity itself. On the one hand, there are many “high value” candidate R&D projects addressing various risks that societies can invest in - but only a finite amount of funding and attention to allocate between them. So, these organizations must make triage decisions based on some criteria. On the other hand, there are also a lot of “low” or even “negative value” R&D activities being funded by these organizations - in addition to other poor investments - that are providing little social benefit or are actively increasing the likelihood/magnitude of various risks. I believe that it is important in these sort of discussions about R&D prioritization and resource scarcity to point out that the reosource pool need not be this shallow - and to identify some of the most egregious funding inefficiencies (e.g., around fossil fuel infrastructure expansion). It should go without saying— but ideally, we could properly resource both resilient food and AGI safety research.


      Evaluator details

      1. Name: Scott Janzwood
      2. How long have you been in this field? 10 years
      3. How many proposals and papers have you evaluated? ~25 proposals, ~10 papers
    2. Evaluation 1


      Ratings and predictions

      Ratings (1-100)

      • Overall assessment: 40 CI: 20-60
        • Comment: See main review
      • Advancing knowledge and practice: 30 CI: 20-60
        • Comment: The paper itself makes an important argument about resilient foods, but I don’t know if the additional element of AGI risk adds much to Denkenberger & Pearce (2016)
      • Methods: Justification, reasonableness, validity, robustness: 50 CI: 40-60
        • Comment: Very major limitations around the survey method, and implementation of certain parts of the parameter sensitivity analysis. However many elements of a high standard
      • Logic & communication: 60 CI: 40-75
        • Comment: Major limitations around the logic and communication of the theoretical model of cost-effectiveness used in the paper. Minor limitations of readability and reporting which could have been addressed before publication (such as reporting 95% CIs without medians, and not reporting overall cost and benefit estimates)
      • Open, collaborative, replicable: 70 CI: 40-75
        • Comment: Provided models are shared with any reader who asks, I couldn’t ask for more here. Limitations of survey replicability (particularly E model) prevent perfect score
      • Relevance to global priorities: 90 CI: 60-95
        • Comment: I’d be surprised if I ever read a paper with more relevance to global priorities, although as mentioned there are a few version of this argument circulating such as Denkenberger & Pearce (2016)

      Journal predictions (1-5)

      • What ‘quality journal’ do you expect this work will be published in? 2 CI: 1-2
      • On a ‘scale of journals’, what tier journal should this be published in? 2 CI: 1-2

      Written report

      This is a very interesting paper on an important and neglected topic. I’d be surprised if I ever again read a paper with such potential importance to global priorities. The authors motivate the discussion well, and should be highly commended for their clear presentation of the structural features of their model, and the thoughtful nature in which uncertainty was addressed head-on in the paper.

      Overall, I suspect the biggest contribution this paper will make is contextualising the existing work done by the authors on resilient food into the broader literature of long-termist interventions. This is a significant achievement, and the authors should feel justifiably proud of having accomplished it. However, the paper unfortunately has a number of structural and technical issues which should significantly reduce a reader’s confidence in the quantitative conclusions which aim to go beyond this contextualisation.

      In general, there are three broad areas where I think there are material issues with the paper:

      1. The theoretical motivation for their specific philosophy of cost-effectiveness, and specifically whether this philosophy is consistent throughout the essay
      2. The appropriateness of the survey methods, in the sense of applying the results of a highly uncertain survey to an already uncertain model
      3. Some specific concerns with parameterisation

      None of these concerns touch upon what I see at the main point of the authors, which I take to be that ‘fragile’ food networks should be contextualised alongside other sources of existential risk. I think this point is solidly made, and important. However, they do suggest that significant additional work may be needed to properly prove the headline claim of the paper, which is that in addition to being a source of existential risk the cost-effectiveness of investing in resilient food is amongst the highest benefit-per-cost of any existential risk mitigation.

      Structure of cost-effectiveness argument

      One significant highlight of the paper is the great ambition it shows in resolving a largely intractable question. Unfortunately, I feel this ambition is also something of a weakness of the paper, since it ends up difficult to follow the logic of the argument throughout.

      • Structurally, the most challenging element of this paper in terms of argumentative flow is the decision to make the comparator for cost-effectiveness analysis ‘AGI Catastrophe’ rather than ‘do nothing’. My understanding is that the authors make this decision to clearly highlight the importance of resilient food – noting that, “if resilient foods were more cost effective than AGI safety, they could be the highest priority [for the existential risk community]” (since the existential risk community currently spends a lot on AGI Risk mitigation). So roughly, they start with the assumption that AI Risk must be cost-effective, and argue that anything more cost-effective than this must therefore also be cost-effective. The logic is sound, but this decision causes a number of problems with interpretability, since it requires the authors to compare an already highly uncertain model of food resilience against a second highly uncertain model of AGI risk.
      • The biggest issue with interpretability this causes is that I struggle to understand what features of the analysis are making resilient food appear cost-effective because of some feature of resilient food, and which are making resilient food appear cost-effective because of some feature of AI. The methods used by the authors mean that a mediocre case for resilient food could be made to look highly cost-effective with an exceptionally poor case for AI, since their central result is the multiplier of value on a marginally invested dollar for resilient food vs AI. This is important, because the authors’ argument is that resilient food should be funded because it is more effective than AI Risk management, but this is motivated by AI Risk proponents agreeing AI Risk is important – in scenarios where AI Risk is not worth investing in then this assumption is broken and cost effectiveness analysis against a ’do nothing’ alternative is required. For example, the authors do not investigate scenarios where the benefit of the intervention in the future is negative because “negative impacts would be possible for both resilient foods and AGI safety and there is no obvious reason why either would be more affected”. While this is potentially reasonable on a mathematical level, it does mean that it would be perfectly possible for resilient foods to be net harmful and the paper not correctly identify that funding them is a bad idea – simply because funding AI Risk reduction is an even worse idea, and this is the only given alternative. If the authors want to compare AGI risk mitigation and resilient foods against each other without a ‘do nothing’ common comparator (which I do not think is a good idea), they must at the very least do more to establish that the results of their AI Risk model map closely to the results which cause the AI Risk community to fund AI Risk mitigation so much. As this is not done in the paper, a major issue of interpretability is generated.
      • A second issue this causes is that the authors must make an awkward ‘assumption of independence’ between nuclear risk, food security risk and AI risk. Although the authors identify this as a limitation of their modelling approach, the assumption does not need to be made if AI risk is not included as a comparator in the model. I don’t think this is a major limitation of the work, but an example of how the choice of comparator has an impact on structural features of the model beyond just the comparator.
      • More generally, this causes the authors to have to write up their results in a non-natural fashion. As an example of the sort of issues this causes, conclusions are expressed in entirely non-natural units in places (“Ratio of resilient foods mean cost effectiveness to AGI safety mean cost effectiveness” given $100m spend), rather than units which would be more natural (“Cost-effectiveness of funding resilient food development”). I cannot find expressed anywhere in the paper a simple table with the average costs and benefits of the two interventions, although a reference is made to Denkenberger & Pearce (2016) where these values were presented for near-term investment in resilient food. This makes it extremely hard for a reader to draw sensible policy conclusions from the paper unless they are already an expert in AGI risk and so have an intuitive sense of what an intervention which is ‘3-6 times more cost-effective than AGI risk reduction’ looks like. The paper might be improved by the authors communicating summary statistics in a more straightforward fashion. For example, I have spent some time looking for the probability the model assigns to no nuclear war before the time horizon (and hence the probability that the money spent on resilient food is ‘wasted’ with respect to the 100% shortfall scenario) but can’t find this – that seems to be quite an important summary statistic but it has to be derived indirectly from the model.

      Fundamentally, I don’t understand why both approaches were not compared to a common scenario of ‘do nothing’ (relative to what we are already doing). The authors’ decision to compare AGI Risk mitigation to resilient foods directly would only be appropriate if the authors expect that increasing funding for resilient food decreased funding for AI safety (that is to say, the authors are claiming that there is a fixed budget for AI-safety-and-food-resilience, and so funding for one must come at the expense of the other). This might be what the authors have in mind as a practical consequence of their argument, as there is an implication that funding for resilient foods might come from existing funding deployed to AGI Risk. But it is not logically necessary that this is the case, and so it creates great conceptual conclusion to include it in a cost-effectiveness framework that requires AI funding and resilient food funding to be strictly alternatives. To be clear, the ‘AI subunit’ is interesting and publishable in its own right, but in my opinion simply adds complexity and uncertainty to an already complex paper.

      Continuing on from this point, I don’t understand the conceptual framework that has the authors consider the value of invested dollars in resilient food at the margin. The authors’ model of the value of an invested dollar is an assumption that it is distributed logarithmically. Since the entire premise of the paper hinges on the reasonability of this argument, it is very surprising there is no sensitivity analysis considering different distributions of the relationship between intervention funding and value. Nevertheless, I am also confused as to the model even on the terms the authors describe; the authors’ model appears to be that there is some sort of ‘invention’ step where the resilient food is created and discovered (this is mostly consistent with Denkenberger & Pearce (2016), and is the only interpretation consistent with the question asked in the survey). In which case, the marginal value of the first invested dollar is zero because the ’invention’ of the food is almost a discrete and binary step. The marginal value per dollar continues to be zero until the 86 millionth dollar, where the marginal value is the entire value of the resilient food in its entirety. There seems to be no reason to consider the marginal dollar value of investment when a structural assumption made by the authors is that there is a specific level of funding which entirely saturates the field, and this would make presenting results significantly more straightforward – it is highly nonstandard to use marginal dollars as the unit of cost in a cost-effectiveness analysis, and indeed is so nonstandard I’m not certain fundamental assumptions of cost-effectiveness analysis still hold. I can see why the authors have chosen to bite this bullet for AI risk given the existing literature on the cost of preventing AI Catastrophe, but there seems to be no reason for it when modelling resilient food and it departs sharply from the norm in cost-effectiveness analysis.

      Finally, I don’t understand the structural assumptions motivating the cost-effectiveness of the 10% decline analysis. The authors claim that the mechanism by which resilient foods save lives in the 10% decline analysis is that “the prices [of non-resilient food] would go so high that those in poverty may not be able to afford food” with the implication that resilient foods would be affordable to those in poverty and hence prevent starvation. However, the economic logic of this statement is unclear. It necessitates that the production costs of resilient food is less than the production costs of substitute non-resilient food at the margin, which further implies that producers of resilient food can command supernormal profits during the crisis, which is to say the authors are arguing that resilient foods represent potentially billions of dollars of value to their inventor within the inventor’s lifetime. It is not clear to me why a market-based solution would not emerge for the ‘do nothing’ scenario, which would be a critical issue with the authors’ case since it would remove the assumption that ‘resilient food’ and ‘AGI risk’ are alternative uses of the same money in the 10% scenario, which is necessary for their analysis to function. The authors make the further assumption that preparation for the 100% decline scenario is highly correlated with preparation for the 10% decline scenario, which would mean that a market-based solution emerging prior to nuclear exchange would remove the assumption that ‘resilient food’ and ‘AGI risk’ are alternative uses of the same money in the 100% decline scenario. A supply and demand model might have been a more appropriate model for investigating this effect. Once again, I note that the supply and demand model alone would have been an interesting and publishable piece of work in its own right.

      Overall, I think the paper would have benefitted from more attention being paid to the underlying theory of cost-effectiveness motivating the investigation. Decisions made in places seem to have multiplied uncertainty which could have been resolved with a more consistent approach to analysis. As I highlighted earlier, the issues only stem from the incredible ambition of the paper and the authors should be commended for managing to find a route to connect two separate microsimulations, an analysis of funding at the margin and a supply-and-demand model. Nevertheless, the combination of these three approaches weakens the ability to draw strong conclusions from each of these approaches individually.

      Methods

      With respect to methods, the authors use a Monte Carlo simulation with distributions drawn from a survey of field experts. The use of a Monte Carlo technique here is an appropriate choice given the significant level of uncertainty over parameters. The model appears appropriately described in the paper, and functions well (I have only checked the models in Guesstimate, as I could not make the secondary models in Analytica function). A particular highlight of the paper is the figures clearly laying out the logical interrelationship of elements of the model, which made it significantly easier to follow the flow of the argument. I note the authors use ‘probability more effective than’ as a key result, which I think is a natural unit when working in Guesstimate. This is entirely appropriate, but a known weakness of the approach is that it can bias in favour of poor interventions with high uncertainty. The authors could also have presented a SUCRA analysis which does not have this issue, but they may have considered and rejected this approach as unnecessary given the entirely one-sided nature of the results which a SUCRA would not have reversed.

      The presentation of the sensitivity analysis as ‘number of parameters needed to flip’ is nonstandard, but a clever way to intuitively express the level of confidence the authors have in their conclusions. Although clever, I am uncertain if the approach is appropriately implemented; the authors limit themselves to the 95% CI for their definition of an ‘unfavourable’ parameter, and I think this approach hides massive structural uncertainty with the model. For example, in Table 5 the authors suggest their results would only change if the probability of nuclear war per year was 4.8x10^-5 (plus some other variables changing) rather than their estimated of 7x10^-3 (incidentally, I think the values for S model and E model are switched in Table 5 – the value for pr(nuclear war) in the table’s S model column corresponds to the probability given in the E model). But it is significantly overconfident to say that risk of nuclear war per year could not possibly be below 4.8x10^-5, so I think the authors overstate their certainty when they say “reverting [reversing?] the conclusion required simultaneously changing the 3-5 most important parameters to the pessimistic ends”; in fact it merely requires that the authors have not correctly identified the ‘pessimistic end’ of any one of the five parameters, which seems likely given the limitations in their data which I will discuss momentarily. I personally would have found one- and two-dimensional threshold analysis a more intuitive way to present the results, but I think the authors have a reasonable argument for their approach. As described earlier, I have some concerns that an appropriate amount of structural sensitivity analysis was undertaken, but the presentation of uncertainty analysis is appropriate in its own terms (if somewhat nonstandard).

      Overall, I have no major concerns about the theory or application of the modelling approach. However, I have a number of concerns with the use of the survey instrument:

      First, the authors could have done more to explain the level of uncertainty their survey instrument contains. They received eight responses, which is already a very low number of responses for a quantitative survey. In addition, two of the eight responses were from authors of the paper. The authors discuss ‘response bias’ and ‘demand characteristic bias’ which would not typically be applied to data generated by an approximately autoethnographic process – it is obvious that the authors of a survey instrument know what purpose the instrument is to be used for, and have incentives to make the survey generate novel and interesting findings. It might have been a good sensitivity analysis to exclude responses from the authors and other researchers associated with ALLFED since there is a clear conflict of interest that could bias results here.

      Second, issues with survey data collection are compounded by the fact that some estimates which are given in the S Model are actually not elicited with the survey technique – they are instead cited to Denkenberger & Pearce (2016) and Denkenberger & Pearce (2017). This is described appropriately in the text, but not clearly marked in the summary Table 1 where I would expect to see it, and the limitation this presents is not described clearly. To be explicit, the limitation is that at least two key parameters in the model are based on a sample of the opinions of two of the eight survey respondents, rather than the full set of eight respondents. As an aside on presentation, the decision to present lower and upper credible intervals in Table 1 rather than median is non-standard for an economics paper, although perhaps this is a discipline-specific convention I am unaware of. Regardless, I’m not sure it is appropriate to present the lowest of eight survey responses as the ‘5th percentile’, as it is actually the 13th percentile and giving 95% confidence intervals implies a level of accuracy the survey instrument cannot reach. While I appreciate the 13th percentile of 8 responses will be the same as the 5th centile of 100 samples drawn from those responses, this is not going to be clear to a casual reader of the paper. ‘Median (range)’ might be a better presentation of the survey data in this table, with better clarity on where each estimate comes from. Alternatively, the authors could look at fitting a lognormal distribution to the survey results using e.g. method of moments, and then resample from the new distribution to create a genuine 95% CI. Regardless, given the low number of responses, it might have been appropriate simply to present all eight estimates for each relevant parameter in a table.

      Third, the authors could have done more to make it clear that the ‘Expert Model’ was effectively just another survey with an n of 1. Professor Sandburg, who populated the Expert Model, is also an author on this paper and so it is unclear what if any validation of the Expert Model could reasonably have been undertaken – the E model is therefore likely to suffer from the same drawbacks as the S model. It is also unclear if Professor Sandburg knew the results of the S Model before parameterising his E Model – although this seems highly likely given that 25% of the survey’s respondents were Professor Sandburg’s co-authors. This could be a major source of bias, since presumably the authors would prefer the two models to agree and the expert parameterising the model is a co-author. I also think more work is needed to be done establishing the Expert’s credentials in the field of agricultural R&D (necessary for at least some of the parameter estimates); although I happily accept Professor Sandburg is a world expert on existential risk and a clear choice to act as the parameterising ‘expert’ for most parameters, I think there may have been alternative choices (such as agricultural economists) who may have been more obviously suited to giving some estimates. There is no methodological reason why one expert had to be selected to populate the whole table, and no defence given in the text for why one expert was selected - the paper is highly multidisciplinary and it would be surprising if any one individual had expert knowledge of every relevant element. Overall, this limitation makes me extremely hesitant to accept the authors’ argument that the fact that S model and E model are both robust means the conclusion is equally robust

      Generally, I am sympathetic to the authors’ claim that there is unavoidable uncertainty in the investigation of the far future. However, the survey is a very major source of avoidable uncertainty, and it is not a reasonable decision of the authors to present the uncertainty due to their application of survey methods as the same kind of thing as uncertainty about the future potential of humanity. There are a number of steps the authors could have taken to improve the validity and reliability of their survey results, some of which would not even have required rerunning the survey (to be clear however, I think there is a good case for rerunning the survey to ensure a broader panel of responses). With the exception of the survey, however, methods were generally appropriate and valid.

      Parameter estimates

      Notwithstanding my concerns about the use of the survey instrument, I have some object level concerns with specific parameters described in the model.

      • The discount rate for both costs and benefits appears to be zero, which is very nonstandard in economic evaluation. Although the authors make reference to “long termism, the view that the future should have a near zero discount rate”, the reference for this position leads to a claim that a zero rate of pure time preference is common, and a footnote observing that “the consensus against discounting future well-being is not universal”. To be clear, pure time preference is only one component of a well-constructed discount rate and therefore a discount rate should still be applied for costs, and probably for future benefits too. Even notwithstanding that I think this is an error of understanding, it is a limitation of the paper that discount rates were not explored, given they seem very likely to have a major impact on conclusions.
      • A second concern I have relating to parameterisation is the conceptual model leading to the authors’ proposed costing for the intervention. The authors explain their conceptual model linking nuclear war risk to agricultural decline commendably clearly, and this expands on the already strong argument in Denkenberger & Pearce (2016). However, I am less clear on their conceptual model linking approximately $86m of research to the widescale post-nuclear deployment of resilient foods. The assumption seems to be (and I stress this is my assumption based on Denkenberger & Pearce (2016) – it would help if the authors could make it explicit) that $86m purchases the ‘invention’ of the resilient food, and once the food is ‘invented’ then it can be deployed when needed with only a little bit of ongoing training (covered by the $86m). This seems to me to be an optimistic assumption; there seems to be no cost associated with disseminating the knowledge, or any raw materials necessary to culture the resilient food. Moreover, the model seems to structurally assume that distribution chains survive the nuclear exchange with 100% certainty (or that the materials are disseminated to every household which would increase costs), and that an existing resilient food pipeline exists at the moment of nuclear exchange which can smoothly take over from the non-resilient food pipeline.

      I have extremely serious reservations about these points. I think it is fair to say that an economics paper which projected benefits as far into the future as the authors do here without an exploration of discount rates would be automatically rejected by most editors, and it is not clear why the standard should be so different for existential risk analysis. A cost of $86m to mitigate approximately 40% of the impact of a full-scale nuclear war between the US and a peer country seems prima facie absurd, and the level of exploration of such an important parameter is simply not in line with best practice in a cost-effectiveness analysis (especially since this is the parameter on which we might expect the authors to be least expert). I wouldn’t want my reservations about these two points to detract from the very good and careful scholarship elsewhere in the paper, but neither do I want to give the impression that these are just minor technical details – these issues could potentially reverse the authors’ conclusions, and should have been substantially defended in the text.

      Conclusions

      Overall, this is a novel and insightful paper which is unfortunately burdened with some fairly serious conceptual issues. The authors should be commended for their clear-sighted contextualisation of resilient foods as an issue for discussion in existential risk, and for the scope of their ambition in modelling. Academia would be in a significantly better place if more authors tried to answer far-reaching questions with robust approaches, rather than making incremental contributions to unimportant topics.

      Where the issues of the paper lie are structural weaknesses with the cost-effectiveness philosophy deployed, methodological weaknesses with the survey instrument and two potentially conclusion-reversing issues with parameterisation which should have been given substantially more discussion in the text. I am not convinced that the elements of the paper which are robust are sufficiently robust to overcome these weaknesses – my view is that it would be premature to reallocate funding from AI Risk reduction to resilient food on the basis of this paper alone. The most serious conceptual issue which I think needs to be resolved before this can happen is to demonstrate that ‘do nothing’ would be less cost-effective than investing $86m in resilient foods, given that the ‘do nothing’ approach would potentially include strong market dynamics leaning towards resilient foods. I agree with the authors that an agent-based model might be appropriate for this, although a conventional supply-and-demand model might be simpler.

      I really hope the authors are interested in publishing follow-on work, looking at elements which I have highlighted in this review as being potentially misaligned to the paper that was actually published but which are nevertheless potentially important contributions to knowledge. In particular, the AI subunit is novel and important enough for its own publication.


      Evaluator details

      1. Name: Alex Bates
      2. How long have you been in this field? In the field of cost-effectiveness analysis, 10 years. I wouldn’t consider myself to be in the field of x-risk
      3. How many proposals and papers have you evaluated? I’ve lost count, but probably mid double figures - perhaps 50?
    1. So as a schematic one might think of different Christmases overlaying each other:An “Franco-German-European Christmas”, freighted with cultural and historical weight. Let us call that the“politico-sentimental Christmas”. And there is an “Anglo-American Christmas” fashioned by 19th-century bourgeois culture and 20th-century mass commercialism and mass production - “organized Fordist Christmas”, which now extends through its supply chains around the world And, by the late 20th century, we have “global Christmas”. The majority of people celebrating Christmas today may not even be in the North Atlantic world, its original cradle. Christmas is now a global commercial event.

      Hm. Sure a political-religious Christmas was in there somewhere? Still useful

    1. Author Response

      Reviewer #2 (Public Review):

      Susswein et al. analyze a fine-scale, novel data stream of human mobility, openly available from Safegraph, based on the usage of mobile apps with GPS and sampled from over 45 million smartphone devices. They define a metric $\sigma_{it}$, properly normalized, that quantifies the propensity for visits to indoor locations relative to outdoor locations in a given county $i$ at week $t$. For each pair of counties $i$ and $j$, they compute the Pearson correlation coefficient $\rho_{ij}$ between the corresponding $\sigma$ metrics. This generates a correlation matrix that can be interpreted as the adjacency matrix of a network. They then perform community detection on this network/matrix, effectively clustering together time series that are correlated. This identifies three main clusters of counties, characterized geographically as either in the north of the country, in the south of the country, and possibly in tourism active areas. They then show, via a simple model, how including over-simplified models of seasonality may affect infectious disease models.

      This work is very interesting for the infectious disease modeling community, as it addresses a complex problem introducing a new data stream.

      This work builds on several strengths, among which:

      It is the first analysis of the Safegraph dataset to capture seasonality in indoor behavior.

      It provides a simple metric to quantify indoor activity, that thanks to the dataset can be computed with a high level of spatial detail.

      It aims at characterizing clusters of counties with a similar pattern of indoor activity.

      It aims at quantifying the impact of neglecting finer-scale patterns of seasonality, for example considering seasonality to be homogeneous at the US level.

      We thank the reviewer for the positive review of our work.

      At the same time, it presents several weaknesses that should be addressed to improve the methodology, its results, and the implication:

      There is no quantitative comparison of the newly introduced metric for indoor activity with other proxies of seasonality (e.g. temperature or relative humidity). The (dis)similarity with other proxies may help in assessing the importance of this metric, showing why it can not be exchanged with other data sources (like temperature data) that are widely available and are not affected by sampling issues (more on that later).

      We have now added supplementary figures (Figure S3) to illustrate how indoor activity seasonality compares with temperature and humidity. We have also added text to the Results and the Discussion to discuss this point.

      A major flow of the analysis is to perform community detection on a network defined by the correlation between time series with an algorithm that is based on modularity optimization. As explained in Macmahon et al.[1], all modularity optimization methods rely on null assumptions that in the case of correlation between time series are violated. Therefore, there is a very strong potential bias in their results that is not accounted for. Possible solutions could be to proceed via the methodology presented in [1] or via a different type of algorithm (e.g. Infomap [2]). In both cases, as the network is thresholded (considering only a correlation larger than 0.9), a more quantitative assessment of the impact of the threshold value should be included.

      References

      [1] Mel MacMahon and Diego Garlaschelli Phys. Rev. X 5, 021006 (2015).

      [2] Martin Rosvall and Carl T. Bergstrom PNAS 105, 1118 (2008).

      We thank the reviewer for making this excellent point. We have now added Supplementary Figures S13 and S14. In Figure S13, we demonstrate the robustness of our clustering results with different correlation thresholds. (We have also corrected a typo in our original Methods section which mistakenly stated our correlation threshold as 0.9 rather than the 90th percentile which is what we used.) In Figure S14, we show the clustering results using a different clustering algorithm. In an effort to test a non-network-based clustering approach, we use a hierarchical clustering approach and find a consistent partition of the US to our main results.

      It is not clear what is the added value of the data on indoor activity, as no fitting to real data is performed. Although this may be considered beyond the scope of this paper, I think it would be crucial to quantify how much a data-informed model would better describe real epidemic data (for example in the case of COVID-19). For now, only the impact of neglecting heterogeneity in indoor activity is shown, comparing a model with region-average parameters vs a model with county-level average parameters. Given that the dataset comes with potential bias in sampling (more on this later) it would be good to assess its goodness in predicting real epidemic spread. When showing results from different models, no visible errors are shown on the plot. How have the errors been estimated?

      We appreciate this point by the reviewer, and agree that future work will have to consider how indoor activity seasonality affects our ability to capture observed transmission trends. However, such work would additionally need careful characterization of other seasonal factors hypothesized to drive transmission (including environmental and other behavioral factors), and is beyond the scope of our work. Instead, in Figure 4 we aim to (a) provide the infectious disease modeling community with empirically-inferred parameters for a simple sinusoidal model which is commonly used in infectious disease models to capture transmission seasonality; and (b) demonstrate the implications of ignoring geographic heterogeneity in transmission seasonality in theoretical models of disease dynamics, which are commonly used for scenario analysis and model-based intervention design. As we demonstrate, transmission seasonality described by such sinusoidal models, even when they are empirically characterized as in our case, can lead to meaningfully different epidemic dynamics when transmission seasonality varies from the assumptions.

      Additionally, there is no uncertainty included in Figure 4B because transmission seasonality is either based on empirical data point per time step, or on the fitted sinusoidal model (where the estimated parameters have negligible standard errors).

      The dataset is presented as representative of the US population. However, this has not been assessed over time. As adherence to social distancing is influenced by several socio-economic determinants the lack of representativity in certain strata of the population at a given time may introduce an important bias in the dataset. Although this is an inherent limitation of the dataset, it should be discussed in the paper more thoroughly.

      We agree with the reviewer that this is a limitation. However, we do not have any way of assessing demographic representation in the dataset over time. We have instead included an additional sentence into the Discussion section acknowledging this point.

      In conclusion, I think that the methodology should be revised to account for the fact that the analysis is performed on a correlation matrix. Capturing seasonal patterns of indoor activity can help in tackling the crucial problem of seasonality in human behavior. This could help in identifying effective strategies of disease containment able to curb disease spread at a lower societal cost than fully-fledged lockdowns.

      We thank the reviewer again for their helpful suggestions.

    1. the atman as i 00:20:34 said it's the witness the agent the enjoyer most importantly it's distinct from our body and mind it's their uh it's their owner and it's a permanent continuous thing unlike our bodies and minds which 00:20:48 are changing from moment to moment so they've got this kind of momentary impermanence but also as you may know they each come to an end we die um but the idea is that the self just 00:20:59 persists and goes on and on um and most importantly most most importantly when we identify the atman we're identifying what you are your essence or your core and so we might 00:21:13 think i change a lot my thoughts change my political preferences change my food preferences change my friends change but i remain the same as a self

      !- explanation of : Atman -the thing that remains the same while everything else changes

    1. Author Response

      Reviewer #1 (Public Review):

      In the current study, the authors reanalyze a prior dataset testing effects of D2 antagonism on choices in a delay discounting task. While the prior report using standard analysis, showed no effects, the current study used a DDM to examine more carefully possible effects on different subcomponents of the decision process. This approach revealed contrasting effects of D2 blockade on the effect of reward size differences and bias. Effects were uncorrelated, suggesting separate mechanisms perhaps. The authors speculate that these opposing effects explain the variability in effects across studies, since they mean that effects would depend on which of these factors is more important in a particular design. Overall the study is novel and well-executed, and the explanation offers interesting insight into neural processes.

      We thank the reviewer for judging our study as interesting and well-executed.

      Reviewer #2 (Public Review):

      The authors aim to test the hypothesis that dopamine mediates the evaluation of temporal costs in intertemporal choice in humans, with a specific goal of synthesizing the competing accounts and previous results regarding whether dopamine increases or decreases evaluation of delays in comparing differently delayed future rewards. To do this, they computationally dissect the impact of the drug amisulpride, a D2R antagonist, using a variant of a sequential sampling model, the drift-diffusion model (DDM), that is well established in decision-making literature as a cognitive process model of choice. This model allows the dissociation of starting bias from the rate at which decision evidence is integrated ('drift'), which the authors map to different accounts of the role of dopamine: the temporal proximity of an outcome is proposed to impact bias, while the cost of a delay to impact the drift rate of evidence evaluation/accumulation. Consistent with previous results, and perhaps integrating conflicting findings, the authors find that d2R blockade impacts both bias and drift rate in a cohort of 50 participants, demonstrating dopaminergic action at this receptor is implicated in dissociable components of intertemporal choice, with D2R block reducing the bias towards sooner, more temporally proximate rewards as well as enhancing the contrast between reward magnitudes irrespective of delay, effectively diminishing the effect of delay in the drug condition. These effects are consistent across a small subset of alternative models, confirming the multiple cognitive mechanisms through which D2R block impacts intertemporal choice is a robust feature of decisions on this task.

      Overall, this study is a detailed dissection of the specific effects of amisulpride on a type of future-oriented, hypothetical intertemporal choice, and provides consistent evidence integrating conflicting accounts that implicate dopaminergic signaling on evaluation of the cognitive costs, such as a delay, on choice. However the specificity of the empirical intervention and the task design limits the interpretation of the broader dopaminergic mechanisms at play in intertemporal choice, especially given the complexity of receptor specificity of this drug, dopamine precursor availability and individual differences and the specifics of the intertemporal choice in this task. As it stands, the results contribute an interesting, synthesized account of how D2R manipulation can impact evaluation of delays in multiple ways, that will likely be useful for motivating future studies and more detailed computational assessments of the cognitive process-level components of intertemporal choice more generally.

      We thank the reviewer for the positive overall evaluation of our study. We revised the manuscript according to the reviewer’s comments, addressing also the receptor specificity of amisulpride and the specifics of the administered intertemporal choice task, which further improved the quality of the manuscript.

      The focus of this study is important, and delineating the role of DA in intertemporal choice is of high relevance given DA disfunction is prevalent in many psychiatric disorders and a key target of pharmacological treatment. While the hypotheses of the current study are framed with respect to "costs", the task used by the authors reduces these to evaluation of a hypothetical delay, one which the participants do not necessarily experience in the context of the task. In some respects this is reasonable, given the prevalence of this task paradigm in testing temporal aspects of choice in humans in an economic sense. However, humans are also notoriously subject to framing effects and the impact of instructions in cognitive tasks like these, which can limit the generality of the conclusions, and in particular the specific ways in which a delay can be interpreted as costly (for eg cost as loss of potential earnings, cost as effortful waiting, cost as computational/simulation cost in future evaluation). Given the hypothesis recruits the idea of cost in assessing the role of dopamine, testing for generality in the effects of amisulpride in related but differently framed tasks seems critical for making this link in a general sense, and in connecting it to the previous studies in the literature the authors point to as demonstrating conflicting effects.

      We agree that it is important to discuss whether our findings for delay costs can be generalized to other costs types as well, such as risk, social costs, effort, or opportunity costs. Based on a recent literature review (Soutschek, Jetter, & Tobler, 2022), we speculate that dopamine may moderate proximity effects also for risk and social costs but not for effortful rewards, though we emphasize that these hypotheses still require more direct empirical evidence. We also discuss the issue that delays can be perceived as costly in different ways. While in some tasks participants actually experience the waiting time until reward delivery, such that delayed rewards are associated with opportunity costs, in our current task paradigm delayed rewards were virtually free of opportunity costs as participants could engage in other reward-related behaviors during the waiting time. Previous studies suggest that lower tonic dopamine levels reduce the sensitivity to opportunity costs (Niv et al., 2007), which seems in line with our finding that amisulpride decreases the influence of delays on the starting bias parameter. Nevertheless, we emphasize that further evidence is needed to decide whether dopamine shows similar effects for experienced and non-experienced waiting costs. In the revised manuscript, we discuss the cost specificity of our findings on p.22:

      “An important question refers to whether our findings for delay costs can be generalized to other types of costs as well, including risk, social costs (i.e., inequity), effort, and opportunity costs. In a recent review, we proposed that dopamine might also moderate proximity effects for reward options differing in risk and social costs, whereas the existing literature provides no evidence for a proximity advantage for effort-free over effortful rewards (Soutschek et al., 2022). However, these hypotheses need to be tested more explicitly by future investigations. Dopamine has also been ascribed a role for moderating opportunity costs, with lower tonic dopamine reducing the sensitivity to opportunity costs (Niv et al., 2007). While this appears consistent with our finding that amisulpride (under the assumption of postsynaptic effects) reduced the impact of delay on the starting bias, it is important to note that choosing delayed rewards did not involve any opportunity costs in our paradigm, given that participants could pursue other rewards during the waiting time. Thus, it needs to be clarified whether our findings for delayed rewards without experienced waiting time can be generalized to choice situations involving experienced opportunity costs.”

      Further, while the study aims to test the actions of dopamine broadly, the empirical manipulation is limited to the action of amisulpride, a D2R anatgonist. There is little to no discussion of, or control for, the relationship between dopaminergic action at D2 receptors (the site of amisulpride effects) and wider mechanisms of dopaminergic action at other sites eg D1-like receptors, and the interplay between activation at these two receptor types alongside baseline levels of dopamine concentration. This is necessary for a comprehensive account of dopamine effects on intertemporal choice as the authors aim to test, as opposed to a specific test of the role of the D2 receptor, which is what the study achieves. On a related note, in some preparations at least, amisulpride also acts at some of the 5-HT receptors, raising the possibility of a non-dopaminergic mechanism by which this drug might impact intertemporal decisions. This possibility, while it would not be expected to act without dopaminergic effects as well, is consistent with established effects of serotonin on waiting behaviors and patience. Granted, the limits of pharmacology in humans does not necessarily mean this can be controlled for, it should be kept in mind with a systemic manipulation such as this.

      We agree with the reviewer that it is important to distinguish between the contributions of D1 and D2 receptors to decision making, given that these receptor families are hypothesized to have dissociable functional roles. We therefore re-analyzed also data on the impact of a D1 agonist on intertemporal decision making (previous findings for this data set were published in Soutschek et al., 2020, Biological Psychiatry). This analysis provided no evidence for significant effects of D1R stimulation on parameters from a drift diffusion model. This suggests that D2R, rather than D1R, activation mediates the impact of proximity on intertemporal choices.

      In the revised manuscript, we report the findings for the D1 agonist study on p.16:

      “To assess the receptor specificity of our findings, we conducted the same analyses on the data from a study (published previously in Soutschek et al. (2020)) testing the impact of three doses of a D1 agonist (6 mg, 15 mg, 30 mg) relative to placebo on intertemporal choices (between-subject design). In the intertemporal choice task used in this experiment, the SS reward was always immediately available (delay = 0), contrary to the task in the D2 experiment where the delay of the SS reward varied from 0-30 days. Again, the data in the D1 experiment were best explained by DDM-1 (DICDDM-1 = 19,657) compared with all other DDMs (DICDDM-2 = 20,934; DICDDM-3 = 21,710; DICDDM-5 = 21,982; DICDDM-6 = 19,660; note that DDM-4 was identical with DDM-1 for the D1 agonist study because the delay of the SS reward was 0). Neither the best-fitting nor any other model yielded significant drug effects on any drift diffusion parameter (see Table 4 for the best-fitting model). Also model-free analyses conducted in the same way as for the D2 antagonist study revealed no significant drug effects (all HDI95% included zero). There was thus no evidence for any influence of D1R stimulation on intertemporal decisions.”

      We discuss the specificity of D2 receptors for moderating the proximity bias on p.17: “This finding represents first evidence for the hypothesis that tonic dopamine moderates the impact of proximity (e.g., more concrete versus more abstract rewards) on cost-benefit decision making (Soutschek et al., 2022; Westbrook & Frank, 2018). Pharmacological manipulation of D1R activation, in contrast, showed no significant effects on the decision process. This provides evidence for the receptor specificity of dopamine’s role in intertemporal decision making (though as caveat it is worth keeping the differences between the tasks administered in the D1 and the D2 studies in mind).”

      We also agree that amisulpride acts also on 5-HT7 receptors, such that it remains unclear whether also such effects contribute to the observed result pattern. We discuss this limitation in the revised manuscript on p.21:

      “Lastly, while the actions of amisulpride on D2/D3 receptors are relatively selective, it also affects serotonergic 5-HT7 receptors (Abbas et al., 2009). Because serotonin was related to impulsive behavior (Mori, Tsutsui-Kimura, Mimura, & Tanaka, 2018), it is worth keeping in mind that amisulpride effects on serotonergic, in addition to dopaminergic, activity might contribute to the observed result pattern.”

      Overall the modeling methods are robust and appropriate for the specific test of decision impacts of D2R blockade, and include several prima facie variable alternative models for comparison. Some caution is warranted, since there are not many trials per subject, and some trials are discarded as well as outliers, which raises the question of power. Given the models are fit hierarchically, which gives both group-level and individual-level parameter estimates, the elements are there to probe more deeply into individual differences, and to test how reliably this approach can dissociate the dual effects of bias and drift rate at the individual level, and perhaps correlate it with other informative subject measures of either dopamine activity/capacity or other dopamine-dependent behaviors. Alternative DDMs might also capture some of this individual variation, with meaningful differences potentially in model comparison at the individual level. It should be noted that the scope of these models do not exhaust the ways in which proximity (here, temporal) of rewards and contrast between choice options might be incorporated into a cognitive process model account of choice; all alternatives here rest on the same implicit 2-alternative forced choice assumption of the DDM, and the assumptions of this model are not here tested against other accounts of choice, for example the linear ballistic accumulator (LBA) and its derivatives. Further, the concept of proximity as a global feature of a trial (on average, how soon are these options overall?) is never tested on my read of the alternative models.

      We thank the reviewer for these interesting suggestions. First, to explore whether measures of dopaminerigc activity correlate with individual differences in drug effects on DDM parameters, we now report correlations between DDM parameters and performance in the digit span backward task as proxy for dopamine synthesis capacity (Cools et al., 2008). None of these correlation analyses showed significant results. In the revised manuscript, we report these analyses on p.13:

      “However, we observed no evidence that individual random coefficients for the drug effects on the drift rate or on the starting bias correlated with body weight, all r < 0.22, all p > 0.10. There were also no significant correlations between DDM parameters and performance in the digit span backward task as proxy for baseline dopamine synthesis capacity (Cools, Gibbs, Miyakawa, Jagust, & D'Esposito, 2008), all r < 0.17, all p > 0.22. There was thus no evidence that pharmacological effects on intertemporal choices depended on body weight as proxy of effective dose or working memory performance as proxy for baseline dopaminergic activity.”

      Regarding model comparisons on the individual level, we note that the hierarchical Bayesian modelling approach allows (to the best of our knowledge) computing indices of model fit like DIC only on the group, not the individual level (while accounting for individual differences). However, we agree with the reviewer that theoretically different models might work best in different individuals (depending, for example, on the individual sensitivity to proximity). While such fine-grained model comparisons on the individual level are beyond the scope of the current study (and might not yield robust results given the limited number of trials for each participant), we now discuss this limitation in the revised manuscript (p.17-18):

      “We note that the hierarchical modelling approach allowed us to compare models on the group level only, such that in some individuals behavior might better be explained by a different model than DDM-1. Such model comparisons on the individual level, however, were beyond the scope of the current study and might not yield robust results given the limited number of trials per individual.”

      Likewise, linear ballistic accumulator (LBA) models represent a further class of process models with different assumptions on the mechanisms underlying the choice process than DDMs. In LBAs, evidence is accumulated separately for each choice alternative, whereas DDMs assume only one accumulation process which integrates attributes from two choice options, limiting the use of DDMs to two-alternative forced-choice scenarios. Nevertheless, proximity effects might be incorporated also in LBA models via modulating the starting point of the option-specific accumulators as a function of proximity. To the best of our knowledge, there is no built-in function in JAGS that allows estimating LBA models in a hierarchical Bayesian fashion (in contrast to, e.g., STAN), such that in the context of the current study it is difficult to directly compare our DDM-based approach with LBA models. It is importance to emphasize, however, that similar to other studies we do not make any claims about whether the choice process per se is best explained by DDMs or LBA models; instead, we focus on how rewards and delay costs affect different components of the decision process within a class of decision models. Nevertheless, we discuss such alternative modelling approaches in the revised manuscript on p.18:

      “We also emphasize that alternative process models like the linear ballistic accumulator (LBA) model make different assumptions than DDMs, for example by positing the existence of separate option-specific accumulators rather than only one as assumed by DDMs. However, proximity effects as investigated in the current study might be incorporated in LBA models as well by varying the starting points of the accumulators as function of proximity.”

      Lastly, we thank the reviewer for the interesting suggestion to assess whether the starting bias parameter is affected by the overall proximity of offers (sum of delays) instead of by the difference in proximity between the options. We ran a further DDM to test this hypothesis, but this model explained the data worse (DIC = 9,492) than the original DDM (DIC = 9,478). Nevertheless, also the overall proximity DDM yielded a significant amisulpride effect on the impact of reward magnitude on the drift rate, HDImean = 0.83, HDI95% = [0.04; 1.75], underlining the robustness of this effect. In the revised manuscript, we report this analysis on p.12:

      “In a further model (DDM-4), we explored whether the starting bias is affected by the overall proximity of the options (sum of delays, Delaysum) rather than the difference in proximity (Delaydiff; see Table 3 for an overview over the parameters included in the various models). Importantly, our original DDM-1 (DIC = 9,478) explained the data better than DDM-2 (DIC = 9,481), DDM-3 (DIC = 10,224), or DDM-4 (DIC = 9,492). Nevertheless, amisulpride moderated the impact of Magnitudediff on the drift rate also in DDM-2, HDImean = 0.86, HDI95% = [0.18; 1.64], and DDM-4, HDImean = 0.83, HDI95% = [0.04; 1.75], and amisulpride also lowered the impact of Delaydiff on the starting bias in DDM-3, HDImean = -0.02, HDI95% = [-0.04; -0.001]. Thus, the dopaminergic effects on these subcomponents of the choice process are robust to the exact specification of the DDM.”

      Reviewer #3 (Public Review):

      Soutschek and Tobler provide an intriguing re-analysis of inter-temporal choice data on amisulpride versus placebo which provides evidence for an as-yet untested hypothesis that dopamine interacts with proximity to bias choices.

      The modeling methods are sound with a robust and reasonably exhaustive set of models for comparison, with good posterior predictive checks at the single subject level, and decent evidence of parameter recoverability. Importantly, they show that while there is no main effect of drug on the proportion of larger, later (LL) versus smaller, sooner (SS) choices, this obscures conflicting-directional effects on drift rate versus starting point bias which are under-the-hood, yet anticipated by the hypothesis of interest.

      We thank the reviewer for judging our findings as intriguing and the modelling approach as robust and convincing.

      While I have no major concerns about methodology, I think the Authors should consider an alternative interpretation - albeit an interpretation which would actually support the hypothesis in question more directly than their current interpretation. Namely, the Authors should re-consider the possibility that amisulpride's effects are mediated primarily by acting at pre-synaptic receptors. If the D2R antagonist were to act pre-synaptically, it would drive more versus less post-synaptic dopamine signaling.

      There are multiple reason for this inference. First, the Authors observe that the drug increases sensitivity to differences in the relative offer amounts (in terms of effects on the drift rate). With respect to the canonical model of dopamine signaling in the direct versus indirect pathway, greater post-synaptic signaling should amplify sensitivity to reward benefits - which is what the Authors observe.

      Second, the Authors also observe an effect on the starting bias which may also be consistent with an increase in post-synaptic dopamine signaling. Note that according to the Westbrook & Frank hypothesis, a proximity bias in delay discounting should favor the SS over the LL reward, yet the Authors primarily observe a starting bias in the direction of the LL reward. This contradiction can be resolved with the ancillary assumption that, independent of any choice attribute, participants are on average predisposed to select the LL option. Indeed, the Authors observe a reliable non-zero intercept in their logistic regression model indicating that participants selected the LL more often, on average. As such, the estimated starting point may reflect a combination of a heightened predisposition to select the LL option, opposed by a proximity bias towards the sooner option. Perhaps the estimated DDM starting point is positive because the predisposition to select the LL option has a larger effect on choices than the proximity bias towards sooner rewards does in this data set. To the extent that amisulpride increases post-synaptic dopamine signaling (by antagonizing pre-synaptic D2Rs) it should amplify the proximity bias arising from the differences in delay, shifting the starting bias towards the SS option. Indeed, this is also what the Authors observe.

      Note that it remains unclear why an increase in post-synaptic dopamine signaling would amplify one kind of proximity bias (towards sooner over later rewards) without amplifying the other (towards a predisposition to select the LL option). Perhaps the cognitive / psychological nature of the sooner bias is more amenable to interacting with dopamine signaling than the latter. Or maybe proximity bias effects are most sensitive to dopamine signaling when they are smaller, and the LL predisposition bias is already at ceiling in the context of this task. These assumptions would help explain why a potential increase in post-synaptic dopamine signaling both amplified the proximity effect of delay when it was smallest (when the differences in delay were smaller), and also failed to amplify the predisposition to select the LL option (which may already be maxed out). More importantly, the assumption that there are opposing proximity biases would also help explain why there is a negative effect of delay magnitude on the estimated starting point on placebo. Namely - as the delay gets larger, the psychological proximity of sooner over later rewards grows, counteracting the proximity bias arising from choice predisposition / repetition.

      We thank the reviewer for suggesting this alternative interpretation of our data. We agree that the administered dose of 400 mg amisulpride can show both postsynaptic (reducing D2R activation) and presynaptic effects (enhancing D2R activation), which in many studies makes it difficult to decide whether the observed behavioral effects are caused by presynaptic or postsynaptic mechanisms.

      The reviewer suggests that the observed stronger influence of reward magnitudes on drift rates under amisulpride compared with placebo speaks in favor of presynaptic effects, because according to theoretical accounts higher dopamine levels should increase reward seeking (e.g., Frank & O’Reilly, 2006). On the other hand, Figure 2C suggests that amisulpride (compared with placebo) increased the preference only for relatively high, above-average rewards. If the difference between reward magnitudes was below average, amisulpride reduced rather than increased the preference for the larger reward. In our view, this is consistent with the hypothesis that D2R activation implements a cost control, with higher D2R activation increasing the attractiveness of costly rewards and lower D2R activation reducing it. In other words, under low dopamine levels individuals should decide for the costlier reward only if the magnitude of the costlier reward is sufficiently large compared with the lower, less costly reward. In fact, this is exactly what we find in our data according to Figure 2C. In our view, the amisulpride effect on drift rates is thus compatible with both presynaptic and postsynaptic mechanisms of action, depending on the underlying conceptual account of dopamine, as we now discuss in the revised manuscript.

      According to the reviewer, also the observed influence of amisulpride on the starting bias speaks in favor of increased rather than reduced dopamine levels. We agree with the reviewer that the result pattern for the starting bias is somewhat complex and seems to combine the effects of two different biases: a general tendency to choose LL over SS rewards (intercept of starting bias where the difference in delays is close to zero), and a shift towards the SS option under placebo if one options has a strong (temporal) proximity advantage over the other. Amisulpride shows opposite effects on the two different biases, as it shifts the intercept of the starting bias further away from the LL option but also reduces the proximity advantage of the SS over the LL reward for larger differences in delay. The reviewer writes that “To the extent that amisulpride increases post-synaptic dopamine signaling (by antagonizing pre-synaptic D2Rs) it should amplify the proximity bias arising from the differences in delay, shifting the starting bias towards the SS option. Indeed, this is also what the Authors observe.” In contrast to that statement, in our study amisulpride reduced rather than increased the starting bias arising from delay (as in Figure 2K the regression line is flatter under amisulpride compared with placebo, despite the differences regarding the intercept). We believe that the amisulpride effects on both the intercept and the delay-dependent slope can be explained via postsynaptic effects: First, the shift of the intercept of the starting bias (small differences in proximity) from the LL towards the SS option under amisulpride is consistent with the assumption that lower dopamine reduces the preference for larger reward (e.g., Beeler & Mourra, 2018; Salamone & Correa, 2012). Second, the finding that amisulpride weakens the proximity advantage of SS over LL rewards (delay-dependent slope) is consistent with the proximity account by Westbrook & Frank (2018) according to which lower tonic dopamine should reduce proximity effects. Thus, if we assume that the result pattern for the starting bias parameter is driven by dopaminergic effects on two separate decision biases (as suggested by the reviewer), we believe that both effects can better be explained by pharmacologically reduced rather than increased dopamine levels.

      In the revised manuscript, we extensively discuss the question as to whether the observed drug effects are caused by postsynaptic versus presynaptic effects. We clarify that the amisulpride effect on drift rates seems consistent with both presynaptic and postsynaptic effects (depending on the underlying conceptual account). We moreover discuss that the starting bias effects may reflect the interaction between two different bias types, and the drug effects on both bias types can more easily be reconciled with postsynaptic than presynaptic effects. On balance, we believe that the observed effects are more likely to reflect lower as compared to higher dopamine levels, but the extended discussion of this issue gives all readers the opportunity to weigh the arguments for and against these alternatives. If the reviewer should not agree with some aspects of our argumentation as outlined above, we would of course be happy to modify the discussion according to the reviewer’s advice.

      In the revised manuscript, we modified the discussion of presynaptic versus postsynaptic effects as follows (p.20-21):

      “While higher doses of amisulpride (as administered in the current study) antagonize post-synaptic D2Rs, lower doses (50-300 mg) were found to primarily block pre-synaptic dopamine receptors (Schoemaker et al., 1997), which may result in amplified phasic dopamine release and thus increased sensitivity to benefits (Frank & O'Reilly, 2006). At first glance, the stronger influence of differences in reward magnitude on drift rates under amisulpride compared with placebo might therefore speak in favor of presynaptic (higher dopamine levels) rather than postsynaptic mechanisms of action in the current study. On the other hand, one could argue that amisulpride reduced the preference for the LL reward if the gain from the costlier LL option compared with the SS option was small (as suggested by Figure 2C), which is consistent with the cost control hypothesis of dopamine (Beeler & Mourra, 2018). The impact of amisulpride on the drift rate thus appears ambiguous regarding the question of pre- versus postsynaptic effects. The result pattern for the starting bias parameter, in turn, suggests the presence of two distinct response biases, reflected by the intercept and the delay-dependent slope of the bias parameter (see Figure 2K), which are both under dopaminergic control but in opposite directions. First, participants seem to have a general bias towards the LL option in the current task (intercept), which is reduced under amisulpride compared with placebo, consistent with the assumption that dopamine strengthens the preference for larger rewards (Beeler & Mourra, 2018; Salamone & Correa, 2012; Schultz, 2015). Second, amisulpride reduced the proximity advantage of SS over LL rewards with increasing differences in delay, as predicted by the proximity account of tonic dopamine (Westbrook & Frank, 2018). On balance, the current results thus appear more likely under the assumption of postsynaptic rather than presynaptic effects. Unfortunately, the lack of a significant amisulpride effect on decision times (which should be reduced or increased as consequence of presynaptic or postsynaptic effects, respectively) sheds no additional light on the issue.”

      Regardless of the final interpretation, showing that pharmacological intervention into striatal dopamine signaling can simultaneously modify a starting point bias and drift rate (in opposite directions - thus having systematic effects on choice biases without altering the average proportion of LL choices) provides crucial first evidence for the hypothesis that dopamine and proximity interact to influence decision-making. These results thereby enrich our understanding of the neuromodulatory mechanisms influencing inter-temporal choice, and take an important step towards resolving prior contradictions in this literature. They also have implications for how striatal dopamine might impact decision-making in diverse domains of impulsivity beyond inter-temporal choice, ranging from cognitive neuroscience (e.g. in numerous cognitive control tasks) to psychiatry (treating diverse disorders of impulse control).

      We thank the reviewer for highlighting the importance of the current findings for understanding dopamine’s role in decision making.

  5. Dec 2022
    1. Reviewer #3 (Public Review):

      A problem in synthetic ecology is that one can't brute-force complex community design because combinatorics make it basically impossible to screen all possible communities from a bank of possible species. Therefore, we need a way to predict phenomena in complex communities from phenomena in simple communities. This paper aims to improve this predictive ability by comparing a few different simple models applied to a large dataset obtained with the use of the author's "kchip" microfluidics device. The main question they ask is whether the effect of two species on a focal species is predicted from the mean, the sum, or the max of the effect of each single "affecting" species on the focal species. They find that the max effect is often the best predictor, in the sense of minimizing the difference between predicted effect and measured effect. They also measure single-species trait data for their library of strains, including resource niche and antibiotic resistance, and then find that Pearson correlations between distance calculations generated from these metrics and the effect of added species are weak and unpredictive. This work is largely well-done, timely and likely to be of high interest to the field, as predicting ecosystem traits from species traits is a major research aim.

      My main criticism is that the main take-home from the paper (fig 3B)-that the strongest effect is the best predictor-is oversold. While it is true that, averaged over their six focal species, the "strongest effect" was the best overall predictor, when one looks at the species-specific data (S9), we see that it is not the best predictor for 1/3 of their focal species, and this fraction grows to 1/2 if one considers a difference in nRMSE of 0.01 to be negligible.

      The same criticism applies to the result from figure 2-that pairs of affecting species have more negative effects than single species. Considered across all focal species this is true (though minor in effect size, Fig 2A). But there is only a significant effect within two individual species. Again, this points to the effects being focal-species-specific, and perhaps not as generalizable as is currently being claimed.

      Another thing that points to a focal-species-specific response is Fig 2D, which shows the distributions of responses of each focal species to pairs. Two of these distributions are unimodal, one appears bimodal, and three appear tri-modal. This suggests to me that the focal species respond in categorically different ways to species addition.

      These differences occur even though the focal bacteria are all from the same family. This suggests to me that the generalizability may be even less when a more phylogenetically dispersed set of focal species are used.

      Considering these points together, I argue that the conclusion should be shifted from "strongest effect is the best" to "in 3 of our focal species, strongest effect was the best, but this was not universal, and with only 6 focal species, we can't know if it will always be the best across a set of focal species".

      My second main criticism is that it is hard to understand exactly how the trait data were used to predict effects. It seems like it was just pearson correlation coefficients between interspecies niche distances (or antibiotic distances) and the effect. I'm not very surprised these correlations were unpredictive, because the underlying measurements don't seem to be relevant to the environment tested. What if, rather than using niche data across 20 nutrients, only the growth data on glucose (the carbon source in the experiments) was used? I understand that in a field experiment, for example, one might not know what resources are available, and so measuring niche across 20 resources may be the best thing to do. Here though it seems imperative to test using the most relevant data.

      Additionally and relatedly, it would be valuable to show the scatterplots leading to the conclusion that trait data were uninformative. Pearson's r only works on an assumption of linearity. But there could be strong relationships between the trait data and effect that are monotonic but not linear, or even that are non-monotonic yet still strong (e.g. U-shaped). For the first case, I recommend switching to Spearman's rho over Pearson's r, because it only assumes monotonicity, not linearity. If there are observable relationships that are not monotonic, a different test should be used.

      In general, I think the analyses using the trait data were too simplistic to conclude that the trait data are not predictive.

    1. Author Response

      Reviewer #1 (Public Review):

      This paper shows that nuclear pore complex components are required for Kras/p53 driven liver tumors in zebrafish. The authors previously found that nonsense mutation in ahctf1 disrupted nuclear pore formation and caused cell death in highly proliferative cells in vivo. In the absence of this gene, there are multiple mitotic functions involving the nuclear pore that are defective, leading to p53 dependent cell death. Heterozygous fish are viable but have reduced kras/p53 liver tumor growth, and this is associated with multiple nuclear and mitotic defects that lead to cancer cell death/lack of growth. This therapeutic window suggests targetability of this pathway in cancer. I think the data are robust, rigorous, and clearly presented. I believe this in vivo work will encourage therapeutic targeting of NPCs in cancer.

      We are pleased that this reviewer believes that our data are robust, rigorous, and clearly presented and that our in vivo work will encourage therapeutic targeting of NPCs in cancer.

      Reviewer #2 (Public Review):

      Overall this is a very interesting and important paper that demonstrates a novel synthetic interaction between nucleoporin inhibition and oncogene-driven hyperproliferation. This work is especially significant because of the paucity of effective treatments for hepatocellular carcinoma (HCC). The authors' demonstration that the Nup inhibitor Selinexor decreases larval liver size in KRAS-overexpressing zebrafish but does not cause toxicity in wild-type animals lays the groundwork for exploiting this class of drugs in HCC treatment. This paper represents an elegant demonstration of the utility of zebrafish models in cancer studies. The relevance of this work to human cancer is supported by the authors' studies using TCGA data, wherein they demonstrate that decreased NUP expression is associated with increased survival in HCC.

      Other major strengths of the paper include beautiful pictures demonstrating that ahctf1+/- decreases the density and volume of nuclear pores in TO(kras) larvae and increases the rate of multipolar spindle formation, misaligned chromosomes, and anaphase bridges. The experiments are very well-controlled, including detailed analysis of the effects of ahctf1 heterozygosity and Selinexor on wild-type animals. The inclusion of distinct methods for disruption nucleoporins (ranbp2 heterozygosity and drug treatment) bolsters the authors' conclusion that this represents a viable drug target in HCC.

      My major concerns are as follows:

      1) The authors state that "the beneficial effect of ahctf1 heterozygosity to reduce tumour burden persists in the absence of functional Tp53, due to compensatory increases in the levels of tp63 and tp73". However, tp63 and tp73 appear similarly upregulated in ahctf1 heterozygotes regardless of tp53 status. The authors do not provide enough evidence that tp63 and tp73 are compensating for tp53 loss. An alternative possibility based on the data presented is that the effects of ahctf1+/- are independent of tp53 family members, and the effects on apoptosis go through a different pathway.

      We agree with this reviewer that we did not provide enough evidence that tp63 and tp73 are compensating for tp53 loss. Accordingly, we have addressed this issue comprehensively.

      2) The authors state in multiple locations that nucleoporin inhibition decreases tumor burden. In my opinion, this is not strictly correct. The TO(kras) model clearly results in HCC in adults, but it's a little unclear whether the larval liver overgrowth is truly HCC or not based on the original paper by Nguyen et al. (2012 Dis Model Mech).

      We agree with these comments and accordingly, we performed several new experiments in adult fish.

      Reviewer #3 (Public Review):

      The nuclear transport machinery is aberrantly regulated in many cancers in a context-dependent fashion, and mounting evidence with cultured cell and animal models indicates that reducing the activity or expression of certain nuclear transport proteins can selectively kill cancer cells while sparing nontransformed cells. Here the authors further explore this concept using a zebrafish model for hepatocellular carcinoma (HCC) induced by liver-specific transgenic expression of oncogenic krasG12V. The transgene causes greatly increased liver size by day 7 in larvae, associated with a gene expression profile that resembles early-stage human HCC. This study focuses on Ahctf1, a nuclear pore complex (NPC) protein known to be essential for postmitotic NPC assembly. Using the krasG12V background, the authors analyze animals that are heterozygous for a recessive mutation in the ahctf1 gene that leads to ~50% reduction in ahctf1 mRNA (and likely the encoded protein). The authors show that the ~4-fold increase in liver volume of krasG12V animals is reduced by ~1/3 in the ahctf1 heterozygous mutants. This is associated with increased apoptosis, decreased DNA replication, up-regulation of pro-apoptotic and cdk-inhibitor genes, and down-regulation of anti-apoptotic gene. These effects found to be substantially Tp53-dependent. Consistent with previous Ahctf1 depletion studies, hepatocytes of ahctf1 heterozygotes show decreased NPC density at the nuclear surface, elevated levels of aberrant mitoses and increased DNA damage/double stranded breaks. Finally, the authors show that combining the achtf1 heterozygous mutant with a heterozygous mutation in another NPC protein- RanBP2- or treating animals with a chemical inhibitor of exportin-1 (Selinexor) can further reduce liver volume. Overall they suggest that combinatorial targeting of the nuclear transport machinery can provide a therapeutic approach for targeting HCC.

      This is an interesting study that bolsters the notion that reduction in the levels of discrete nucleoporins (and/or inhibiting specific nuclear transport pathways) can result in cancer cell-selective killing. Moreover, the work extends previous studies involving cultured cell and mouse xenografts to a new cancer model (HCC) and nucleoporin (Ahctf1). Whereas the authors describe multiple aberrant cellular phenotypes associated with the dosage reduction in ahctf1, the exact causes for reduction in liver size in the krasG12V model remain unclear. Although it would be desirable to parse effects of Ahctf1 related to NPC number, aberrant mitoses, licensing of DNA replication and chromatin regulation, this is a tall order at present, given the limited understanding of Ahctf1. However, useful insight on these and related questions could be gained with further analysis of the system as outlined below.

      We are pleased this reviewer thinks this is an interesting study that bolsters the notion that reduction in the levels of discrete nucleoporins (and/or inhibiting specific nuclear transport pathways) can result in cancer cell-selective killing. This reviewer also suggests that useful insight on these and related questions could be gained with further analysis of the system as outlined below:

      1) In the krasG12V model, it would be helpful to distinguish the contribution of increased cell death vs decreased cell proliferation to the change in liver size seen with heterozygous ahctf1. Is this predominantly due to decreased proliferation?

      We think this question is difficult to address, because the relative contributions of the two processes may vary with time. Our data show definitively that by 7 dpf, the impact of ahctf1 heterozygous mutation has disrupted multiple cellular processes, leading to a 40% increase in the number of hepatocytes expressing Annexin 5 (dying cells), and a 40% decrease in the number of hepatocytes incorporating EdU over a 2 h incubation (fewer cells in S-phase). Both responses are likely to contribute to the reduction in liver volume observed in response to ahctf1 heterozygosity. It is worth stating that in our experiments, we captured snapshots of apoptosis and DNA replication in the livers of larvae at 7 days post-fertilisation after 5d of dox treatment/KrasG12V expression. To answer the Reviewer’s question properly, we would need to monitor the behaviour of individual cells over time. If such experiments were technically possible, we think that some cells that undergo growth arrest in response to dox treatment might ultimately succumb to apoptosis (unless dox treatment is withdrawn) while other cells might enter into a state of prolonged senescence. However, given the technical challenges, we did not attempt to test this in the current manuscript.

      2) It would be good to know whether the heterozygous ahctf1 state blunts the overall level of Ras activity in krasG12V animals.

      We have addressed this interesting question thoroughly in new Fig. 1g, h. To do this, we used a commercial RAS-RBD pulldown kit followed by western blot analysis to determine the levels of activated GTP-bound Kras protein. Our results demonstrate that the levels of GTP-bound Kras protein, expressed as a proportion of total Kras protein, do not change in response to ahctf1 heterozygosity. We conclude from these data that the potentially therapeutic value of reduced ahctf1 expression in a cancer setting is not caused by inhibiting Kras activity.

      3) Notwithstanding the analysis of Tp53 target genes presented in this study, it would be helpful to see detailed transcriptional profiling of hepatocytes in the krasG12V model with the heterozygous ahctf1 mutation, and to assess the effects of Selinexor. GSEA type analysis offers a way to start untangling the effects of these pathways. Moreover this analysis could provide insight on the relevance of this model to human HCC.

      We used RNAseq to address the relevance of our larval model to human HCC. Specifically, we performed differential gene expression analysis to identify up- and downregulated genes in cohorts of ahctf1+/+ (WT) larvae versus dox-treated ahctf1+/+(WT);krasG12V larvae. We used gene set enrichment analysis to compare these differentially regulated transcripts with the gene expression signature of 369 patient samples in the Liver hepatocellular carcinoma (LIHC) dataset versus healthy liver samples in the TCGA. These analyses revealed a significant association between the patterns of gene expression in our larval model of zebrafish HCC and those of human HCC (Fig. 1-figure supplement 1c, d).

      The genetic experiments we report in Figures 4, 5, 6 show that WT Tp53 is required for the reductions in liver enlargement (Fig. 4), apoptosis (Fig. 5) and DNA replication (Fig. 6) that occurs in response to ahctf1 heterozygosity in dox-treated krasG12V larvae. We also used RT-qPCR to show that a Tp53-mediated transcriptional program was activated in these ahctf1 heterozygous livers (Fig. 5). Similarly, in adult livers, ahctf1 heterozygosity triggered the upregulation of Tp53 target genes, including pro-apoptotic genes (pmaip1, bbc3, bim and bax) and cell cycle arrest genes (cdkn1a and ccng1) (new Fig. 6-figure supplement 1). These results show that to obtain the full potential of ahctf1 heterozygosity in reducing growth and survival of KrasG12V-expressing hyperplastic hepatocytes requires activation of WT Tp53. This is an important conclusion from our paper that is likely to be relevant in a clinical setting, for instance in patient selection, if ELYS inhibitors are developed for the treatment of HCC in which the KRAS/MAPK pathway is activated.

      Also, one reviewer mentions performing genome-wide transcriptional profiling of hepatocytes in the krasG12V model in response to ahctf1 heterozygosity and the presence and absence of Selinexor treatment. While these are potentially interesting experiments, they are substantial in nature and not crucial for the main messages of our paper. Therefore, we respectively contend that they are beyond the scope of the current manuscript.

      4) Functions of Achtf1 in regard to chromatin regulation could be compromised in this model. Scholz et al (Nat Gen 2019) report that Ahctf1 is involved in increasing Myc expression via gene gating mechanism. It would be good to know what the effects are in this system.

      The Scholz, 2019 and Gondor, 2022 papers from the same group, are very interesting in that they demonstrate a novel role for the ELYS protein in addition to the ones we pursued in our paper. The authors showed that in HCT116 cells, a human colorectal cancer cell line in which proliferation is driven by aberrant WNT/CTNNB1 signalling, the longevity of nascent MYC mRNA was increased by accelerating its movement from the nucleus to the cytoplasm, thereby preventing its degradation by nuclear surveillance mechanisms. The authors showed that siRNA knockdown of AHCTF1 in HCT-116 cells reduced the rate of nuclear export of MYC transcripts without changing the transcriptional rate of the MYC gene. They proposed a mechanism that depended on the formation of a complex chromatin architecture comprising transcriptionally active MYC and CCAT1 alleles plus proteins including β-Catenin, CTCF and ELYS. Together these interacting components guided nascent MYC mRNA molecules to nuclear pores, enhanced their export to the cytoplasm to be translated, resulting in activation of a MYC transcriptional program that induced expression of pro-proliferation genes.

      In theory, this role of ELYS in protecting MYC from nuclear degradation might extrapolate to other cancer settings where MYC expression is elevated. While interplay between MYC and mutant KRAS to enhance cancer growth has been previously reported, to date, most emphasis on this interaction has focused on the role of mutant KRAS in increasing the stability of the MYC protein, for example via RAS effector protein kinases (ERK1/2 and ERK5) that stabilise MYC by phosphorylation at S62 (Farrell and Sears, 2014: https://doi.org/10.1101/cshperspect.a014365) (Vaseva and Blake 2018: DOI:https://doi.org/10.1016/j.ccell.2018.10.001). While we appreciate the novelty of the recent papers, the current findings are limited to -Catenin activated HCT-116 cells and may not be relevant to our zebrafish model of mutant Kras-driven HCC. Accordingly, we have not allocated a high priority to following this up in our current manuscript.

      6) The synthetic lethality argument pressed in this manuscript seems exaggerated. Standard anti-cancer treatments typically target several cellular pathways, and nucleoporins directly affect a multiplicity of pathways besides nuclear transport.

      While we do not disagree that standard anti-cancer treatments may target several cellular pathways, we believe our data are consistent with the accepted definition of a synthetic lethal interaction whereby single mutations in two separate genes (kras and ahctf1) cooperate to cause cell death, whereas cells harbouring just one of these mutations are spared.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      The manuscript reports effects on brood size, lifespan and healthspan upon manipulation of C. elegans genes encoding RagA, TOR and Pol III orthologs, as well as other well-characterized lifespan-affecting genes. The results point to complex relationships among TOR and Pol III that are not fully resolved, suggest a role for rpc-1 Pol III that is additive with well-characterized lifespan pathways, indicate a late-life requirement for rpc-1 Pol III to limit lifespan, and, contrary to a previous publication, suggest a muscle requirement for rpc-1 Pol III for lifespan limitation.

      Major comments regarding key conclusions:

      The work demonstrates that brood size is reduced upon rpc-1 Pol III RNAi feeding from the L4 stage. However, no further analysis is provided to show how later aspects of reproduction impair brood. Minimally, ruling out effects on spermatogenesis would be important since sperm number limits self-fertile brood size. It is also unclear from the methods whether the brood size results include embryonic lethality (post-reproduction). Internal hatching, if it occurred, could also affect interpretation of the results. A change in the reproductive period should be noted if it occurred.

      The reviewer is correct that it is important to address the role of Pol III more thoroughly in relation to reproduction.

      • The brood size experiments we present simply record the number of hatched progeny. To develop this analysis further we will present the age-specific fecundity data that we generated whilst doing these assays to demonstrate the impact of Pol III on the reproductive period. In addition, we will quantify and present data on the total brood size (dead eggs and hatched progeny) to address whether Pol III also impact embryonic development.
      • At 25oC (the temperature that we did these experiments) very few animals suffered internal hatching and those that did were taken out of the analysis – therefore this is unlikely to skew the results.
      • The question as to whether Pol III limits egg or sperm function (or later developmental roles) is also interesting and is not yet addressed. To examine this we will: Quantify brood size (dead eggs and hatched progeny) in elegans +/- Pol III RNAi that have been exposed to males during the reproductive period compared to those that reproduce solely as hermaphrodites.

      The authors claim that, similar to the relationship previously concluded from aging studies, rpc-1 acts downstream of TORC1. However, this claim is not well supported. In an effort to circumvent early lethality caused by loss of let-363 ("CeTOR"), they use a mutation in raga-1 RagA and demonstrate a further reduction in brood with rpc-1 RNAi. If raga-1(ok386) were a null this result would demonstrate a relationship that is at least partially parallel, not linear. By contrast, double RNAi with let-363 was "non-additive", suggesting a more linear relationship. However, interpretation of these experiments requires (1) that the raga-1 mutation is null and affects only TORC1 signaling, (2) evidence that the double RNAi worked well (e.g., qPCR; see Ahringer et al. 2006 review regarding issues with multi-RNAi), and (3) failure to consider alternative effects of loss of let-363 (e.g., TORC2). Negative results with RNAi are particularly problematic in the absence of convincing evidence that the RNAi worked well. Moreover, results in Figure 1G are difficult to interpret since the initial values are low. Here and elsewhere the genetics descriptions are unconventional, hampering interpretation. For example, what is meant by a mutation being "incomplete"? That it acts as a hypomorph?

      We understand the concerns of the reviewer:

      For reference, this has been used in several other studies, e.g. doi.org/10.7554/eLife.49158

      • We agree that double RNAi can be challenging. Appropriate controls were used here e.g. each RNAi diluted 50:50 with control RNAi in the single treatments and phenotypes were observed in each case (either brood size or lifespan). However, to address the precise knockdown of rpc-1 and let-363 obtained with RNAi we will perform qPCR in response to single and double RNAi treatment (both in WT and raga-1 mutant elegans).
      • In addition, we will attempt to measure S6Kinase phosphorylation, a downstream readout of TORC1 signalling in response to raga-1 mutation or let-363 RNAi treatment with and without rpc-1 A phosphor S6 Kinase antibody is commercially available and has been used successfully in C. elegans - doi.org/10.7554/eLife.31268
      • Our apologies that the nomenclature was confusing. The CeTOR RNAi nomenclature was ’borrowed’ from other papers describing this tool e.g. org/10.7554/eLife.31268 and doi: 10.1371/journal.pgen.1000972. Here, to make our work clearer, we will change ceTOR to let-363 TOR RNAi and raga-1 to raga-1 RagA in the manuscript – as suggested by the reviewer (see below). The description of ‘incomplete’ mutations will also be amended, and informed by our proposed qPCR analysis.

      Another claim is that rpc-1 Pol III limits adult lifespan downstream of TOR. These results are not convincing. The two treatments (raga-1 mutation as "embryonic" and L4 stage "CeTOR" let-363 RNAi as late) are not directly comparable for reasons noted above, and the double RNAi problem hampers interpretation.

      Our lifespan data points out that the longevity increase upon Pol III knockdown is additive with TOR/let-363, suggesting a mechanism independent of TOR. Indeed, due to lack of ideal reagents, we were forced to try the double RNAi knockdown approach for TOR/let-363 and Pol III/ rpc-1. To make the data interpretation easier, and rule out the possibility of confounding background RNAi to the maximum possible extent, we have included appropriate RNAi controls. Wherever double RNAi has been used, the effect on the phenotype by 50% dilution of target RNAi with empty-vector control, has also been shown independently and used for the statistical comparison with combinatorial RNAi. Our results have shown that diluting let-363 RNAi and rpc-1 RNAi both to 50%, is enough to impart lifespan increase when initiated from L4 stage.

      The nomenclature might be easier to follow if the authors state the actual C. elegans genes manipulated (e.g., let-363 TOR versus raga-1 RagA) rather than using "CeTOR" as a catch-all since these genes are not identical in action.

      Thank you for this suggestion. We will implement this in the manuscript where appropriate.

      Based on genetic interactions (rsks-1, ife-2, ppp-1, daf-2 and germline loss) they show that rpc-1 RNAi further extends the long lifespan conferred by each of the mutant alleles tested, as well as germline loss induced by two different mutant conditions. These results, though negative, are important. The statement that rpc-1 does not affect global protein synthesis is somewhat overstated without additional experimental support.

      We thank the reviewer for supporting our inclusion of ‘negative data’. We agree that our statement relating to protein synthesis is overstated given the data presented. We will soften this to: “rpc-1 does not seem to affect the lifespan incurred by reducing global protein synthesis, although this does not rule out the possibility that Pol III affect protein synthesis by other mechanisms”.

      Extending and challenging their own previous work showing an intestinal focus of activity for rpc-1 in limiting longevity (Filer et al., 2017), and noting that RPC-1::GFP detection can be knocked down by RNAi in several tissues, they use a tissue restricted rde-1 expression approach (or sid-1 for neurons) to test the contribution of intestine, hypodermis, neurons, muscle and germline. This new analysis points to a role for the muscle. This result is intriguing and warrants further experiments. To shore up tissue-specific claims the authors could consider (1) additional drivers for intestine and muscle rde-1 in the RNAi experiments, or, ideally, a different approach such as tissue-specific protein degradation (again with multiple drivers), (2) a sufficiency experiment for muscle (wild-type muscle expression in the mutant to demonstrate reversal of the phenotype, or rescue of RNAi defects with an RNAi-insensitive reagent expressed in muscle).

      Thank for you appreciating the work we have done here and suggesting further experiments. To take your points one at a time: (1) We have already used the most robust tissue-specific alleles generated and reported in the C. elegans literature so far. It would be a significant amount of work to generate new rde-1 driven tissue specific alleles to double check the Pol III levels/ rpc-1 knockdown response in certain tissues, and we feel this is beyond the scope of this project. Suggestion (2) is interesting and would require us to generate a muscle specific rpc-1 strain. However, there are issues with this approach. Firstly, it would require that we have a rpc-1 mutant to rescue – which we don’t as it is embryonically lethal and secondly it would not be possible to do this experiment using RNAi as the RNAi would then knock down the muscle construct.

      The possible explanation for the differences in rde-1 results from the previous work should not be buried in the legends of Figure 3 and Figure S3. Perhaps this leaky background hypothesis should be directly tested (e.g., using the RPC-1::GFP to examine whether residual expression exists in ne219 but not in ne300)? In any case, legend to Figure S3 needs editing: The ne219 background is not itself "intestine-specific", as implied, and the last sentence of Figure S3 legend should be "Thus, the rde-1(ne219)...", right?

      The differences between the different tissue-specific strains is interesting. On reflection we agree with the reviewer that it should be included in the main text. We will describe the differences between the two rde-1 alleles ne219 and ne300 in the appropriate section in the manuscript and state our results.

      Finally, they show that late-adult rpc-1 RNAi extends lifespan over control RNAi and that, by several movement assays, healthspan is improved upon L4 rpc-1 RNAi, even when RNAi is active in muscle (based on WM118).<br /> The most significant new results are that rpc-1(RNAi) affects brood size, can extend lifespan (though modestly) after day 5 of adulthood, and that muscle may be involved rather than intestine.

      Additional comments:

      Text throughout should clarify TOR vs presumed TORC1. Methods are insufficient. Important aspects of the lifespan methods and raw data are missing - e.g. exact numbers of worms censored. Exact information regarding statistical analysis is lacking (e.g., which tests, corrections for multiple testing). References should be given for all strains. For the rde-1 strains, it would be helpful to include, in addition to the transgene alleles, the actual promoters used to claim tissue specificity. Note, worms do not have "skeletal" muscle, as implied in the discussion. Figure 5 was not helpful for this reviewer. Figure legend to S3A is confusing: the intestinal signal appears stronger or at least equal, not weaker, in the rpc-1 RNAi background. Were these images collected using the exact same exposure settings?

      To address this we will:

      • Standardise genetic notation throughout the manuscript (see specific comments above)
      • Provide more detail on the transgenic alleles used e.g. promoters driving rde-1.
      • The majority of strains were obtained from the CGC but wherever appropriate we will also supply a reference.
      • Expand and revise Material and Methods section to appropriately describe all the statistical analyses performed.
      • Revise lifespan methods to include censoring detail and lifespan Tables to include information on censored animals.
      • Remove the reference to ‘skeletal muscle’ and replace with ‘body wall muscle’.
      • Once we have generated new data on the specific knockdowns and downstream targets achieved with let-363 TOR RNAi and raga-1 RagA mutation, as well as on the brood size/dead eggs effects, we will incorporate this information into Fig. 5A for better clarity and readability.
      • We can see on reflection that Figure S3A is confusing, mainly due to the gut autofluorescence in both the control and rpc-1 RNAi conditions. We will amend this figure to make this clear and include a selection of close up images of each tissue to make it easier to see the tissue specific knockdown by RNAi.

      Reviewer #1 (Significance):

      See above. Study will be of interest to aging community.

      Reviewer #2 (Evidence, reproducibility and clarity):

      The study by Malik and Silva et al describes results of the study investigating the role of RNA Polymerase III in regulating fecundity and lifespan in C. elegans. The authors show that knockdown of Pol III, similar to mTOR suppression, is detrimental for reproduction. Likewise, suppression of either Pol III or mTOR in adult animals extends lifespan via apparently the same pathway. In contrast, Pol III knockdown has an additive effect on lifespan in combination with other established genetic lifespan-extending approaches suggesting that they are working via different mechanisms. Furthermore, using the tissue-specific knockdown of Pol III the authors found that suppression Pol III expression is the muscle, but not other major worm tissues, is sufficient for its lifespan extending effect. Finally, the lifespan extension is also observed when Pol III knockdown is initiated late in adulthood. The overall conclusion is that suppression of Pol III expression late in animal life, particularly in the muscle, is a potential strategy to extend life- and health-span. Overall, the study is well-designed, the tools and results are robust and analysed appropriately. The data presentation is excellent, and the manuscript is clearly written. Addressing the points below will help to improve the clarity further.

      We thank the reviewer for their very positive response to our study and are pleased that they found the data convincing. We are extremely pleased that the reviewer agrees with the design and tools used in this study. We can address all of the review’s comments – as discussed below.

      Major:

      Significant amount of GFP signal is still present in RNAi treated animals, what is the tissue that maintains particularly high levels of expression (Fig. 3A) and how does it affect the conclusions? What is the level of Pol III reduction in different tissues? Could different efficiency of knockdown explain the tissue-specific effect of Pol III downregulation on lifespan? It would be important to show (and, if possible, to quantify) the knockdown efficiency in different tissues using the available reporter

      • This experiment had originally been done to test the efficiency of the RNAi, particularly in tissues where rpc-1 RNAi did not impact lifespan. The reviewer is right though, and this information could be analysed further to enhance our study. Figure 3A shows C. elegans expressing the rpc-1::3xflag::gfp reporter. This was used to a) determine the expression pattern of RPC-1 and b) determine the effect of rpc-1 RNAi on this. We noted that RPC-1::GFP is expressed a wide number of tissues and when the reporter strain is treated with rpc-1 RNAi, it is decreased in all tissues. The ‘green’ observed in the RNAi treatment is unfortunately attributable to autofluorescence generated by lysozymes in the C. elegans intestine and masks some of the effects we saw by eye.
      • To establish the tissue-specific efficiency of Pol III knockdown and also address the confounding issue of the autofluorescence we will now use a combination of quantitative and qualitative fluorescent microscopy to measure the percentage RPC-1::GFP knockdown in each tissue relevant to this study.

      Minor:<br /> Fig. S3B is not cited in the text and the legend for the figure is somewhat confusing, potentially containing errors, this needs to be clarified.

      We thank the reviewers for pointing this out. The legend for this figure will be re-written as a result of the analysis described above and we will cite it in the main text.

      Reviewer #2 (Significance):

      This is the first thorough study of Pol III knockdown as a lifespan extending strategy in C. elegans. In addition to the different laboratory model (previous study of Pol III in ageing primarily focused on Drosophila), this manuscript also offers several novel insights into consequences of Pol III perturbation at phenotypic, as well as mechanistic level in terms of interaction with other longevity pathways. The study will be of interest to those interested in processes underlying longevity and ageing. Considering that this topic is currently in fashion the publication will probably attract attention of not only specialist but also general public.

      We are extremely pleased that the reviewer shares our enthusiasm for this study and that they find the experimental evidence compelling.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary: The paper by Yasir Malik et al investigates the genetic interrelationship between TOR signalling and Pol III expression regarding fecundity and longevity in C. elegans. Based on a previous study that defined a role of Pol III downstream of TOR in longevity across various species, this study looks particularly at the relative timing and tissue requirements for TOR and Pol III inhibition in longevity. Data indicate that Pol III acts downstream of TOR in regulating fecundity while there are additive effects regarding survival. The Pol III effect on longevity is based on its role in the muscle. Finally, health-span parameters mirror the survival data.

      Major comments: This is a nice study the relies on genetic interaction to ask how TOR and Pol III interact. I find the observation that Pol III inhibition extends survival when initiated at day 5 of adulthood very exciting. In general, the study would benefit from additional data that back up the genetic observations._We thank the reviewer for appreciating the study and the novel insights it provides about the TOR-Pol III inter-relationship. We can address reviewer’s comments with the a few, limited experiments. Discussed below.

      In Fig. 1, experiments are done to inhibit TOR to varying degrees in order to perform epistasis experiment. Of course these are difficult to interpret without the use of full KOs/loss of function. So while this is a good solution, it would be important to quantify the level to which TOR signalling is inhibited, optimally with biochemical experiments. We fully appreciate the reviewer’s point. A similar concern was raised by reviewer 1. We propose to address this in two ways: 1) by quantifying mRNA levels by qPCR of let-363 in response to either let-363 TOR RNAi; and 2) by determining the extend of TORC1 activity by using a biochemical readout of the pathway’s activity – S6 Kinase phosphorylation using Western blotting as described here: doi.org/10.7554/eLife.31268 2. General brood size is very low in the WT worms. Normally, one would expect 250-300 offspring per adult worm. It would be helpful if the authors could address this.

      Indeed, as pointed out by the reviewer, the WT worms have a brood size of 250-300 eggs when kept at 20oC. but C. elegans exhibit different brood sizes dependent on temperature and these decline in size with increasing temperature. The experiments shown here were carried out at 25oC, where C. elegans produce less offspring. Our observation is in agreement with other studies of similar nature e.g. doi:10.1371/journal.pone.0112377 and doi.org/10.1371/journal.pone.0145925

      1. Why were lifespan assays performed at 25C? The standard temperature for the worm is 20C and here I think this is very relevant as the TOR pathway is responsive to suboptimal conditions. I wonder if the results are also true for lower temperatures.

      The reviewer raises an interesting point. This study follows from the previous study of Filer et al., Nature 2017 which demonstrated the role of Pol III in ageing. During this study we found and reported that there was a high proportion of intestinal bursting when lifespans were carried out at 20oC, which was ameliorated by carrying out the experiments at 25oC. This was quantified in the original manuscript. To maintain consistency, we continued carrying out Pol III lifespans at this slightly higher temperature. Due to this limitation it is not possible to test the impact of TOR signalling on Pol III at lower temperatures.

      Minor comments: 1. It would help to better delineate the rationale for the experiments in Fig. S1. Experiments here are aimed to find mediators of TOR effects distinct from Pol III. Such distinct mediators would be additive to Pol III (as is the case in the figure) and downstream of TOR.

      Interpreting epistasis analysis is challenging. We were looking for interactors of Pol III using this targeted genetic approach and working on the premise that if two genes interacted then their effects would be non-additive. However, the reviewer is correct that if two genes are doing the same thing independently then their effects may be additive. Although our data does not suggest these mediators interacting with Pol III in the same pathway, it does not rule out the other possibility. When we re-work the manuscript we will explain our rational more clearly and outline the two scenarios.

      Reviewer #3 (Significance):

      Strengths: The study advances our knowledge regarding the timing of the Pol III targeting intervention for survival effects.<br /> Limitations: The study relies only on genetic data and not all of it is conclusive.

      This study will be interesting for the geroscience community with an eye on TOR inhibition and is relevant to worm biology. I work with C. elegans as a genetic model and I am interested in protein homeostasis, metabolism, health, and longevity.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01480

      Corresponding author(s): Ananda, Sarkar

      1. General Statements

      We are thankful to Review commons platform that helped our manuscript critically reviewed with very constructive and valuable feedback. This gave us the opportunity to do the experiments accordingly and significantly improve the manuscript. We are hopeful that this platform will help our manuscript get published in a journal of repute.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript entitled "LDL1 and LDL2 histone demethylases interact with FVE to regulate flowering in Arabidopsis" characterized that LDL1 regulates flowering by binding on the chromatin of MAF4 and MAF5 to repress their expression. Further the authors proposed LDL1/LDL2-FVE model. Here are some comments for this manuscript.

      Major problems: 1. This experiment is still not testing or showing/concluding that the whole complex forms on the MAF4 and MAF5

      Response: We understand reviewer’s concern regarding the complex. Previously FVE was shown to be a part of co-repressor complex including HDA6, HDA5 and FLD to regulate the expression of FLC and its clade members during floral transition1-3 We showed that LDL1 binds directly to the chromatin of MAF4 and MAF5 to suppress their expression (Figure 1 and 2). Furthermore, we discovered that LDL1 and LDL2 interact with FVE to influence floral transition (Figure 8 and 9). Hung et al., 2018 reported the interaction of LDL1 and LDL2 with HDA6 to regulate circadian rhythm4 and we found that the expression of MAF4 and MAF5 was upregulated in ldl1ldl2hda6 than ldl1ldl2 (Figure 5C and 5D). Therefore, our experimental data, together with previously reported data makes it evident that LDL1 and LDL2 are a part of co-repressor complex through their interaction with FVE and HDA6, which we concluded here. We agree with the reviewer that an additional experiment, such as complex pull-down, will be helpful, but in our opinion, it will only provide additional confirmatory evidence.

      2.It is not shown LDL1/LDL2 repress MAF4 and MAF5 by removing H3K4me2 activity. It would be useful to test whether the methylation level of MAF4 and MAF5 has been altered in ldl1/ldl2 mutant

      Response: We found altered methylation level in MAF4 and MAF5 chromatin during floral transition in ldl1 and ldl1ldl2 mutants (Figure 6 and 7). We observed that the absence of LDL1, or both LDL1 and LDL2 disturbs the shift in H3K4 methylation status on MAF4 and MAF5 during floral transition and ends up in a more active (enriched in H3K4me3 marks) chromatin state at 19 days. This result, taken together with the increased MAF4 and MAF5 expression in ldl1 and ldl1ldl2 double mutants (Figure 5C and 5D) indicates that LDL1/LDL2 repress MAF4 and MAF5 by altering H3K4 methylation.

      3.I suggest that further research is required to provide conclusive evidence concerning the physiology function of LDL1/LDL2-FVE. Such as the expression pattern of LDL1/LDL2, the methylation level of MAF4 and MAF5 before or after floral transition

      Response: Taking this suggestion into account, we performed quantification of rosette leaves and flowering time of fvec, ldlfvec and ldl2fvec along with WT, ldl1 and ldl2 (Figure 9). We also observed decreased expression of floral activator genes, FT and SOC1 (targets of MAF4 and MAF5) in fvec, ldlfvec and ldl2fvec in comparison to the WT (Supplementary Figure 10C), which corresponds to their late flowering phenotype.

      To understand the role of LDL1and LDL2 during floral transition, we first analyzed the expression of LDL1 and LDL2 during floral transition (Supplementary Figure 8). We observed that the expression of LDL1 and LDL2 expression peaks at 16 days and gets stabilized till 19 days. Then we checked the enrichment of H3K4me1, H3K4me2 and H3K4me3 on MAF4 and MAF5 chromatin in ldl1 and ldl1ldl2 plants with respect to the WT at 16 days (before floral transition) and 19 days (after floral transition). We found an increase in the conversion of H3K4me1 to H3K4me3, when LDL1 and LDL2 were not present (Figure 6 and 7).

      Reviewer #1 (Significance (Required)):

      The manuscript provide some evidences how LDL1 involve in flowering through epigenetic regulation.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Mahima and colleagues investigated LDL1/LDL2-MAF4/MAF5 in Arabidopsis flowering time control. The manuscript contains some interesting observations. To my point of view, however, the data need to be consolidated to support conclusions drawn in the manuscript.

      • Title: it does not correctly reflect the manuscript content. Data in relation with FVE were limited to Fig 6, where the data themselves appear preliminary.

      Response: We agree with the reviewers that our title didn’t reflect the manuscript content precisely and are happy to take this criticism into consideration. We have revised the title to, “LDL1 and LDL2 affect the dynamics of H3K4 methylation on the chromatin of MAF4 and MAF5 to allow floral transition in Arabidopsis”. Additionally, have provided the quantification data for fvec, ldlfvec and ldl2fvec with respect to WT, ldl1 and ldl2 plants (Figure 9)

      • Abstract: most conclusions are over-stated. The current data shown in the manuscript cannot support such strong conclusions.

      Response: We have rigorously revised the abstract and toned down the overstated conclusions

      • Introduction: It is necessary to make clear that the role of the LDL1 and LDL2 genes in flowering time control had been well established in previous studies, including their repression of transcription of FLC, MAF4 and MAF5 (Berr et al., 2015, Plant J 81:316).

      Response: We have revised the introduction to include the previously known roles of LDL1 and LDL2 in regulating flowering time.

      • Results:

      Regarding LDL1-overexpression lines, 'Relative expression' in Supplementary Fig 2B referred to normalization to WT? The phenotype of plants needs to be shown.

      Response: Yes, the level of upregulation of LDL1 expression in different T1 plants (after selection from Hygromycin) was calculated with respect to the WT.

      Regarding flowering time, have the observation and measures been performed in the same experiments for the ldl1, ldl1 flc, ldl1 maf4 and ldl1 maf5 mutants (Fig 3 and Supplementary Fig 1)? The late-flowering phenotype of ldl1 shown in Fig 3D-F is much severe than the same mutant shown in the other Figs, any explanation? What's the interpretation that ldl1 is epistatic to flc, maf4 and maf5?

      Response: We agree with the reviewer’s observation which is correct. The following quantifications were taken at various points during the study:

      flc, ldl1, and ldlflc (Supplementary Figure 1)

      WT, ldl1, and ldl1maf4 (Figure 3A, 3B and 3C)

      WT, ldl1, and ldl1maf5 (Figure 3D, 3E and 3F)

      The rosette leaf numbers and flowering time of the plants in Figure 3D-3F are more severe than the others because seeds were directly sprinkled onto the soil in this phenotyping, whereas in previous phenotypings, plants were grown on 1/2MS plates before being transferred to soil. However, all the components of a single experiment were grown in the same condition. We appreciate your observation, the present data does suggest ldl1 being epistatic to flc, maf4 and maf5.

      The in vitro test of LDL1 for its enzyme activity (Fig 4) appears preliminary and fragmented. The quantification data in Fig 4C-D need repeats. Have other histone methylation types (e.g. H3K4me3, H3K27me3, H3K36me3) been tested? The only two types (H3K4me2 and H3K9me2) shown are both down-regulated by LDL1-GST. Can H3K9 demethylation also play a role in flowering time control? In any case, the current in vitro data only are not sufficient to draw the strong conclusions as those appeared in the manuscripts.

      Response: Before concluding that LDL1 has H3Kme2 and H3K9me2 demethylase activity, we confirmed it several times__. __Please refer to the PDF file for “response to reviewers” for supporting data.

      We analyzed the western band intensity by calculating the area under the curve with imageJ software, which varies between experiments depending on the band intensities, therefore, rather than plotting absolute values of band intensity, we plotted the ratio of LDL1-GST/GST from three independent experiments in Figure 4B. We did perform a preliminary experiment to see if LDL1 has demethylation activity against different methylation marks, such as H3k4me1, me3, H3K9me1, and me3 (1=GST, 2=LDL1-GST), but there was no significant change in the methylation marks in the presence of LDL1. Please refer to the PDF file for “response to reviewers” for supporting data.

      H3K9 is a repressive chromatin mark, and its removal would suggest gene activation. Upregulation of FLC, MAF4, and MAF5 in ldl1 and ldl2 mutant suggests LDL1 and LDL2 removes H3k4me2 methylation marks during flowering. However, JMJ28, Jumonji C (JmjC) domain-containing histone demethylase have been shown to positively regulate flowering by removing repressive H3K9me2 marks from the chromatin marks from the chromatin of CONSTANS (CO)5.

      In the manuscript, it is saying that LDL1 binds on the chromatin of MAF4 and MAF5. However, I cannot find any data shown to support this conclusion.

      Response: We would like to refer to Figure 2A and B where we have provided this information.

      Protein-protein interactions, e.g. LDL1/LDL2-FVE in Fig 6A and LDL1-LDL2 and LDL1-HDA5 in Supplementary Fig 5, are examined in yeast two-hybrid assay. Other independent assays would be required.

      Response: We have confirmed the interaction of LDL1 and LDL2 with FVE using co-immunoprecipitation assay (Figure 8B). Since Co-IP is a confirmatory experiment, we have done it for positive interactions found through Y2H only. Moreover, in the current manuscript our focus has not been on HDA5, so we didn’t proceed with further experiments.

      The study of genetic interaction between fve and ldl1/ldl2 (Fig 6B-D) looks very preliminary. It is unclear how ldl1 fve and ldl2 fve were obtained: by crosses or by CRISPR-Cas9 using ldl1 and ldl2? The phenotypes need more investigations and some molecular data regarding flowering regulatory genes (e.g. MAF4/5) are necessary. In any case, the current title and the related conclusions drawn in the manuscript are over-stated.

      Response: We performed the quantification of the genetic interaction between fve and ldl1/ldl2. The binary vector pHSE401-FVE was transformed in ldl1 and ldl2 to produce ldl1fvec and ldl2fvec, respectively. We previously mentioned it in the material methods, but we have now updated it in the results section to avoid confusion.

      Following the suggestions, we have scored the phenotype (Figure 9) and checked the expression of flowering regulatory genes (Supplementary Figure 10C).

      Fig 7 showed data about MAF5-FLC, MAF5-SVP and MAF5-MAF5 interactions in yeast two-hybrid and about transcriptional repressor activity assay in tobacco leaves using the LUC-reporter. Again, the data need to be confirmed and reproducibility of experiments need to be shown. In addition to proFT:LUC, it is also necessary to have an internal normalization reference construct. Anyway, currently it is far away to allow a strong conclusion such as drawn in the manuscript that MAF5 interacts with FLC and SVP and repress FT to delay floral transition. Response: We have confirmed the interaction of MAF5-FLC, MAF5-SVP and MAF5-MAF5 using co-immunoprecipitation (Figure 10B). We quantified the firefly luciferase activity under proFT using renilla luciferase under pro35s as an internal control and the ratio of LUC/REN represented the promoter activity of FT promoter (Figure 10C).

      Reviewer #2 (Significance (Required)):

      Topic is interesting, but data are poor to support the conculsions drawn.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      LDL1 and LDL2 histone demethylases interact with FVE to regulate flowering in Arabidopsis Summary This work study the role on flowering time of LDL1 and LDL2, two Arabidopsis homologs of the histone demethylase LSD1. Although this phenotype was previously described, the authors explore if LDL1 and LDL2 regulate other genes in addition to the floral repressor FLC. In fact, mRNA expression experiments and genetic analyse suggest that LDL1 modules flowering regulating the expression of MAF4 and MAF5, two FLC-like genes that has been less characterized. The also provide some in vitro biochemical evidence of the demethylase activity of LDL1 protein and yeast-two-hybrid data showing the interaction with FVE, another chromatin regulator involved in flowering time.

      Major comments 1. Lines 116-117. Please rephrase these lines and remove panels C, D and E from figure 1 (these could be supplementary material). The flowering time phenotype of MAF4 and MAF5 in Col background is very well documented and was described before, see Gu et al Nat. Comm., 2013 (10.1038/ncomms2947) and Kim et al. Plant Cell, 2013 (10.1105/tpc.112.104760)

      Response: As per the suggestion, we have modified the discussion and moved the panels 1C, 1D and 1E to the supplementary.

      Lines 128-130 and Fig Sup3. The proLDL1:LDL1-GUS cannot be described as fully functional because its flowering time and LDL1 mRNA expression levels has not been compared to the wild-type plant. The line flowers earlier that the ldl1 mutant but it may only partially complement the flowering phenotype.

      Response: We have provided additional experiment that the transgene is functional in proLDL1:LDL1-GUS (ldl1) with respect to the WT plants (Supplementary Figure 5A).

      Line 135 and Figure 2. How the Chip data was normalized? What are you comparting in your statistical significance tests? Only two regions of each gene were analysed; to assess the binding of LDL1 to MAF4 and MAF5 loci more regions must be analysed.

      Response: Normalization of the ChIP data and significance of enrichment of LDL1 was calculated with respect to the fold enrichment in the empty vector control (EV (ldl1)) plants. We only examined the promoter and exon1 of MAF4 and MAF5 for LDL1 enrichment because Hung et al,2019's6 study demonstrated that LDL1 is enriched on the promoter and exon1 of the downstream protein coding genes. However, to check for methylation marks during flowering, we have employed different primer sets on various positions between the promoter and exon1 on MAF4 and MAF5 chromatin.

      Figures 6C and 6D. The genetic analysis of ldl mutant with fve-c line is prelaminar and incomplete. The epistasis cannot be evaluated as no quantitative flowering time data is provided. A questionable picture of one lonely plant cannot sustain the conclusions of lines 207-208.

      Response: We have modified the picture and quantified the flowering time data to show genetic interaction of ldl1 and ldl2 with fvec mutant plants (Figure 9).

      METODS. Please clarify the used mutant alleles for LDL1 LDL2, MAF4, MAF5 and FLC; if they has been previously described; if they are full knock-outs; and, consequently, use the appropriated allele name across the manuscript.

      Response: As per the suggestion, we have clarified the different mutant alleles used in the study.

      Minor points: 6. I think the title does not describe the work - the interaction with FVE is very relevant but it is not the central theme of the article.

      Response: We have changed the title of the study to “LDL1 and LDL2 affect the dynamics of H3K4 methylation on the chromatin of MAF4 and MAF5 to allow floral transition in Arabidopsis”.

      It would be very informative to have short-day flowering tome data of the genetic combinations of ldl mutants with flc, maf4 and maf5 mutations.

      Response: We absolutely agree that elaborate SD experiment may open interesting avenue for LDL1 mediated regulation of flowering, which might be good for future studies. However, ldl1ldl2 shows late flowering, while maf4 and maf5 exhibit the early flowering phenotype irrespective of the day length7,8.

      I found the Discussion section rather too long.

      Response: We have shortened the discussion to make it more focused.

      Reviewer #3 (Significance (Required)):

      Although it is clear that LDL proteins regulate MAF4 and MAF 5. I found that the manuscript lacks of a general overview of flowering time regulation. At the end, it is not clear how LDL proteins regulate flowering time because they regulate FLC, FWA, MAF4 and MAF5: What is more important? Which is the main role of each protein? Are they reductant or do they have specialized functions? In a nut shell, this study is an interesting piece of work for the flowering time field: However, in my opinion, some of the presented data are redundant with previous works and the manuscript may not be relevant for a general audience.

      1. Yu, C.-W. et al. HISTONE DEACETYLASE6 Interacts with FLOWERING LOCUS D and Regulates Flowering in Arabidopsis. Plant Physiology 156, 173-184 (2011).
      2. Luo, M. et al. Regulation of flowering time by the histone deacetylase HDA 5 in A rabidopsis. The Plant Journal 82, 925-936 (2015).
      3. Yu, C.-W., Chang, K.-Y. & Wu, K. Genome-wide analysis of gene regulatory networks of the FVE-HDA6-FLD complex in Arabidopsis. Frontiers in plant science 7, 555 (2016).
      4. Hung, F.-Y. et al. The Arabidopsis LDL1/2-HDA6 histone modification complex is functionally associated with CCA1/LHY in regulation of circadian clock genes. Nucleic acids research 46, 10669-10681 (2018).
      5. Hung, F.-Y. et al. The Arabidopsis histone demethylase JMJ28 regulates CONSTANS by interacting with FBH transcription factors. The Plant Cell 33, 1196-1211 (2021).
      6. Hung, F.-Y. et al. The expression of long non-coding RNAs is associated with H3Ac and H3K4me2 changes regulated by the HDA6-LDL1/2 histone modification complex in Arabidopsis. NAR Genomics and Bioinformatics 2 (2020). 7 Berr, A. et al. The trx G family histone methyltransferase SET DOMAIN GROUP 26 promotes flowering via a distinctive genetic pathway. The Plant Journal 81, 316-328 (2015).

      8 Kim, D.-H. and Sibum, S. Coordination of the vernalization response through a VIN3 and

              FLC gene family regulatory network in Arabidopsis. *The Plant Cell *__25, __454-469 (2013)
      
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are grateful to the reviewers for their efforts in critically reading our work. Their meaningful input led us to make the revisions detailed below in our “point-by-point” answers to the reviewer’s comments. The insightful comments have helped us significantly improve the manuscript, allowing us to more accurately quantify and convey our data – we are thankful for that.

      Reviewer #1

      1. The Figure 1 legend indicates that the BirA tagged strains are mated with ~6000 AviTag yeast strains but results in Figure 2 pie chart account for 4812 total readouts. Presumably 1000 or more strains could not mate or did not produce viable diploids with the BirA tagged strains? It would be helpful to explain this differential. We thank Reviewer #1 for pointing out this gap which occurred exactly as they have interpreted. We have now corrected the figure legend to say exactly how many strains were in the library (5330) and have clearly stated the attrition of strains.

      If possible, suggest including more of the raw data (in supplementary) that supports the pie chart in Figure 2. Table S1 shows the 111 proteins that display preference for Ssh1 (out of 586 total interactors?) and the fold change (in rank order) for interaction preference. At a minimum, similar data on Sec61 preference and the list of positive interactors should be included. There may also be useful information in the relative biotinylation signal for each BirA and AviTag combination when significantly above background. This is presumably a readout of AviTag protein abundance, dwell time and orientation to BirA activity. The data could be useful to other investigators.

      This is a very good suggestion. We have now added a supplementary table (Supplementary Table S2) with the interaction results for proteins that preferred Sec61 and proteins that did not show any preference.

      The authors might want to be more cautious in interpreting impact of the UPR on ssh1 phenotypes in the results and discussion. The Wilkinson et al 2002 paper referenced used very different conditions to detect UPR in ssh1 deletions strains. Jonikas et al 2009 does not detect a chronic UPR in ssh1 deletion cells and the conditions used in the current study seem more similar to the 2009 report. It seems more likely that deficits in translocating/localizing specific proteins causes the observed phenotypes instead of chronic UPR due to reduced ER levels of PDI.

      *We agree that as result of the different conditions it is difficult to compare our data to the Wilkinson et al 2002 paper. We have therefore adjusted the text to remove this interpretation. *

      Reviewer #2

      1. Why was BirA used to study transient interactions? Biotinylation through BirA is slow (that is why it takes several hours to label proximity proteins) and thus it may not be suitable for capturing transient interactions. Instead, TurboID would be more suitable as the biotinylation reaction is faster than BirA. A reasonable explanation using BirA is required. We thank the reviewer for this comment which indeed also reflects our “process” of thinking. Originally, we did try to use TurboID to identify potential cargo proteins. We now have a very robust methodology to look at protein substrates by TurboID (see: https://www.biorxiv.org/content/10.1101/2022.04.27.489741v1) and so this would have obviously been the easier and faster method. However using this approach we mainly observed ribosome subunits and cytosolic proteins for Sec61 and very few, mostly cytosolic, proteins for Ssh1. Our interpretation of this is that since all biotinylation of TurboID strains occurs in parallel there is “competition” from the highly abundant and strong interactors and this does not leave a possibility to detect the low-abundance and even more transient interactions that we would like to measure. It seems that although birA/AviTag are much slower, the specificity and singular ligation site that should be exposed also in co-translational-translocation events, are more suitable for this specific experimental setup. We have now explained this also in the text.

      One key question is whether biotinylated proteins identified by this method are substrates or proteins just proximal to Sec61 or Ssh1 due to close cellular localization (e.g. ER membrane) or same protein complex members. An experiment or analysis would be required to confirm that the proteins they identified are indeed potential substrates.

      *This is indeed an extremely important point and we have now carefully addressed it in the text. We are certain that the reviewer is right and that many of the biotinylated proteins are same complex members and cytosolic components that happen to be in proximity (maybe regulators?) just as the reviewer suggested. We now clearly write this in the results section. This is why we focused on signal peptide containing proteins. These proteins CAN NOT be complex members nor biotinylated simply due to proximal location on the ER membrane. This is since they reside inside the lumen of the ER if they are soluble or are inserted (if they contain also a transmembrane domain) with their tagged N’ facing the lumen of the ER (So called Type I proteins). The only way such proteins could be biotinylated by the slow BirA on the cytosolic surface is if they passed through the pore of the translocon. *

      Along the same line, if proteins identified by this approach are bona fide substrates of Sec61 and Ssh1, proteins having signal peptides should be enriched in the candidate list of substrates. However, it does not look like that according to Figure 2A where the secretome proteins/total proteins ratio appears to be similar among the 4 categories (e.g., Ssh1 preferring, No preference, and Not interacting or excluded). The authors should comment on this.

      *We thank Reviewer #2 for highlighting this point that was not clear from our text and figures. There is definitely an enrichment of Signal Peptide (SP) containing proteins amongst the proteins that we think are bona fide substrates however this was not visualized clearly. To highlight this point we have modified Figure 2 and added a bar graph showing the distribution of SP and TMD proteins within the potential secretome. This graph now highlights the enrichment of SP containing proteins in the groups of proteins that preferred Sec61 or Ssh1 in comparison to the group that did not show a preference. *

      *We also now add a citation from a new manuscript from the Hegde lab that suggests that indeed soluble SP containing proteins are the key clients for the translocon pore (https://pubmed.ncbi.nlm.nih.gov/36261528/). We have also added a section to the discussion as to why we do not see differential enrichment of SRP or its receptor on either pore although in the past this was suggested to be the key difference between the two translocons. *

      Figures 1-2: They should comment on the reproducibility of the method. How many independent experiments were performed? If performed, how was reproducibility of results?

      Thank you for highlighting that this was not clarified enough – we have now extended the materials and methods section to make all of the above issues clear and apparent. In short, we performed 3 biological repetitions for each experiment and for each biological repeat we performed 3 technical repeats making our results altogether rely on 9 repeats. We then excluded proteins in two cases

      1. If strains were missing in either of the collections (so there was no complete set to compare them) – this caused us to drop 661 strains.
      2. In cases where the proteins were expressed at very low levels of extracted poorly in our assay – we defined this as the signal being ten standard deviations (or more) lower than the rest of the signals on the same membrane – this caused us to lose an additional 93 strains. Importantly, the SD between all 9 repeats never rose above 3 (see graph below showing al strains arranged by order in library and the SD between all 9 repeats) and also now stated clearly in the text) hence we think that our method is highly reproducible

      Figure 3: It is important to know the overlap of proteins commonly identified in both the interaction screening and protein localization assay. A Venn diagram that compares results between the two high-throughput assays would be useful.

      *We have indeed considered making this Venn diagram (See below). However, since the connection between the screens is not direct due to the fact that the protein localization is downstream to translocation events or maybe completely independent of it, we found that the number of specific proteins that are in both screens is low. However, there is a much larger overlap in joining processes and functions, therefore we decide to make the grouping showed in Figure 4B. We would prefer not to show this figure in the manuscript however we leave this to editorial decision. *

      Figure 4A (GO term): The authors mentioned that " the most consistent and repeating GO term group was those related to budding and polarity process. These include: "Establishment or maintenance of cell polarity"; "Development process involved in reproduction"; "Bipolar cellular bud site selection"; "Cell budding" and "Structural constituent of cell wall". Are protein sets in these functional categories similar or different? I am asking because GO enrichment analysis often provides apparently different functional categories but similar protein sets are included.

      Indeed, this reviewer is totally correct and this is also the case here to some extent. There is some level of overlap between the GO terms. However, in our case this overlap is quite small: Out of the 77 genes that have one of those GO terms assigned only 2 have all 4, 9 have 3 and 4 have 2 of the GO terms therefore we believe that in this case this issue with GO terms hierarchy and assignment is not redundant. We are happy to highlight this in the figure or text if this is deemed to be important.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

              We greatly appreciate the valuable comments from the referees, which have generally been very positive and constructive. The three referees have emphasized the significance of our study that opens a new direction of research regarding the role of RNA modification in viral defense. In addition, the reviewers confirm our view that the audience of our work would be broad.
      
              The major concerns of the reviewers are limited to four main points:
      
      1. i) to be clearer in our description on the effect of the m6A-YTHDF axis on the viral infectivity and avoid making assumptions on effects on replication (ref. #1 and #3);
      2. ii) reviewer 1 finds that the title and conclusion of this manuscript defining YTHDF proteins (ECTs) as "direct effectors of antiviral immunity" is misleading. Nonetheless, as detailed below, Reviewer 1 confuses mere knowledge of effects of m6A with those conferred by YTHDF proteins binding to m6A, and indeed overlooks nearly all evidence presented in the paper for how m6A in AMV confers antiviral resistance (i.e. mechanistic insight); iii) the discussion on the relative importance of antiviral RNA silencing and m6A-YTHDF against AMV;

      3. iv) to establish more clearly whether the phase separating capability of IDRs in the reading proteins correlates with the antiviral activity (reviewer 2). We have already completed substantial experimental work to address several of these points. Nonetheless, we find it prudent to ask for an extension of the revision time beyond four weeks to allow for repeats of a few of the infection experiments in question. In the following section, we specify a plan of action for the revisions.

      2. Description of the planned revisions

      • *Regarding the four major concerns raised by the reviewers, we will experimentally address the last two, whereas we think the first two do not need any further experimental work, as explained in section 4. Thus, the working plan for points #3 and #4 will be as follows:

      iii) the discussion on the relative importance of antiviral RNA silencing and m6A-YTHDF against AMV and related viruses

      As we mention in the manuscript (discussion, first chapter), AMV *“is one of only very few studied plant RNA viruses for which no anti-RNAi effector has been identified. In addition, prunus necrotic ringspot virus (PNRSV), a virus genetically and functionally closely related to AMV (Pallas et al, 2013), does not induce easily detectable siRNAs, unlike nearly all other studied plant RNA viruses (Herranz et al, 2015)”. *

      Thus, we do not come up with a strong judgment on whether RNAi is more or less important than m6A-YTHDFs for AMV resistance.

      In any case, although these indirect observations seem to be quite solid, we agree with the reviewer that conclusive evidence to discard RNAi as a defense layer against AMV, at least at the time where ECTs are acting, is lacking. Thus, we plan to evaluate how the absence of the main components of the RNAi machinery affects AMV infection and if this ‘universal’ defense layer interferes/overlaps with the ECTs antiviral defense observed here. Realistically, this will take us 8-10 weeks. The experiments within this topic are based on established and published methods and thus, on solid experience. We do not expect any fallback solution and the results will be conclusive in this sense. We also note that the very time-consuming part of constructing mutants defective in both RNAi and m6A-ECT components (in this case, ect2/ect3/rdr6), as well as a first round of infection assays has already been completed at this point

      iv) To establish more clearly whether the phase separating capability of IDRs in the reading proteins correlates with the antiviral activity (Reviewer 2).

      We agree with Reviewer 2 that this is an interesting and important question. Hence, we have teamed up with the group of Prof. Kresten Lindorff-Larsen, expert in molecular simulations of protein folding and interaction. The Lindorff-Larsen group has recently published a powerful computational approach to simulate phase separation behavior of intrinsically disordered proteins (IDPs) or regions of proteins (IDRs) (Tesei et al., 2021, Accurate model of liquid-liquid phase behavior of intrinsically disordered proteins from optimization of single-chain properties, PNAS 118, (44) e2111696118). Applying this simulation method to the Arabidopsis ECT proteins establishes two facts that we will incorporate into a revised version:

      • The IDR of ECT2 shows marked phase separation propensity, in agreement with the experimental evidence published in Arribas-Hernández et al., 2018, Plant Cell.
      • The deletion mutant of ECT2 (ΔN5) with defective antiviral activity, yet unaffected ability to accelerate growth of leaf primordia shows markedly reduced phase separation propensity driven, in the main, by the many tyrosine residues in the region deleted in the mutant. These results suggest that phase separation capability indeed correlates with antiviral activity.

      Since not only ECT2, but also ECT3, ECT5 and, to some extent, ECT4, participate in AMV resistance, we plan further simulation work on these proteins during the first two weeks of January 2023 before submission of a revised version of the manuscript.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

        • All the minor concerns raised by the three reviewers have been addressed and we have incorporated all of their suggestions in this intermediate version.

      4. Description of analyses that authors prefer not to carry out

      • *As previously mentioned, we believe that points 1 and 2 do not require an experimental approach to be addressed for the following reasons:

      i) to be clearer in our description on the effect of the m6A-YTHDF axis on the viral infectivity and avoid making assumptions on effects on replication (ref. #1 and #3)

              We agree with the reviewer that the term 'inhibition of viral replication' was not very appropriate because the idea that was intended to be conveyed was that of viral accumulation.        Hence, we will change this use of language, and we thank the reviewer for pointing out this inaccurate description.
      

      When it comes to differences between effects on infection in inoculated and non-inoculated leaves, there may be a slight misunderstanding, perhaps because we were not clear enough in our originally submitted version. In reality, there are some differences even in inoculated leaves between wild type and ect mutants, especially in the triple mutant, but the slightly higher accumulation in ect mutants is not clearly observed in every experiment and hence, does not always rise to the level of significance. Although it is possible that, at local level, ALKBH9B-mediated m6A would have other ECTs-independent effects, similar to what has been described for some animal viruses (Baquero-Pérez et al., 2021. Viruses), we think that the most likely explanation for this phenomenon is a combination of infection titers and ECT redundancy.

      The suggestion to use protoplasts is very accurate, but it would not resolve any doubt in this scenario, because ECTs are mainly expressed in mitotically active cells (Arribas-Hernández et al, 2020, 2018) and, since mature tissues make up the better part of the leaves used to isolate protoplasts, only few of the isolated cells would be useful. In addition, we previously showed that AMV accumulation is reduced in alkbh9b protoplasts compared to WT (Martínez-Pérez et al., 2021. Front. Microbiol.), which suggests that m6A levels of vRNAs are critical for the first stages of the infection, but in that case no problems with the expression pattern of the demethylase were expected.

      ii) The title and conclusion of this manuscript defined YTHDF proteins (ECTs) as "direct effectors of antiviral immunity", which is misleading. Effector molecules of an antiviral immunity cannot be identified when the effector mechanism is unknown;

      In this regard, we have a very different vision from the one the reviewer proposes. We believe that it is not correct to say that the effector molecules of an antiviral immunity cannot be identified until its mechanism is demonstrated. In fact, RNA silencing effectors were discovered long before their mechanism was elucidated in detail. One molecular interpretation of the Flor’s seminal gene-for-gene model, in terms of receptor/effector recognition, is that specific interaction between the receptor and its recognized (cognate) effector protein triggers resistance.

      Furthermore, we strongly believe that we provide enough arguments to propose a model, although, as we comment in the end of the discussion, “we view this model as a conceptual framework of value in the design of future experiments to test its validity”. The reasoning that we show here is the following:

      1. The m6A binding proteins are necessary for the antiviral response.
      2. At least ECT2 recognizes AMV RNAs in vivo and that its m6A-binding capacity is necessary to play a role in AMV infection.
      3. Simply losing methylase activity – with the same developmental defects as ect2/3/ – does not lead to the same degree of loss of resistance, and you can affect AMV resistance without affecting developmental functions of ECT2. Altogether, these observations justify the proposal that m6A exerts antiviral effects by acting as binding sites of ECT proteins in viral RNA, which we consider a clear mechanistic advance.

      Bearing in mind that m6A-modified vRNAs might concentrate in replication complexes and that MeRIP-seq methodology to map m6A revealed site multiplicity in the genome of some RNA viruses (Gokhale et al., 2016. Cell Host&Microb; Martínez-Pérez et al., 2017; Lichinchi et al., 2016. Nat Microbiol; Lichinchi et al., 2016. Cell Host&Microb; Marquez-Molins et al, 2022), our results recalled the previously proposed model in which m6A sites multiplicity causes the phase separation of these RNAs through the interaction of the IDRs of the YTH proteins (Ries et al, 2019; Fu & Zhuang, 2020; Gao et al, 2019). Now, with the new simulations of phase separation behavior, although still a model that requires further experimental tests, we have better evidence to support the model that it is related to LLPS of ECT-bound viral RNA. Therefore, we firmly believe that our title conceptually reflects the basic concepts of resistance induction in virus-plant interactions.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1

      Although this is an interesting, and generally well-performed study, it is primarily observational and there are few mechanistic insights provided into how MUC13 modulates barrier function. The authors propose a presumably direct interaction between MUC13 and PKC, which apparently sequesters PKC, preventing this kinase from triggering PKC-dependent increases in TJ barrier function; however, there is no evidence that a MUC13-PKC interaction occurs, that MUC13 is phosphorylated by PKC, or that phosphorylation of MUC13 has any impact on its function or overall barrier function. Thus, the hypothesis is not directly tested and all observations in this manuscript are generally correlative in nature.

      While the MUC13 cytoplasmic tail contains a putative PKC-binding motif, we indeed do not show a direct interaction between MUC13 and a member of the PKC family in this manuscript. Unfortunately, we have so far not been able to successfully perform (co-)immunoprecipitation of MUC13 with our current anti-MUC13 antibodies.

      To provide more insights into the possible MUC13-PKC interaction, we plan to perform several experiments.

      • First, we will determine the expression levels of the different PKC isotypes (PKC alpha, beta, gamma, delta, epsilon, and zeta) in the HRT18 cell lines by western blot.
      • Next, we will determine the localization of the relevant PKC isoforms and MUC13 by immunofluorescence microscopy. We are curious to see if we can find a colocalization between MUC13 and a PKC member on the lateral or apical membrane. If we can demonstrate a colocalization, we could follow up with a proximity ligation assay, but this would require the MUC13 antibody directed against the cytoplasmic tail (which only detects the lateral population) and might therefore be challenging.
      • Furthermore, since PKC delta protein levels were upregulated in the total lysate of ∆MUC13 cells, we will test a PKC delta-specific inhibitor in the TEER assay.

        Consider quantifying all blots (Fig. 5C, Fig. 6B).

      As suggested, we will quantify both blots.

      Consider using dot-plots for all quantified data.

      The graphs will be altered to include individual measurement points.

      Reviewer 2

      Fig2E showed two bands with different size in the two MUC13 WT control cell lines. They hypothesized that this could be the consequences of glycosylation different patterns. A sample with untransfected HRT18 might be included in the western blot panel. Additionally, what is the 100kDa band?

      Mucin blots are notoriously difficult and these MUC13 blots are the result of a lot of trial and error. We repeated the Western Blot with original HRT18 cells, HRT18 original cell line, as well as the two CRISPR control cells used in the study (WT 1 and WT 2) and one of the full-length MUC13 knockout cells. The higher band was absent from the MUC13 knockout cells, but a small shift in the MUC13 band size can be noted in the WT 1 cells compared to the original and the WT 2 cell lines, possibly indicating a change in the glycosylation pattern. The 100 kDa band remains detectable in all cell lines including the ∆MUC13 cell line, therefore we consider this to be an aspecific background band of the MUC13 antibody. We will add a more extensive Western Blot analysis to the manuscript.

      Did the transfection of the inducible GFP-MUC13 plasmid induce any decrease of Claudin1/3/4 in HRT18 or Caco2 cells? Same question regarding PKCdelta.

      These are indeed interesting questions. We will perform these experiments with our MUC13-overexpression HRT18 cells.

      Reviewer 3

      Moreover, the authors should determine if MUC13∆CT localize to TJs, as suggested by the working model in Figure 7C. The subcellular localization of MUC3∆CT could give critical clues for its function, but Figure 2G fails to provide any information and the authors do not present any additional data concerning the localization of MUC13∆CT. Detection of MUC13 in membrane fractions of WT, MUC13∆CT and cells lacking the mucin domain could be a feasible strategy forward.

      We will perform additional immunofluorescence experiments to determine the subcellular localization of MUC13-∆CT more accurately. However, detection of the extracellular domain by western blot, as suggested, is not possible due to the incompatibility of the extracellular MUC13-directed hybridoma antibody with the western blot technique. We currently do not have a suitable antibody that recognizes the ED and can be used for western blot.

      The authors introduce an inducible MUC13-GFP fusion protein into WT and ∆MUC13 cells and show that it reverses the enhanced TEER upon MUC13 deletion. Unfortunately, the "Materials and Methods" section lacks adequate information on how this fusion protein was designed. Critical questions are the position of the GFP tag within MUC13, whether the fusion protein is correctly processed in HRT18 cells, and if it localizes to the apical or apico-lateral membrane domains? Figure 2H is of low magnification and fails to provide information on the subcellular localization of the MUC13-GFP fusion protein.

      The materials and methods section will be adjusted to describe all the design details of the fusion protein. The GFP tag was added to the MUC13 C-terminus with a GGGS linker sequence in between. Processing of the fusion protein seems correct as we observed MUC13-GFP localization to both lateral and apical membranes and no access intracellular build up. As suggested by the reviewer, we will add more detailed immunofluorescence pictures to the manuscript.

      Figures 6B-C suggest that PKCdelta levels increase in ∆MUC13 cells, which correlates with higher enrichment of Claudins in membrane fractions. The authors then inhibited PKCdelta and observed reduced recruitment of Claudins to membrane fractions. Since the family of Claudins are differentially regulated by phosphorylation (PMID: 29186552), the authors should investigate the TEER phenotype of WT, ∆MUC13 and MUC13∆CT upon PKC inhibition.

      We must clarify that figures 6C-D are done using the PKC inhibitor targeting all conventional PKCs (alpha, beta, gamma) as well as delta (https://www.tocris.com/products/gf-109203x_0741). We recently obtained a PKCdelta-specific inhibitor which we will test in the TEER build-up experiments.

      Moreover, the authors predict phosphorylation sites in MUC13CT and suggest a link between PKC and MUC13 (Figure. 6A), however no evidence is presented to support this hypothesis. The authors should either determine if PKC phosphorylates MUC13 and if this modification has implication on MUC13 localization and TJ function, or remove statements regarding MUC13 phosphorylation. The data provided suggest that PKC regulates TJ proteins independent of MUC13.

      We will adjust the manuscript to put less emphasis on the putative PKC motifs in the MUC13 cytoplasmic tail. For further details on how we will proceed regarding the possible MUC13-PKC interaction see question 1 from reviewer #1.

      Figure 5C. Quantification of at least 3 independent experiments is required.

      These data will be added to the manuscript.

      Figure 6B. Quantification of at least 3 independent experiments is required.

      These data will be added to the manuscript.

      Reviewer 4

      OPTIONAL: MUC13 is expressed both, in the basolateral membranes and in the apical membrane of intestinal epithelial cells (IECs). Does the authors check the relevance of MUC13 in the formation of microvilli in IECs? Are microvilli different (microvilli staining, number of positive cells to microvilli, length, width or distribution of microvilli) in ΔMUC13 and in MUC13-ΔCT? How the glycocalyx looks like in these cells genetically modified for MUC13?

      HRT18 cells do not seem to develop microvilli. However, we plan to stain these cells with a microvilli-specific antibody (ACTUB). The HRT18 cells express mostly MUC13 and relatively low levels of the larger TM mucin MUC1. To study changes in the glycocalyx, we will stain using a MAL-II antibody which targets α-2,3 sialic acids, which are abundantly present in mucins. In this way, we will determine any big changes in the total glycocalyx that may occur in response to the removal of MUC13.

      In the figure 1D would be nice to represent the co-localization of MUC13 together with occluding in a graph in each Z-stack so you can visualize in which part of the cell is maximum colocalization of these both components.

      These data will be provided.

      In the figure 1E, would be great to compare between the two different MUC13 antibodies the apical fraction stained in HRT18 and Caco-2. Specially in the HRT18 cell line since the first antibody did not label apical MUC13 expression meanwhile the second antibody detects the apical expression in these cells. How much lateral lateral stain the C terminal antibody compare with the extracellular antibody for MUC13 and how much stain apically the C terminal antibody compare with the extracellular antibody? Would be nice to see some comparative results using the intensity by Z-stack and plotting in a graph.

      This is a good suggestion as it is quite intriguing that both MUC13 antibodies seem to target (partially) different MUC13 populations. We will perform co-staining with both MUC13 antibodies to provide information on which MUC13 populations are detected by each antibody (apical vs lateral membrane).

      Manuscript would be improved if in the figure 2H to compare within the same cell line the number of MUC13 positive cells in the WT, number of MUC13 positive cells in WT+pMUC13 and the number of MUC13 positive cells in the ΔMUC13+pMUC13

      We will quantify the percentage of MUC13-GFP positive cells in both the WT and ΔMUC13 backgrounds by either microscopy or flow cytometry.

      In figure 5C would be helpful to plot in a graph the normalized expression of each TJ protein and compare between the different cells used (WT, ΔMUC13 and MUC13-ΔCT) as you did in figure 5A

      We will provide the quantification data of three independent experiments.

      Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer 1

      In addition, this model does not explain why all kinase inhibitors tested reverse the increase in TER observed in deltaMUC13 cell lines. Does this reflect the lack of inhibitor specificity or the likelihood that many kinases are involved?

      As stated in the manuscript, we think that MLCK, ROCK, and PKC are all essential for TER buildup in the ∆MUC13 cells. Because the roles of MLCK and ROCK are well established, we choose to follow up on the PKC results. We adjusted the text to clarify this point.

      The authors do observe that there is an increase in expression of several tight junction-associated proteins, including the claudins, in deltaMUC13 cells. Affected CLDNs include 1, 2, 3, 4, 7, 12. (1) While it appears the authors are arguing that this increased claudin expression results in increased barrier function, they do not sufficiently highlight the well-known role that CLDN2 has in cation transport, and both CLDN-4 and -7 have also been implicated in paracellular ion flux (although this is apparently cell-type specific). These observations would seem to argue against a simple correlation between claudin expression and tight junction barrier function.

      The reviewer is right about the different functions of claudins. Claudin-2, -4 and -7 have (potentially) pore-forming properties, while the other claudins restrict paracellular passage. It has been previously demonstrated that the magnitude of paracellular ion and water flux is reflected by the specific repertoire of claudin family members (Shashikanth et al., 2022). In this paper, overexpression of claudin-4 was shown to mobilize and affect polymeric strands of claudin-2, thus blocking its channel activity. Our mass spectrometry data demonstrated a striking increase in claudin-1, -2, -3, -4, -7, and -12 in the MUC13 knockout membranes compared to WT. We hypothesize that the claudin repertoire in the MUC13 knockout cells leads to a more restricted paracellular route (as observed in the TEER and tracer experiments). The pore-forming claudins may be subject to “interclaudin interference” therefore leading to restriction of the total paracellular ion and water flux. We have adjusted the text of the manuscript to clarify this point.

      We attempted to investigate claudin-2 expression levels in isolated membranes by Western Blot but were unsuccessful as the antibody did not detect any protein while claudins-1 and -4 could be detected with the same method.

      Furthermore, the authors should note the disconnect between paracellular ion flux mediated by claudins and the flux of markers such as dextrans and lucifer yellow, which can be dissociated from claudin function.

      We acknowledge that the flux of larger particles (the leak pathway) is not regulated by claudins (which regulates the pore pathway). We aimed to assess both the pore and the leak paracellular pathways, by using different techniques including TEER, small solutes (Lucifer Yellow CH), and larger molecules (4 and 70 kDa FITC-Dextrans). HRT18 wild type cells are already very restrictive to the pass of larger molecules (FITC-Dextrans) but are more permeable to smaller solutes such as Lucifer Yellow (400 Da). We observed that removal of the MUC13 cytoplasmic tail did not affect the TEER, but reduced the paracellular passage of Lucifer Yellow, demonstrating that manipulation of MUC13 can affect both the pore and leak pathways. We adjusted to text to include this point.

      The increased expression of claudins in the nominally tail-minus MUC13 without a corresponding change in TER would again seem to argue against a simple correlation;

      MUC13-dCT cells showed consistently increased levels of claudins-1 and -2, but not the other claudins. This claudin repertoire (with high claudins-1 and -2, but lower claudin-3, -4, -7, and -12) is apparently not enough to increase TEER. We think that this again reflects the importance of the total claudin composition for the control of the paracellular pathway.

      Watch the use of decimal points instead of commas (lines 253 and 256).

      Corrected.

      Line 543: MilliQ is not a washing agent (or is it?). (Line 535) We use MilliQ as a final step before mounting the glass slides to remove any possible salt deposition that would affect the visualization by microscopy.

      We have specified this in the text.

      Line 553: TER is the product of total resistance times the area. The units are ohms times area.

      Indeed, we have changed this mistake (line 545).

      Line 630: Please provide the transfer conditions (voltage, amp, watts?) and transfer buffer when describing the Western blot protocol.

      For immunoblotting of MUC13, protein lysates were transferred to 0.2 µm PVDF membranes using the Trans-Blot Turbo Transfer system (Biorad). The transfer was run using the protocol (High MW) which consisted in running for 10 min at 25 volts (V) and 1,3 amperes (A). These experimental data were added to the manuscript.

      Reviewer 2

      My main concern about this manuscript is that the authors analyzed MUC13 role in intestinal homeostasis and function using colorectal cancer cells. As helpful as cancer cells are, we should always be cautious about extrapolating roles in normal intestinal epithelium or IBD pathology. Obviously, these finding are also interesting in a cancer context. Using GEPIA (http://gepia.cancer-pku.cn/), I observed that MUC13 is overexpressed in colorectal cancer COAD-TCGA dataset (compared to normal colon from GTEX). Similar results were obtained previously by Gupta et al. (ref #10). I am aware that this would be difficult to confirm the main findings in a non-cancerous intestinal cell line but this limit (normal intestine using cancer cells) should be at least discussed in the manuscript.

      We appreciate the reviewers’ comments and are aware of the downsides of using cancer-derived cell lines. We have performed the GEPIA analysis ourselves and have an ongoing project about the possible role of MUC13 in colorectal cancer progression. In a separate project, we are collaborating with the Gaultier Laboratory at the University of Virginia which has generated a MUC13 knockout mouse. This model will allow us to study the role of MUC13 in non-cancerous tissue. We recently received intestinal biopsies from these mice which will be stained with MUC13 and claudin antibodies to determine localization in healthy tissue. These experiments will reveal if MUC13 colocalizes with claudin on the lateral membrane in the healthy mouse intestinal tract. In future experiments, we will also address MUC13 localization and function in human intestinal organoids. We have adjusted the discussion to refer to the limitations of using cancer cell lines.

      Massey et al (Micro 2021, PMC7014956) previously showed that MUC13 overexpression increased rigidity in PDAC cells and discussed involvement MUC13 link with EMT. MUC13-Her2 interaction was also associated with decrease of E-cadherin suggesting an EMT phenotype. This should be included in the discussion section.

      The discussion has been adjusted to include the link with EMT.

      The authors performed mass spectrometry analysis. Results are deposited on ProteomeXchange but are not yet publicly released. Among the 1189 membrane protein identified. Did the authors observed alteration of EMT proteins? (decrease of vimentin for example). In the discussion section (lane 347), the authors mentioned the relationship between other membrane bound mucins such as MUC1, MUC4, MUC16 or MUC17 and AJ/TJproteins. Did the authors observed any alteration of these mucin in the mass spectrometry data?

      The mass spec analysis was performed on membrane fractions, therefore our dataset will not contain true cytosolic proteins. One of the key EMT proteins, Vimentin, is a cytosolic protein, and indeed it was not found in our dataset. Other EMT-related proteins are shown in the following table. TGF beta 1 was slightly decreased, while E-cadherin and Integrin beta 6 were slightly increased in the ∆MUC13 cells compared to WT cells.

      Gene Name

      Mean WT

      Mean ∆MUC13

      Mean MUC13-∆CT

      TGFBI (TGB beta 1)

      20,54

      16,48

      18,83

      CDH1 (E-cadherin)

      22,69

      24,57

      24,24

      ITGB6 (Integrin beta 6)

      18,86

      21,74

      19,19

      Vimentin - Cytosolic

      -

      -

      -

      CDH2 (Cadherin-2, N-cadherin)

      -

      -

      -

      Mucins are large proteins comprised of densely O-glycosylated mucin domains, which makes them extremely challenging to study by mass spectrometry (MS) (Rangel-Angarita et al., 2021). We did not specifically employ mucin-directed technologies in this dataset, thus making the detection of mucins hard. No mucins other than MUC13 were detected. For MUC13, two peptides corresponding to the EGF-like domains in the extracellular domain, a region that is less densely glycosylated. We added a sentence to the description of the mass spec results to include the EMT proteins and other mucins.

      Minor points:

      Lane 126: HRT18 and Caco2 colon cancer cells instead of intestinal epithelial cells

      Corrected.

      Lane 181 and lane 514: add "full length" MUC13 DNA sequence

      Corrected.

      Lane 234: TEER was measured every 12h. How the authors did observed the largest increase at 42h? Was it 48h? Please clarify.

      We aimed at measuring every 12 h, however the exact measurements were done at 18h, 24h, and 42 h post-infection. We have corrected this in the manuscript.

      Reviewer 3

      Line 43 and 46. "Enterocytes" should be replaced with "intestinal epithelial cells", since enterocytes are themselves a distinct subpopulation of IECs.

      We have changed it in the manuscript.

      Lines 58-60. References in support of the statements should be added.

      We added a reference to this sentence.

      Lines 188-190. Authors comment on "roundness" of different cell lines. If the parameter is critical for the manuscript, the authors should quantify this phenotype.

      The parameter is not critical for the manuscript. We removed the sentence.

      Figure 3A. Staining of cell lines should include panels showing localization of MUC13.

      Co-staining of MUC13 with occludin in HRT18 cell lines can be found in figure 1D, and MUC13 with E-cadherin in supplementary figure 1.

      Lines 323-327 and 390-392. Sentences on these lines contradict each other. The sentences should describe/discuss quantified data presented in Figure 6D.

      The reviewer is right that we should be discussing the quantified data in 6D. We adjusted the sentence in line 323-327.

      Proteomic data sets should be made publicly available on data depositories.

      All proteomics raw data were deposited to the ProteomeXchange Consortium with the dataset identifier PXD029606.

      Reviewer 4

      OPTIONAL: In the figure 2E, is the extracellular antibody still detecting the MUC13-ΔCT?

      No, unfortunately the antibody directed against the MUC13 ED is not compatible with western blot.

      In the figure 2G, would be nice to comment possible reasons why the deletion in the first cell line of the MUC13-CT you can still detect with the extracellular antibody some lateral expression of MUC13 meanwhile in the second cell line, the same deletion (MUC13-CT) you cannot see any lateral MUC13 staining with the extracellular antibody.

      Yes, this is indeed a puzzling finding, especially because the CRISPR deletion is the same in both cell lines. We will add a sentence about possible reduced stability of the MUC13 without CT domain that leads to a different outcome in both cell lines.

      It would be nice that the results from Figure 3H are better explained since it is difficult to follow.

      We adjusted the text to explain the experiment in more detail.

      2. Description of analyses that authors prefer not to carry out

      Reviewer 1

      The authors may be overly reliant on TER measurements. Epithelial cells have two parallel resistive pathways: transcellular and paracellular. TER measure the contribution of both. Thus, an increase in TER could result from a decrease in transcellular ion transport. The authors need to measure transcellular ion flow or selectively measure the junctional resistance in a select set of experiments to rule this possibility out.

      The reviewer is right that TEER is a sum of the resistance of the transcellular and paracellular pathways. However, due to the high resistance of cell membranes, the current predominantly travels via the paracellular route (Elbrecht et al., 2016). For this reason, TEER measurements are widely accepted techniques for the assessment of ions passage through the paracellular pathway (Shen et al., 2011).

      Reviewer 3

      Figure 1C. Caco2 and HRT18 cells exhibit distinct MUC13 expression patterns when probed with an antibody against the MUC13 CT; MUC13 localizes almost exclusively to lateral cell junction in HRT18 cells, while a higher portion of MUC13 is present on the apical surface of Caco2 cells. This observation has two possible explanations: 1) the two cell lines express distinct forms of MUC13, or 2) the two cell lines carry distinct machineries for anchoring MUC13 to apical versus apico-lateral membranes. Thus, The authors should take the opportunity to determine the impact of MUC13 deletion on TEER and TJ function in Caco2 cells. Proteomic analysis and functional assays in Caco2 cells may provide more a general mechanism for how MUC13 regulates TJ proteins.

      Yes, this would be a great line of investigation. However, we aimed to knockout MUC13 in Caco-2 cell lines (with the same CRISPR/Cas9 protocol as the HRT18 cells) but were unable to obtain Caco-2 knockout clones. We think this might be a consequence of the poor capability of Caco-2 cells to grow as single colonies (a required step in the protocol). Another option is Caco-2 MUC13 knockout cells have reduced viability.

      The authors generate cell lines that either lack MUC13 or express MUC13 lacking the cytoplasmic domain. Loss of MUC13 cells resulted in enhanced TEER and increased recruitment of TJ proteins to membrane fractions. MUC13∆CT cells show moderate recruitment of TJ proteins to membranes and no increase in TEER but inhibit paracellular diffusion of Luciferase Yellow across monolayers. Figure 3A suggests that Occludin redistributes to tricellular junctions in ∆MUC13 cells, whereas it is found more laterally in WT and MUC13∆CT cells. These finding suggest that full-length MUC13 interferes with TJ protein complexes. However the impact of the extracellular and intracellular (CT) domains is not fully elucidated. Does the O-glycosylated mucin domain interfere with the extracellular domains Occludin and Claudins? The authors should clarify the contribution of the mucin domain to the observed phenotype, for example by performing the described experiments in a cell line expressing MUC13 lacking the mucin domain.

      Mucins are type I membrane proteins with the N-terminal part of the protein on the extracellular site. Therefore, a CRISPR method to specifically remove the glycosylated domain but leave the remainder of the protein in frame is challenging. An additional difficulty is that the ED contains a lot of repeats, complicating the design of specific guide RNAs. To specifically address the contribution of the glycosylated domain, we could complement the MUC13 knockout cell with a construct lacking the ED. However, this would not be comparable to the endogenous MUC13∆CT cell line presented in this manuscript. In future studies, we will strive to address the functions of the different MUC13 domains in more detail.

      Figure 5A. Turnover of TJ proteins in membrane fractions occurs faster than over a period of 1-3 days (PMID: 18474622). The authors should determine TJ protein turnover over a period of minutes and hours.

      We acknowledge the findings in this interesting paper concerning the continuous remodeling of tight junctions. However, the readout of our biotinylation assay is degradation and the timeframe of degradation turns out to be days and not hours. Within this timeframe remodeling is taking place but it cannot be captured in the total lysate.

      Reviewer 4

      OPTIONAL: The authors show that the probiotic Lactobacillus plantarum increase epithelial barrier independently of MUC13. Have the authors considered to use other probiotics as Lactobacillus paracasei (10.3389/fcimb.2015.00026), Akkermansia muciniphila (10.1038/emm.2017.282) or some metabolic products from intestinal microbiota as short-chain fatty acids (SCFAs) (10.3389/fphys.2021.650313) to check what is the role of MUC13 and if it is related with other microbe or microbiota metabolite?

      Thank you for the suggestion. We have an ongoing project in which we investigate the impact of different probiotic bacteria and plan to investigate whether they have an impact on the epithelial barrier function in a MUC13-dependent manner. This study will lead to a separate publication.

      OPTIONAL: The authors successfully delete MUC13 in IECs, both, full length and the cytosolic tail. Have the authors considered targeting the deletion of the PTS domain in MUC13? Could affect that something different from paracellular trafficking as the extracellular detection of microbes and microbial products?

      Removal of a domain in the extracellular domain of MUC13 with CRISPR is challenging because mucins are type I membrane proteins, the repeats and possible frameshift, as described above.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Official Revision Plan Document:

      Manuscript number: #RC-2022-01681

      Corresponding author(s): Nicholas, Leigh

      1. General Statements

      We sincerely appreciate these positive and helpful reviews. We are grateful for the constructive comments and we outline our responses below. Addressing these comments will further broaden the impact of the work and increase the power, reliability, and application of single cell approaches while decreasing the cost and labor intensive collection steps.

      As single cell sequencing approaches have entered the mainstream, we are still finding flaws and artifacts from these methods. A major limitation of widely used collection approaches is a difficulty in obtaining biological replicates, which are required to generate robust sequencing datasets. In general, a lack of biological replicates has been a major oversight in the vast majority of single cell studies, and any technique that can facilitate biological replicate collection should be widely applied. The elegance of SNP-based demultiplexing lies in the fact that it can be applied regardless of any external label, applied to previously collected data, and the data are already collected for every sample sequenced. We were pleased to have the reviewers agree and identify the many conceptual advances in this manuscript, with one major critique being noted by one reviewer as a lack of novelty.

      Regarding the lack of novelty, we appreciate that SNP-based demultiplexing was not developed as a method within this manuscript, but disagree that a broad benchmarking and validation study that opens the doors to the use of SNP-based demuxing in any species with sufficient between animal genetic heterogeneity lacks novelty. To address this concern, we will now further emphasize the drawbacks and artifacts that can arise in the currently common practice of pooling samples and choosing not to demultiplex, while improving our explanation of our discoveries in this manuscript. The lack of biological replicates in single cell sequencing studies is rampant and needs to be addressed with approaches such as those demonstrated here. We also want to emphasize the importance of validating and benchmarking bioinformatic approaches with orthogonal, priorly established approaches (eg. wet-lab based methods), which had previously not been conducted for SNP-based demultiplexing, outside of human samples. The inbred nature of common lab animals and broad range in quality and availability of genomic resources make this a major step forward in bringing SNP-demultiplexing to all labs. We believe that our paper broadly extends, benchmarks and most importantly validates the advantages and limitations of SNP-based demuxing across various species.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      “Cardiello et al tested if souporcell (https://pubmed.ncbi.nlm.nih.gov/32366989/) can be used to demultiplex samples for some model organisms, based on identified SNPs. For this, they used synthetic multiplexed data, publicly available datasets and some new datasets, spanning samples from five model organisms. Their analysis indicates that souporcell could be used to

      demultiplex scRNA-seq experiments for multiple species, which offers a cost-beneficent approach.

      The manuscript reads well and shows this approach can work for different model organisms. However, unfortunately, I am confused about the amount of novelty in this manuscript. The method, souporcell, is already published. The authors indicate souporcell is not validated in non-human samples, but the original paper states that their method works with malaria parasite data (Fig 3b, FigS4). Adapting and using an available tool for different model organisms is good and groups working on different model organisms may find this manuscript useful, but the same could be said for the original article. Due to these reasons, I am not sure whether this manuscript has novelty sufficient for publication.”

      __Our response: __We appreciate this constructive criticism that helped us realize that our novelty was not clearly stated in the first version of the manuscript. We need to improve our Introduction and our verbiage as to what has been previously performed and how this current manuscript provides novel insight into multiple previously unanswered questions which broadly extend the utility of SNP-based demultiplexing. To address this comment, we will revamp our Introduction, Results, and Discussion to more clearly highlight the novelty of this work.

      Planned revisions:

      Defining “validation”. We define validation as establishing the accuracy or validity of a method. Therefore, validation of SNP-based demultiplexing for use in non-human species requires comparison to an already proven, orthogonal method, such as a wet-lab based demultiplexing approach. The souporcell paper does not validate (i.e., confirm with an orthogonal wet-lab method) the results from souporcell in any species but humans. This lack of validation for SNP-based demultiplexing in samples from non-human species made it unclear how and if these approaches would work in other species. Human samples are expected to perform exceptionally well in this approach due to their extremely high genetic diversity and wealth of available genomic resources. Thus, while it was exciting that the original souporcell authors chose to try applying their algorithm to a non-human (e.g., malarial parasite) dataset, the paper left many unanswered questions about potential uses and accuracy. In addition to validating the accuracy of souporcell results in many species, we demonstrated that souporcell shows a relatively poor ability to call doublets in many non-human vertebrates. In addition to highlighting a novel drawback of the method, this demonstrates the need to validate the accuracy of different aspects of tools like souporcell when applied to new systems rather than use souporcell or other SNP-demuxers prior to validation. Highlighting other novel findings in this work: For instance, our assessment of which genomic resources are required for using SNP-based demultiplexing in different species, whether this could be applied to lab animals likely to be inbred to various degrees (and to address other reviewers comments, the inbred level permitted), assessment of the accuracy of SNP-demultiplexing in species with alignment references of varying qualities (i.e., only de novo transcriptome) and genomes of varying sizes (up to 30Gb, 10 times larger that of human, which can be extremely computational intensive), and the exploration of pooling and demultiplexing of multiple species in a single library. Making clear how we made the necessary adjustments to the original souporcell pipeline to successfully apply it to datasets with various resources available in these species.

      (Reviewer #1): I also wrote down two minor points below:

      1- Doublets assigned by souporcell compared to the fluor-based assignment look random. In Fig 2 doublet recovery rate looks smaller, and in fig 3 doublet rate prediction looks more random. This is a bit confusing. Is there any explanation for this?”

      __Our response: __We agree and thus noted in the manuscript that the detection of doublets in these datasets by Souporcell are not very reliable.

      Planned revisions:

      We will expand our Discussion to include brief hypotheses for factors that likely contributed to poor doublet detection by souporcell in these analyses. In the Discussion we will clearly suggest complementary approaches for improving the detection/removal of doublets in pooled scRNA-seq experiments through applying external gene expression-based doublet detection programs. We will also attempt to use these programs on at least one of our datasets to see how well independant doublet detection methods complement souporcell on pooled datasets. A full benchmarking of these doublet detection methods already exists and will be referenced in our Discussion.

      Reviewer #1: “2- The authors discussed the immune system cells might show some variability in their discussion (referring to fig 3), but this is not clearly shown in the figures as data. Having a percentage bar graph could make it clearer for the readers.”

      __Our response: __This is a valid point that we plan to address with the addition of a new figure as well as some clarifications in the text.

      Planned revisions:

      We will make a supplemental figure for Figure 3 in which we clearly demonstrate animal to animal variability. (bar plot of absolute cell numbers present from each individual animal present in each cell cluster as requested). In the new supplemental figure we will also include a new UMAP plot of fluorescently assigned cell identities belonging only to one of the three animals, which makes it easier to visualize the difference in numbers of cells from each animal present in each individual cell cluster. We will also cite papers that have already demonstrated the phenomena of animal to animal variability in scRNA-seq datasets. We will further emphasize that even in the absence of animal-to-animal variability in co-clustering, that demultiplexing pooled datasets is important because differential expression analysis is greatly enhanced with biological replicates.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Major comments:

      “1. SNP-based demultiplexing performed well on some species, such as zebrafish and Africa green monkey, from which over 90% of the cells analyzed were correctly identified. However, this accuracy decreases in Pleurodeles samples when a common SNPs VCF file is absent (Fig.3). It showed that cell identity can be more precisely defined with the increase of average read depth (Fig 3B). So, I am wondering whether the mis-defined cells shown in Fig. 3E, actually are cells with lower reads. It is better if the authors can test such a correlation between the cell identity and the depth of reads using the data from Fig. 3E.”

      __Our response: __We are thankful to reviewer #2 for raising such a great point. We do see the accuracy of the benchmarking results for this experiment increase with increasing sequence depth/cell quality. However, the reasons for this are potentially more complex than just higher accuracy of souporcell in higher quality cells: The fluorescent-based demultiplexing that is being used for “ground truth” in benchmarking souporcell for this figure is more accurate in cells with higher read depth because more fluorescent gene reads are likely to be captured. Therefore analyzing the accuracy of souporcell relative to fluorescent-based demultiplexing over varying read depths can be confusing because it is possible that both methods improve in accuracy with higher read depth. Figure 3B attempts to illustrate this concept, and to demonstrate why we chose to benchmark only the cells with sufficient read depth (read depth between 5K, and 40K, and >1 fluorescent gene read per cell). We plan to complement our manuscript with additional figures and text that will make this clearer.

      Planned revisions:

      We will produce a plot similar to Figure 3B, but with a Y axis that is the percent agreement between the two methods. For Figure 4 we will also make a plot showing percent agreement between demux methods versus read depth. This plot will be a useful comparison to investigate whether scRNA read depth is directly affecting the quality of souporcell’s SNP-based demux results. Plotting this comparison for a dataset in which Cellplex/Cell hashing is the benchmarking demux method is a more fair test of the effect of sequencing depth on the souporcell demux results because cellplex results rely on reads from the cellplex library, which are an independent sequencing library from the scRNA reads. We will investigate whether the use of a common VCF file or lack thereof affects souporcell accuracy. To test this, we will try repeating souporcell demux of one dataset with and without a common VCF file input to see if the VCF file inclusion affects the accuracy of souporcell results.

      Reviewer #2:

      2. Please discuss limitations of this approach in the manuscript. (1) To which extent, when SNPs are roughly present in the individuals of same species, SNP-based demultiplexing can be applied, e.g., individuals from an inbred strain (c57bl6 mice) would not work.(2) The authors experimentally tested two newt species using SNP-based demultiplexing. When multiple species are experimentally applied, may the cell/nuclei size variation cause problem?”

      __Our response: __We agree with Reviewer 2 that this paper brings up many technical questions about the limits to which SNP-based demultiplexing will succeed. These limitations should be addressed more thoroughly in our Discussion section.

      Planned revisions:

      We will expand our Discussion to more fully discuss the predicted limits for SNP-based demuxing for separating pooled cells from genetically similar individuals. We referenced the single paper previously published which reported that Freemuxlet, a similar approach to souporcell, did not succeed when applied to cells pooled from multiple animals within an inbred mouse strain, but did succeed across mouse strains (though without any validation of results). We will expand this Discussion to address the expected effects of genetic diversity on the success of SNP-based demultiplexing methods. We will also note in this expanded Discussion that SNP-based demuxing worked in this paper on siblings (some of the xenopus, some of the zebrafish), and other SNP-based demuxers have been used successfully for demuxing cells from closely related individuals including human siblings (scSplit) and human maternal/fetal pairs (souporcell). We will expand our Discussion to address the potential drawbacks of pooling cells from different species or tissue types including the possibility of a bias in scRNA-seq sample preparation methods. We expect that variations in cell or nuclei sizes between species could cause biases in cell capture depending on the scRNA-seq library preparation method, especially with microfluidic based scRNA-seq preparation methods. We will search for a dataset that would allow for synthetic pooling of inbred mouse data and, if available, put this through our synthetic pooling and demuxing pipeline. While other papers have reported this does not work with other SNP demux tools, and on comments on the souporcell github (https://github.com/wheaton5/souporcell/issues/154) it does not seem to be working, we feel this would be a nice test/reference for showing the limitations for SNP-based demuxing in highly genetically similar individuals.

      (Reviewer #2)* *

      “3. What is the upper limit number of samples when using this model. Please make some estimation or discussion about it.”

      __Our response: __We think this is a pressing question for the future of SNP-based demuxing and deserves further discussion in this manuscript. This is directly addressed by the authors of souporcell in a github thread with regard to human samples (worked on 21 human samples, may work in up to 40). At this point, we have no reason to believe that the limit on sample numbers should be different in other species.

      Planned revisions:

      We will include discussion about potential limits for the maximum number of samples that can be pooled and demuxed using this approach. As discussed below in response to reviewer 3, we will quantify the genetic differences in pooled datasets in this manuscript in order to give readers an improved prediction of how well SNP-based demuxers are likely to work on their animals of interest. We will look for previously published pooled dataset from zebrafish that includes multiple dozens of samples and attempt to SNP-demultiplex this pool. While we will be unable to validate the accuracy, given how well SNP-based demuxing has performed we can at least determine if cell origins are assigned.

      Reviewer #2: Minor comments:

      “1. Please add an algorithm principle of this model.”

      __Our response: __Thanks for the suggestion, we will do so.

      Planned revision:

      We will direct readers to the algorithm principle of souporcell in the original paper and include a flowchart of our workflow for running souporcell piece by piece as we have done in the manuscript. As mentioned above, we will make clear how we made the necessary adjustments to the original souporcell pipeline to successfully apply it to datasets with various resources available in these species.

      Reviewer #2:

      “2. Give a clear definition of doublets including the ground truth and Souporcell result.”

      __Our response: __We appreciate this recommendation. For the purposes of this paper our definition of a ‘doublet’ is a dataset represented by a single cell barcode that actually contains more than one cell. However, true doublets can be identified with absolute certainty only in our synthetically pooled datasets, because no demultiplexing approach used for benchmarking is 100% accurate. Therefore, ‘true doublet’ will refer to known doublets based on synthetically pooled dataset ground truths. Further, for our experimental datasets we will also use ‘confirmed doublet’ to refer to cells that were called doublets by both the ground truth and souporcell. And we will use ‘contested doublet’ to refer to cells in which the experimentally derived ground truth and souporcell result disagree about a potential doublet.

      Planned revision:

      We will insert a clear definition of doublets used in this paper as described above, including the complexity in identifying which doublets are real given the relationship between ground truth and the souporcell results for each experiment.

      Reviewer #2:

      “3. Authors should indicate the time cost of running one round of such analysis, the minimal computational requirements?”

      __Our response: __This is an important point and will be helpful to readers.

      Planned revision:

      We will add to the manuscript information on the required time, RAM consumption, and computational requirements for running various setups for souporcell.

      Reviewer #3: Major comments:

      “The manuscript makes a convincing case for the ability of a preexisting SNP-based demultiplexing tool, called souporcell, to demultiplex pooled samples. The study uses three methods for validation: 1. In silico data pooling; 2. Pooling of transgenic lines; 3. Pooling of cells tagged with CMOs (10x genomics). The results are consistent across experiments.

      The authors propose that souporcell is a solution for demultiplexing pooled samples whenever sample tagging methods are not feasible. Although the authors test this approach in several species and conditions, the validation does not cover all possible cases and situations, obviously. Indeed, the authors recommend potential users to run pilot validation experiments with a secondary demultiplexing methods.

      However, the manuscript would become more useful if the following points are addressed:

      First, what is the genetic relatedness of the individuals pooled in the experiments? What is the SNP frequency in the samples analyzed, and how does that compare to SNP frequency in mouse strains? (The number of SNPs in the VCF is reported in a supplementary table but not discussed in the main text). This point is extremely important: as the authors mention, it is not possible to demultiplex samples from the same mouse strain. Inbreeding is relatively common in laboratory species, even unconventional ones; therefore, information on genetic relatedness and SNP rate would help readers assess whether SNP-based demultiplexing has a good chance to work in their systems. Addressing this point does not require any additional experiments, and computing from the single-cell reads how many SNPs distinguish the individuals pooled here should be straightforward.”

      __Our response: __We appreciate the comments raised by reviewer #3.These are valuable critiques and will greatly improve the manuscript.

      Planned revisions:

      We will expand our Discussion with a paragraph on the limits for genetic differences required for SNP-based demuxing to work, as mentioned in response to Reviewer 2. This will include references to Table 1 values on SNP numbers utilized in each analysis, and hypotheses on the absolute limits for genetic relatedness. We will expand Table 1B to include green monkey. As mentioned in response to Reviewer 2, if previously published data we will also try applying souporcell to data from an inbred mouse line to test run an extreme case of applying SNP-based demuxing to data from very inbred animals. We will more clearly annotate the known relationship between individuals in our experiments, and will discuss this within our Discussion. We will contact the zebrafish and axolotl authors and ask if these animals were siblings. We will identify and apply a method for quantifying the genetic relationship between individuals in each scRNA-seq experiment in this study, to enable us to provide readers with a quantitative measure of genetic diversity present in each experiment. This analysis should shed some light on the requirements for genetic variability in order for SNP-based demultiplexers to succeed.

      Reviewer #3:____

      “Moreover, the relatively limited number of samples pooled does not validate the use of souporcell with a larger number of samples. For example: in developmental studies, often dozens of embryos are collected and pooled. What are the potential caveats of using souporcell for demultiplexing larger number of samples? The Discussion would be a good place to warn potential users of the limitations of the approach.”

      __Our response: __We agree this could still be a limitation, and for developmental studies with multiple dozens of samples, further exploration of optimal demultiplexing methods or the combination of computational and wet-lab based demux methods may be required.

      Planned revision:

      We will expand our Discussion on predicted limits for SNP-based demuxing of high sample pools, as discussed in response to Reviewer 2. We agree that developmental projects often involve pooling large numbers of samples, so it is worth clearly outlining the benefits and risks of planning to use SNP-based demultiplexing on such high sample pools, and to outline the limits as discussed by the developer of souporcell. As stated above, we will work to identify a previously published pooled zebrafish dataset with multiple dozens of samples and run souporcell on it. While this will not provide any validation it will at the least determine if we are able to assign cell origins, which have thus far been very reliable when assignments have been made.

      Reviewer #3: Minor comments:

      “- is the accuracy of doublet detection rate a function of number of samples? This can be tested by repeating the monkey in silico experiment with three individuals.”

      __Our response: __This is a good question. We do not thing that the number of samples substantially affects the accuracy of doublet detection by souporcell, but we will test this.

      Planned revision:

      As suggested, we will repeat the monkey analysis with 3 samples to see how this changes doublet detection. Overall, due to the low quality of doublet detection by souporcell found in this manuscript, we will expand our Discussion of doublet detection to propose some potentially useful recommendations for making conservative doublet calls with souporcell external programs (addressed above in response to Reviewer 2. We expect that the more substantial filtering of the monkey datasets relative to the zebrafish dataset prior to pooling contributed to this question. To make these differences more obvious we will more deliberately emphasize the differences in dataset filtering for each experiment.

      Description of the revisions that have already been incorporated in the transferred manuscript

      4. Description of analyses that authors prefer not to carry out

      From Reviewer 1:

      “More generally, showing more direct evidence for the variability of different cell types (not just the immune system) could be informative for scRNA-seq users.”

      __Our response: __We do not plan to conduct extensive analyses of other published single cell datasets to provide a further reason for why it is important to have biological replicates for single cell experiments. When building this manuscript, we chose not to pursue the option of publishing an analysis of published single cell datasets in which we could identify artifactual results and animal to animal variability, because we worried that this would be harmful to future open science efforts, and therefore, counterproductive. Further, past papers have already demonstrated the issue of batch effects and animal to animal variability in scRNA-seq datasets, and the requirement for biological replicates to facilitate differential expression analysis. As mentioned above, we will do a better job citing the papers that address these points.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript "An Sfi1-like centrin interacting centriolar plaque protein affects nuclear microtubule homeostasis" by Wenz and co-authors describes the detection and analysis of the Sfi1-like protein in apicomplexan parasite Plasmodium falciparum. The authors examined the protein localization and function in asexual stages during parasite replication in the red blood cells. The authors detected PfSlp in the PfCentrin1 pulldown, created PfSlp conditional knockdown strain, and evaluated growth and morphological deficiencies associated with the PfSlp deficiency. The study's primary finding is that PfSlp inhibits the extension of nuclear MTs.

      Major comments

      The key conclusion is appropriate but is poorly supported by experimental evidence. The transitional, experiment-to-experiment conclusions are preliminary and may require additional experiments. The authors did not present a convincing model of the PfSlp1 function in mitosis.

      We appreciate the reviewer’s evaluation that our key conclusions are appropriate, but also have taken some of the valid comments below into account and added some conclusive experimental data and partly modified the choice of words when interpreting the data. We are now fully convinced that our conclusions are appropriate and supported by experimental evidence. To understand the function of PfSlp, which was described for the first time in this study, precisely will require a more detailed model of the still very much understudied malaria parasite centrosome and will be the subject of future inquiries.

      If PfSlp inhibits the MT polymerization, then the PfSlp reduction should lead to an extension of the bipolar spindle, which is partly supported by longer MTs in the hemispindles. How is the excess of the nuclear MTs prevent the spindle resolution in anaphase?

      Intranuclear hemispindle microtubules are indeed elongated. Increased microtubule polymerization does not necessary lead to an increased spindle length but could just as well promote the nucleation of multiple short microtubules or increase overlap between antiparallel microtubules. We, however, want to emphasize that our key conclusion is that PfSlp is implicated in the regulation of nuclear tubulin levels, rather than “inhibits extension of nuclear MT”. In our view this is an important distinction since microtubule misorganization is merely a consequence of changing nuclear tubulin levels. At no point we want to suggest that PfSlp somehow directly inhibits polymerization of microtubules and therefore did not provide any specific evidence. The fact that PfSlp and microtubules are in different compartments underlines this. Yet, we have noted that our abstract uses the word polymerization. Although we mention that it occurs as a consequence of increased tubulin concentration, which thermodynamically favors microtubule polymerization, we acknowledge that this could be misleading and removed this term (line 30). Concerning how the excess nuclear MTs prevent anaphase spindle resolution we propose several explanations in the discussion (lines 381ff). All line numbers refer to document with “tracked changes”.

      Fig 4C misrepresents mitotic phases: bipolar spindle should be broken into two in anaphase, while the drawing shows one elongated spindle connecting two poles.

      Indeed, we frequently observed, anaphase spindles being “split” ourselves (Simon et al. LSA, 2021, Fig. 2A). Although sometimes we would see one elongated spindle and sometimes more than two as in Liffner et al. 2021 Fig. 3A. For simplicity we only drew one elongated interpolar microtubule bundle but have now corrected this for more accurate representation.

      The authors should correct the use of terminology. Throughout the manuscripts, the parasite division stages are called life stages. Life stages are merozoites, gametocytes, ookinetes, sporozoites, etc. The division stages apply to a single life stage and, in the case of schizogony, are rings, trophozoites, and schizonts.

      We once falsely referred to life cycle in line 182 when we should have referred to the intraerythrocytic development cycle. The paragraph using the incorrect wording was removed in the revision.

      Please, note that schizogony does not follow the ring and trophozoite stages (line 119); it includes them as the distinctive morphological stages of one round of schizogony. The cell cycle terminology is incorrectly applied.

      We have the impression that the usage of the term schizogony is rather “fluid” in that it is occasionally also employed to just the describe the phase where DNA replication, nuclear division, and cytokinesis occur (hence schizont stage), but we clearly note the more canonical use as equivalent of the asexual intraerythrocytic development cycle as whole. We modified the terminology accordingly (e.g. by employing “schizont stage”) lines 43, 142, 184, 238, 265.

      What is the "mitotic spindle stage," "mitotic spindle nuclei, "or "mitotic spindle duration" (Fig. 4B)?

      It has now been conclusively demonstrated that nuclei go through independent nuclear cycles with different morphological stages (Simon et al. 2021 LSA, Klaus et al. 2022 Sci Advances). Hence, we use the term “mitotic spindle stage” to contrast it with the “hemispindle stage”, which can be morphologically distinguished using microtubules as a marker and occurs just prior to S-Phase. Consequently, “mitotic spindle nuclei” are nuclei in the “mitotic spindle stage”. “mitotic spindle duration” designates the time nuclei spend in that stage i.e. from hemispindle collapse until anaphase spindle elongation. We have adjusted and more accurately defined the terminology throughout the text and complemented Fig. 1A for clarity.

      Minor comments

      The PfSlp knockdown is inefficient: the 55% reduction at the RNA level translates into a minor change at the protein level (Fig.2 and S4). The evaluation of the protein changes should be done by western blot analysis with appropriate controls. The intensity of the IFA signal (used in the study) changes depending on the focal plane, as seen in Fig 1D.

      Due to the exceptionally big size of PfSlp of around 407 kDa and the low expression levels western blot analysis was not feasible in our hands. For quantification of the IFA signal we used image projections and background subtraction to integrate the signal of the full z-stack containing the entire cell and our measurement was therefore independent of the focal plane. We have now described this a bit more thoroughly in the methods section (lines 620ff). The change in signal as measured by IFA is still clearly significant and shows a reduction of about 45%, which is coherent with the reduction of 55% found by RNA analysis and ultimately results in a specific phenotype.

      Growth defects of the PfSlp KD: It is unclear what causes the reduced parasitemia of the GlcN untreated Slp parasites (Fig. 2C and D).

      A likely explanation is that the C-terminal tagging of PfSlp already slightly impairs the function of the protein causing a mild growth phenotype that is not observed in wild type although it is not statistically significant (Fig. 2C). Importantly, the reproduced analysis of parasite growth, shown as multiplication rate in Fig. 2C (and growth curve in Fig. S6) now more clearly demonstrates that when normalizing for GlcN treatment and GFP-glms tagging (“3D7 corr.”) the growth defect is still significant and can therefore be attributed to Slp KD and not to tagging or GlcN treatment addition, which on their own do not cause a significant phenotype.

      To conclude that the kinetics of DNA replication is affected, the authors will need to perform the real-time measurements of DNA replication forks.

      We thank the reviewer for pointing this out and removed the term “kinetics” (line 182, 269).

      The presented data supports that fewer S/M rounds were performed by PfSlp lacking parasites but gives no way to determine whether the S or the M phase was affected.

      We thank the reviewer for this valuable comment. Our data so far showed that the very first spindle extension, and therefore M-Phase, is clearly affected (Fig. 4A-B). If the first division fails all subsequent S phases and M phases might be affected at the population level. To test whether S-phase is affected we now acquired time lapse imaging of single cells labeled with the quantitative DNA dye 5-SiR-Hoechst and saw no difference in DNA signal increase for PfSlp KD parasites, while nuclear number was reduced, showing directly that M phase rather than S-Phase is affected (Fig. 4C, lines 280ff).

      DNA quantification graph (Fig. 2D) is confusing and does not correlate with the quantification of merozoites (Fig. 2E). Why is the DNA intensity of Slp- parasites lower than the DNA intensity of the Slp+ parasites, even though Slp deficient line produces less progeny? Is it possible that you missed the actual peak of DNA replication? Authors may consider more tight time courses with a few additional time points.

      This is a good point. We have repeated this experiment with longer sampling time and shorter intervals. We now plot the fraction of cells with DNA content above 2N (also to exclude double infections and cells that arrest prior to the schizont stage) as a measure to see how many cells are replicating (Fig. 2D, lines 175ff). Although the replication peak was, as observed before, shifted by GlcN treatment we found no significant differences in height. Although the lack of PfSlp tagging and GlcN treatment in the 3D7- control might favor the slightly more productive replication. We complement this analysis by plotting the average DNA fluorescence intensity over time (Fig. S7A) and the area under the curve (see below), as an approximation of “total replication activity” and still found no significant differences (Fig. S7B). The fact that the DNA fluorescence intensity peak does not correlate with the slightly reduced merozoite number observed in Fig. 2E is not very surprising as the fixed time point sampling for DNA quantification can’t differentiate between cells slowing or even halting progression and thereby confounding the averages. This limitation of single timepoint population analysis specifically highlight the importance of our time resolved single cell analysis presented later in Fig. 4, which clarifies the phenotype. Further, merozoite number counting does not give any insight about ploidy of the individual merozoites. Considering the significant nuclear division defect we also show in Fig. 4 it is plausible that some merozoites in the Slp KD could be polyploid, while globally replication is not strongly affected.

      Given the main claim, the study lacks the spatial-temporal analysis of tubulin described only in words. The tubulin quantifications by WB (Fig. S6) are not convincing, as well as the resulting conclusion of the cell cycle retardation.

      We are not completely sure what the reviewer is indicating by a lack of spatial-temporal analysis of tubulin given that we show time-resolved imaging data of tubulin organization in dividing cells and quantify intranuclear tubulin levels. Those data (particularly Fig. 4A) clearly show a retardation in the mitotic spindle stage. We, however, acknowledge that the data on tubulin quantification via western blot could, as Reviewer 2 also points out, be improved through the addition of biological replicates. We have repeated those experiments twice and can now confirm by statistical analysis that total tubulin, aldolase, and centrin protein levels are not affected by Slp KD at 24, 30, and 36 hpi (Fig. 3E, Fig. S8, lines 232ff). This indicates that the increase in intranuclear tubulin is not a consequence of globally increased tubulin expression.

      It is unclear how the authors arrived at the conclusion that the mitotic spindle is deficient in PfSlp KD parasites. Fig. 3C does not show visible differences in GlcN treated and untreated parasites.

      PfSlp KD parasites show unusual microtubule protrusions branching of the main microtubule mass, which have never been observed in wild type parasites. This should have been indicated more clearly by adding an arrow in Fig. 3C. We further think our observation that the tubulin content in mitotic spindles is almost three times higher on average than in wild type spindles (Fig. 3D) and that those spindles do not properly extend (Fig. 4A-B) justifies this claim.

      How many nuclei are in the cells shown in figure 4 and supplemental movies? It seems as if GlcN treated Slp parasites form one long spindle.

      In a previous study (Simon et al. 2021, LSA, Fig. 1B) we have demonstrated that the number of distinct microtubule foci, i.e. mitotic spindles, observed in cells corresponds directly to the number of nuclei. Hence we can assume that prior to successful spindle extension in the PfSlpKD there is one nucleus or two nuclear masses that are in the process of separation. We now added some new time-lapse microscopy data of DNA- and tubulin-stained parasites that confirms that arrested Slp KD parasites fail to properly divide their nuclei (Fig. 4C, Mov. S4-5) and confirms our previously published findings about nuclear number.

      A majority of PfSlpKD parasites indeed seem to form one long spindle. However, this “long spindle” appears only after a significant time delay during which wild type parasites already have undergone multiple nuclear divisions and could be a downstream effect of this retardation through e.g. increase of total tubulin levels over time (Fig. 3E).

      The conclusion of anaphase block is unsupported: the authors need to demonstrate the accumulation of the metaphase nuclei with a bipolar spindle.

      Anaphase describes the phase of chromosome segregation and includes the full extension of the spindle, as discussed above, both of which fails in more than half of the PfSlpKD parasites (Fig. 4A, Mov. S3, S5) and is therefore interpreted as “failure to properly progress through anaphase” for the first time in the discussion (line 381). We currently can’t think about a more direct way to demonstrate this than by time lapse imaging of the very first mitosis in individual parasites. Any analysis of populations at later time point or using fixed cells will be skewed by the phenotype occurring in the very early stages of nuclear division.

      Reviewer #1 (Significance (Required)):

      The eukaryotic centrosome is a microtubule organizing center that guides the segregation of duplicated chromosomes. Despite being an essential regulator of the parasite division, the apicomplexan centrosome remains poorly understood. Recent studies in Toxoplasma gondii (Suvorova et al., 2015) and Plasmodium species (Simon et al., 2021) demonstrated high diversity of the centrosome organization making the studies of microtubule organizing centers in apicomplexans, particularly challenging. Examining the protein composition is one of the ways to uncover organelle function. The current study would help to understand the evolution of the MTOC and mechanisms of cell division in understudied eukaryotic models.

      The focus of my research is the apicomplexan cell cycle. I previously showed the bipartite organization of the Toxoplasma centrosome and identified and characterized several centrosomal constituents, including centrin partner Sfi1. Our most recent study presented evidence of the functional spindle assembly checkpoint in Toxoplasma tachyzoites.


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Plasmodium falciparum parasites undergo several rounds of asynchronous nuclear divisions to produce daughter cells. This process is controlled by the centriolar plaque, a non-canonical centrosome that functions to organize intranuclear spindle microtubules. The organization and composition of this microtubule organizing center is not well understood. Here, Wenz et al. identify a novel centrin-interacting protein, PfSlp, that, following knockdown, leads to fewer daughter cells and aberrant intranuclear microtubule homeostasis and organization.

      Wenz et al. identify PfSlp via co-immunoprecipitation of P. falciparum 3D7 strain with an episomally expressed PfCen1-GFP, noting PfSlp as a gene of interest based on the presence of several centrin-binding motifs. The authors go forward to generate a transgenic 3D7 strain, equipping PfSlp with GFP and glmS ribozyme, to localize and evaluate the function of PfSlp in asexual blood stage parasites. PfSlp appears to, using immunofluorescence and STED microscopy, localize to the outer centriolar plaque in schizonts, based on its colocalization with PfCen3. The authors show, utilizing the inducible glmS ribozyme knockdown system, that PfSlp is required for proper parasite growth, noting a defect following addition of GlcN. This defect is noted to cause a delay in the initiation of nuclear division, or schizogony. Analysis of intranuclear microtubule dynamics reveal abnormal microtubule organization, specifically an increase in nuclear microtubule abundance and length following PfSlp knockdown. Together, these findings characterize the role of a novel protein, PfSlp, that contributes to nuclear tubulin homeostasis and organization during schizogony.

      Major comments:

      The major claims made by Wenz et al. are largely convincing with the data provided.

      1. One area that requires additional attention is the following: Wenz et al. claim PfSlp and centrin to be interacting partners based on 1) co-immunoprecipitation (without prior protein crosslinking), 2) the presence of centrin-binding motifs in PfSlp and 3) colocalization of PfSlp and PfCen3. This interaction is not interrogated fully and claims specific to this point need to be clarified and described as preliminary. As it is written, Wenz et al. claim PfSlp is required for centrin recruitment to the centriolar plaque but this is not investigated fully. The data show lower levels of endogenous centrin at the centriolar plaque in PfSlp knockdown parasites but centrin protein levels are similar in wildtype and knockdown PfSlp parasites. As is, the phenotype attributed to PfSlp knockdown could be attributed to PfSlp or aberrant centrin recruitment to the centriolar plaque. Experiments manipulating PfSlp centrin-binding motifs would strengthen these claims and elucidate the role of PfSlp apart from centrin. If not included, less emphasis should be placed here.

      We agree with the reviewer that additional evidence to demonstrate the direct interaction between PfSlp and centrin would be adequate. Due to the presence of multiple widely spaced centrin binding motifs in PfSlp, which would require multiple highly challenging rounds of genome editing to be modified, we have opted for reciprocal co-IP using PfSlp-GFP (line 139, Fig. S3, see below). The exceptionally large size of PfSlp of 407 kDa and low expression prevented us from detecting it directly on the western blot, but we found a clear centrin band in the Slp IP that was absent in the control.

      We have also further qualified our formulation about centrin recruitment depending on PfSlp (lines 138, 146). Finally, we agree that there are many factors downstream of PfSlp that can contribute to the observed phenotype, which might include centrins and will be subject of future investigations.

      The 3.5 mM glucosamine has some toxicity in the parental 3D7. Is it possible to use a lower concentration so the growth of 3D7 is unaffected but the grow of the Slp-GFP GlmS parasites is still reduced?

      We acknowledge that the used Glucosamine concentration is on the higher end of the classically used range. The slight toxicity of Glucosamine is dose-dependent and only vanishes at submillimolar concentrations. During initial experiments we have found to generate a robust phenotype with 3.5 mM and decided to carry out all experiments at this concentration. We think that the added effect of PfSlpKD over GlcN treatment alone is sufficiently show as e.g. the merozoite number phenotype (Fig. 2E) and the mitotic delay (Fig. 4B) only occurs in Slp+ parasites.

      Fig 3E - the quantification of tubulin levels requires biological replicates to have means and error bars.

      We fully agree with reviewer 2 (and reviewer 1 who commented along the same lines) and now generated two more biological replicates that allow us to confirm by statistical analysis that total tubulin, aldolase, and centrin protein levels are not affected by Slp KD at 24, 30, and 36 hpi (Fig. 3E, Fig. S8, lines 235ff).

      The use of "centrin" is somewhat imprecise throughout. The authors should specific which centrin (PfCentrin1 or PfCentrin3 or others) they are referring to each time in the text.

      Thank you for requesting this clarification. We have used “centrin” on purpose but have failed to properly explain our terminology in the text. For the detection of endogenous centrin we use a polyclonal antibody raised against PfCentrin3 (Simon et al. 2021). Due to the very high sequence identity between PfCentrin1-4 we can’t exclude cross-reactivity of any polyclonal antibody. Throughout the field so far polyclonal antibodies raised against Chlamydomonas centrin and Toxoplasma centrin 1 have been successfully used to label centrin pool at the centriolar plaque. Since we can’t distinguish with certainty which of the centrins (PfCen1-4) is targeted we chose the general description “centrin”. We were however able to show that all four centrins (PfCen1-4) colocalize at the centriolar plaque (Voss et al. biorxiv, /10.1101/2022.07.26.501452) and that Plasmodium centrins interact with each other was demonstrated previously (Roques et al. 2019) while the interaction between PfCen1 and PfCen3 was shown in this study. Therefore, this will not limit our conclusions. We now explain this better in the text (lines 132ff) and adjusted the labeling in Fig. 1E.

      The mention of the cell cycle checkpoint is an interesting and appropriate point in the discussion. However, the discussion of it in the last sentence of the introduction is less appropriate. It should be removed from line 92-93.

      We are excited by the prospects of this study to finally be able to investigate the presence of checkpoint induced delays using time-lapse microscopy, but absolutely agree with the reviewer and have removed the statement in the introduction.

      Minor comments:

      1. Line 50 - "are remaining unclear" should "remain unclear"

      Has been corrected.

      Line 65 - "players" is quite informal. A better word should be selected.

      Was replaced with “factors”.

      Line 223 - "were" should be "where"

      Has been corrected.

      The delay in schizogony which is observed following addition of GlcN (Figure S5) may be made more convincing if the experiment is performed hours post invasion rather than hours post treatment. The synchronization of the parasites is in question as it is described in the methods.

      We have included this data from our initial exploratory analyses and since it was not central to our argumentation, we choose to add it as supplemental figure. After producing further data, we came to realize that the classical morphological characterization using Giemsa-staining partly mispresents the relevant transition from the pre-mitotic to mitotic stages as the onset of first spindle formation and DNA replication can’t be detected. Previous studies have also indicated that parasites which were drug arrested at the trophozoite to schizont transition were morphologically similar to mid- to late schizonts (Naughton and Bell, 2007). In a context that investigates nuclear division phenotypes we feel that this analysis might rather be misleading and that the provided growth assays, DNA replication quantification, and time lapse movies are significantly more informative. Therefore, we have decided to remove the figure altogether. However, we have moved Fig. S7 to Fig. 4 to show the results of the 3D7+GlcN movie quantification in the context of the Slp+/-GlcN results.

      In general, data presentation is clear and readable. The growth defect observed following GlcN treatment (Figure 2C) could be made more clear with data normalization to emphasize that which can be attributed to PfSlp knockdown and not GlcN.

      This is a good suggestion and we have reproduced the initial dataset (Fig. 2C, Fig. S6, see below) and normalized the 3D7 multiplication rate, which shows the effect more directly than the growth curves displayed before, for Slp-tagging and GlcN treatment (“3D7 corr.”). We still found Slp +GlcN to be the only condition to have a significant reduction in multiplication rate in the first cycle after treatment (24-72hpi) with respect to 3D7 control as well as the normalized 3D7 value (“3D7 corr”).

      Line 276 - Why is nuclear tubulin homeostasis more relevant for closed mitosis? This is difficult to understand. It should be phrased differently or provided with additional explanation.

      We thank the reviewer for the comment and agree that this is poorly formulated. We were meaning to express that in e.g. mammalian organisms the nuclear envelope gets disassembled during mitosis and thereby removes the need to regulate import of tubulin into the nucleus for spindle assembly. This is a self-evident statement and has been removed for clarity.

      Line 316 - "were" should be "was"

      Has been corrected.

      The identity, source, and dilution for each antibody must be reported for each use in the methods.

      We noticed that we had not fully referenced Table S3, where we listed all used antibodies and dilutions, which we have now done throughout the methods section.

      Reviewer #2 (Significance (Required)):

      The mechanisms by which intranuclear microtubule dynamics are regulated by Plasmodium falciparum parasites are not well understood. Furthermore, the proteins that are present near the centriolar plaque remain mostly unknown. Understanding the role of the Plasmodium centriolar plaque and its members is critical to describing these dynamics and contributes to our growing understanding of schizogony, an atypical mode of cell division mode with several rounds of nuclear division lacking cytokinesis. Therefore, the identification and initial characterization of PfSlp1 is useful for malaria parasite cell division community.


      __

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The work by Wenz and Simon approaches the function of a novel component of the malaria parasite centriolar plaque, a structure whose complexity has begun to be unraveled only recently__, greatly by the same group. __The authors identify a homolog of Sfi1, a centrin binding protein highly conserved in eukaryotes. Sfi1 homologues usually co-localize with centrioles.

      As a tool to characterize its function, the authors uses a conditional knock down strategy, based on GlcN addition, to downregulate PfSfi1-like protein (PfSlp). The authors analyze the impact of pfSlp downregulation on cell division progression, and go on detailly characterizing the progression of mitotic nuclear division. In sum the study finds that expression of Slp1 is required for proper progression of cell division in Plasmodium parasites.

      The study is well conducted, and the manuscript clearly written. In general terms I found the data shown to support the author's claims. However, I do have a few points of concern to raise, particularly pertaining overinterpretation of the data, and points that need clarification before the manuscript is fit for publication. In particular the authors should explain more clearly how the data based on fluorescence intensity quantifications was acquired and processed, and how this information is intertwined with the expected kinetics of structures measured, along the cell cycle.

      We appreciate the positive feedback and the constructive comments made by the reviewer and now adapted our interpretation of the data or provide additional experimental data to strengthen our argumentation as outlined below. Further we have added some detail to the description of our experimental approaches in the methods section.

      I outline below major and minor points that require attention,

      Major Points

      The manuscript stems off the premise that PfSlp interacts with PfCen1. Despite the fact that Sfi1 is a known interactor of centrin, that the identified protein in Plasmodium has centrin binding motifs, and these proteins co-localize, the support for the direct interaction between the two proteins is based solely on the IP/MS result. No reciprocal IP results are shown.

      We thank the reviewer for the suggestion and have now added the reciprocal co-IP, which shows a specific interaction between PfSlp and centrin without need for cross-linking (Fig. S3, see also reply to comment 1 by reviewer 2).

      Line 118 specifies that co-localization of Slp-GFP with centrin "corroborates their direct interaction." Co-localization most certainly does not show direct interaction. In addition, Figure 1D shows co-localization with Cen3, not with Cen1, which was the only protein shown to have a physical interaction with Slp via immunoprecipitation. Hence, the claim is unplaced and this section should be reworded for clarity.

      The reviewer is correct to point out that co-localization even at STED nanoscale resolution does not demonstrate interaction. We have reworded this statement. Cen3 was the only other specific protein found in the Cen1 immunoprecipitation (Table S1) and the interaction between the four centrins Cen1-4 was shown in an earlier study in P. berghei (Rogues et al. 2019). However, as the Reviewer 2 also indicated, we did not clearly communicate what the targets of our centrin antibody are. We, indeed used an antibody raised against PfCen3. Due to the very high sequence identity between centrins it is, however, unrealistic to exclude cross-reactivity between centrins for a polyclonal antibody (as explained in more detail in our response to Reviewer 2). We have added an explanatory statement in the main text (lines 132ff). Our recent finding that GFP-tagged PfCen1-4 all colocalize at the same position in the centriolar plaque (Voss et al. biorxiv, /10.1101/2022.07.26.501452) and our previously published study of the centriolar plaque (Simon et al. 2021) gives us additional confidence that the antibody specifically labels the compartment of interest.

      I was surprised to see how little recovery of PfCen1-GFP the authors obtained from their IP experiments. Whilst I understand that a western blot is not quantitative, I wonder, were the amounts of protein loaded onto each lane normalized for comparative purposes in any way? Please comment on this at least in the figure legend so the reader can gage whether the little PfCen1-GFP recovery was a consequence of the IP experiment, or whether the WB is not representative of the actual IP results but rather show a fraction of the recovered material.

      We did not determine the total protein concentration (by e.g. Bradford assay) and therefore did not normalize for protein amounts per lane. Instead, we determined the number of infected red blood cells per ml before Saponin-lysis of the red blood cells and loaded protein lysate equivalent to 1 x 107 cells per lane. We now explain this more clearly in the legend for Fig. S1. During the IP, much of the total protein amount might got lost during the washing steps, which might explain the weak Centrin1-GFP band and the absence of a protein signal in the eluate lane by Ponceau staining (neither a signal for Centrin1-GFP nor unspecific protein signal in the Ponceau). We would conclude that the WB, or at least the lane with the eluate, shows a fraction of the recovered material.

      If the WB is indeed representative of the actual PfCen1-GFP recovery rates, I suggest you discuss the possible outcomes of having pulled down so little from the total cell lysate - could it be that the recovered proteins are representative of interactions happening only for a subset of soluble PfCen1 molecules? Can the little protein recovery be explained by Cen1 interactions with insoluble cell components such as the cytoskeleton?

      As described above, the eluate lane does likely not represent the actual amount of Cen1-GFP that was pulled down and therefore the WB is not representative of the PfCentrin1-GFP recovery rates. Based on our previous studies we are not aware of any cellular PfCen1 pool beside the cytoplasm and the centriolar plaque. Although they might be below the detection limit. The reviewer raises an interesting hypothesis but we don’t have sufficient data to assume an association with the cytoskeleton and verifying this would require extended further studies.

      Were other IP conditions tested? Were the same results obtained?

      We carried out three PfCen1-GFP IPs. Once without cross-linking as shown in the study and twice with cross-linking. The two IPs with crosslinking had different amounts of targets identified (24 vs 162). While we did not detect PfSlp in the one with the low number of peptides we detected PfSlp in the second IP. In both IPs we additionally detected PfCen2 and PfCen3.

      Do you get the same interactors if the IP is done using anti-Centrin instead of anti-GFP?

      We did not test an anti-Centrin antibody for IPs as the protocol from the Brochet group was optimized for the highly specific bead-coupled anti-GFP antibody.

      Please define how you identified "specific hits." This is, please describe your criteria for determining "specificity." Was it an all or nothing selection approach? Are Cen1, Cen3 and PfSlp significantly enriched? And if so, how did you define "enriched for" in the context of your experiment?

      We thank the reviewer for given us the chance to clarify our candidate selection. We specifically selected the Cen1-GFP IP targets without cross-linking since it produced a short list of hits detected by mass spectrometry. We used an all or nothing approach in that we subtracted from that list any protein that was ever identified in a GFP control IP analysis by the Brochet lab using the same protocol (Balestra et al. 2021). This left only three proteins Cen1, Cen3, and Slp, as our “specific” hits. We have modified the text to explain our selection criteria more explicitly (lines 112ff) while avoid using the term “enrichment” since this is an all or nothing selection.

      I'm not at all suggesting here that you repeat this experiment. I understand that the focus of the manuscript is the description of PfSlp, and this stands regardless of the IP results. However, I suggest you include a lengthier discussion of the results shown in SFig1 and Fig1, and the limitations of the approach.

      We appreciate the assessment by the reviewer that the focus of the manuscript is otherwise and acknowledge that this is not an extensive analysis of PfCen1 interaction partners. We have, as requested, added a comment addressing this limitation in the discussion (lines 331ff).

      Line 123 mentions that Cen3 and Slp1 are recruited together only because they co-localize in most cells showcasing hemi-spindles. Please simply keep "simultaneously" here, as this is the only thing you can conclude from your quantification data. Being recruited "together" implicitly means by "the same mechanism", which is not shown by your data.

      We agree that simultaneously is more accurate and we have modified the text (line 146).

      Please specify which statistical test was used for determining significance in Figure S4, and what *** refers to in this case. It is hard to judge really how different these data sets are in light of the overlapping error bars. Also, what is quantified here? Integrated density from an immunofluorescence assay? How are the data normalized to be comparable? How many replicates did you quantify? Or are the data shown representative of a single experiment? I could not find these details in the M&M section or the figure legend.

      We have revisited all figure legends and consistently defining the p-value and number of replicates (usually N=3) and briefly explain the measurement. Further we have extended the methods section to make our image quantification approach clearer.

      Also, on the interpretation of these data; If Slp1 causes a delay in cell cycle progression, and taking into account that the fluorescence intensity of Slp1 varies along the cell cycle, with Slp1 intensity increasing as cell cycle progresses from the ring stages onwards, are these comparable measurements? In other words, are you selecting the same stages whereby the same Slp1 intensities at the centriolar plaque would be expected?

      If I understand correctly these measurements are carried out at 55hs post GlcN addition (when the growth phenotype starts evidencing itself?). At this time point, the relative abundance of ring and trophozoite stages (stages at which Slp1 is not expected to be detectable at the CP) is considerable higher than that of the control condition, hence a reduction in Slp1 is expected, and a mechanistic claim about recruitment or stability would be incorrect. Please clarify.

      As the reviewer correctly points out it is important to normalize for the stages when quantifying the PfSlp intensities. To achieve this, we only selected schizont stage parasites with a similar distribution of cells containing 3-10 nuclei between the conditions to ensure we are looking at comparable stages. We then quantified the integrated density at each individual centriolar plaque, designated by the presence of a centrin signal. Outside of centriolar plaques no PfSlp signal can be detected. As for ring and trophozoites stages, they do not have a discernable centriolar plaque, or at least not with the markers available in the field, and likely do not express PfSlp based on published transcriptomics data (Plasmodb.org). We have revisited the text to make our quantification strategy clearer (line 170, 621ff).

      To understand the relative contribution of Slp1 to the growth delay phenotype, please include 3D7+GlcN control in the quantification of stages shown in Fig. S5. Please check how the data shown in Fig S5 was normalized; the 49 and 73hs bars in the -GlcN condition exceed 100%.

      As indicated in our reply to Reviewer 2 we only included this data from our initial exploratory analyses and since it was not central to our argumentation, we chose to add it as supplemental figure. After producing further data, we came to realize that the classical morphological characterization using Giemsa-staining partly mispresents the relevant transition from the pre-mitotic to mitotic stages as the onset of first spindle formation and DNA replication can’t be detected. Previous studies have also indicated that parasites which were drug-arrested at the trophozoite to schizont transition were morphologically similar to mid- to late schizonts (Naughton and Bell, 2007). In a context that investigates nuclear division phenotypes we feel that this analysis might rather be misleading and that the provided growth assays, DNA replication quantification, and time lapse movies are significantly more informative. Therefore, we have decided to remove the figure altogether. However, we have moved Fig. S7 to Fig. 4 to show the results of the 3D7+GlcN movie quantification in the context of the Slp+/-GlcN results.

      What is "centrin signal" shown in Figure 2B? Centrin1? Centrin 3? Please clarify which centrin protein you are referring to throughout the manuscript, or provide evidence that they could be interchangeably used for localization and intensity measurement experiments.

      We thank the reviewer for pointing out this vagueness. As explained above in the second major point and in the reply to reviewer 2 we use the term “centrin” to emphasize that we cannot be certain to which degree PfCen1,2,3 or 4 contribute to the signal. Our recent preprint (Voß et al. 2022) and Roques et al. 2019 and Simon et al. 2021 however suggest that all centrins co-localize and interact at the outer centriolar plaque. As mentioned we now discuss this in the text (lines 130ff).

      Line 149 outlines that Slp1 and centrin intensities are simultaneously reduced, and that this fact alone "affirms" they are part of one complex, and that this implies that Spl1 is somehow involved in centrin recruitment. This claim is not supported by the data shown. There are multiple possible explanations as to how the intensities of both proteins could simultaneously decrease without them conforming the same structure, the same complex or even directly interacting. For example, if the centriolar plaque homeostasis is altered, or the "intensities" are simultaneously dependent on cell cycle progression, they will both be affected without necessarily ever interacting. In fact, if the centrin intensity monitored is that of Cen3, a direct interaction between Slp1 and Cen3 is not demonstrated at any time. At best, the authors could argue that both proteins are directly interacting with Cen1. Again, even this is no definitive proof that they form the same complex.

      The reviewer is correct to point out that there are multiple explanations for the decrease of centrin and Slp signal and we have phrased some of the relevant statements more carefully (lines 138, 146, 172). We, however, think that our new reciprocal co-IP data (Fig. S3) in combination with the already provided evidence now significantly strengthens our claim about the interaction between centrin and Slp.

      Measurements of DNA content, shown in Figure 2D, show that +GlcN Slp1 knockdown parasites exhibited reduced DNA amounts at 42hs post induction. These results are interpreted as "defects in nuclear division," however, 1. Nuclear division is not analyzed directly, but rather approximated by measuring DNA content. 2. Even in the presence of perfectly normal nuclear division, the DNA content reduction for these parasites at this time point is expected, as cell cycle progression is affected.

      Line 160 states that a reduction in merozoite number corroborates a defect in nuclear division. However, the data shown only quantifies merozoites per schizont. As mentioned above, nuclear division is not directly assayed.

      We thank the reviewer for emphasizing this important distinction (alongside Reviewer 1). Making the conclusion about nuclear division based on the reduced number of merozoites was premature and we now phrased this more carefully (line 198). Even our data showing inhibition of spindle extension (Fig. 4A-B), although being a strong indicator, do not strictly speaking observe nuclear division. Hence, we have added time-lapse imaging data of nuclear number in KD vs control conditions using the quantitative live cell DNA dye 5-SiR-Hoechst (Fig. 4C. Mov. 4-5). These data now clearly show that the nuclear division or M-phase is affected, while the increase of DNA signal, which represents replication, is not distinguishable from the control. This confirms that nuclear division is the initial and relevant phenotype.

      What the nuclear division defects observed are is unclear. Is there fusion, fission? loss of nuclear content? defects in mitosis completion? defects in DNA replication? A reduction in merozoites per schizont, with a concomitant reduction in overall DNA levels could also be explained by a general arrest in the final stages of division. Do other processes linked to nuclear division progress normally? For example, is there daughter cell formation during schizogony without the expected accompanying nuclear division? Are daughters forming in the correct number and position? Are there more daughter cells than nuclei? Or are parasites dying before completing schizogony and producing merozoites? These possibilities need to be carefully teased out before a nuclear division defect can be assigned as the sole causing factor of the division phenotypes observed.

      These are all very pertinent questions some of which go beyond the scope of this very first characterization of PfSlp function but we are keen to include those in our future analysis. Some of them we can answer while I will try to offer an interpretation for the remaining ones:

      It isn’t fully clear to us what is meant by “Is there fusion, fission”. We will assume that the reviewer refers to the process of karyofission where the nuclear membrane is constricted and fused between the segregating chromatin masses. The field is still lacking a nuclear membrane marker, which makes a direct analysis of this question difficult. Under normal circumstances it has been demonstrated that mitosis is fully closed and the nuclei are completely surrounded by membrane right after division (Klaus et al. 2021). To maybe clarify further we use the term nuclear division to designate the formation of two physically distinct nuclei from one progenitor. We can’t and don’t comment on the integrity of the nuclear membrane and if we had to speculate, it is probably not affected.

      Our new data on DNA dynamics (Fig. 4C) shows a delay in nuclear division while DNA replication seems unaffected in the early division stages. The failure to complete mitosis is also shown by the lack of proper spindle extension. It is possible that PfSlp KD affects final stages of division, but since we treat parasites at ring stages and detect a strong phenotype already at the very first division which occurs only a couple of hours after centrin/Slp recruitment one must assume that this is the defining phenotype, which likely has repercussion on later rounds of division. This makes it virtually impossible to clearly define late phenotypes. We actually have to assume that parasites that proceed to later stages of division do so because PfSlp KD was less efficient.

      Our data directly shows that more than half of our PfSlp KD parasites “fail to properly divide their nucleus” in the first round of mitosis and therefore can’t construe any other way than to designate this as a “nuclear division phenotype”. We purposefully don’t comment on potential later phenotypes and an impact on cytokinesis (budding) but look forward to investigating this in the future.

      Minor Points

      • Line 49: consider "...mechanisms remain unclear" instead of "... mechanisms are remaining unclear"

      We have corrected this sentence as suggested.

      • Readers not familiar with Plasmodium cell division would benefit from having the different stages shown schematically in Figure 1A labeled (ring, merozoite, trophozoite, etc.)

      Good suggestion. We have expanded the labeling in Fig. 1A, but still choose to focus on the division stage, which is relevant for the presented data.

      • Figure 1 legend: Please specify that "centrin" staining is approximated by centrin 3 specifically. Figure 1E is missing a legend in Figure 1's legend.

      Thank you for pointing this out. We have expanded the figure legend accordingly.

      • To ease the reader's interpretation of the data, please consider using a different color for 3D7 +GlcN in the plots shown in Figure 2. It is difficult to distinguish the light magenta from the red color at first glance, especially when the lines are partially overlapping.

      We explored many different color combinations and consulted with several colleagues and concluded that the chosen color combination is most suitable to convey the logic of the strains (while accounting for green-red blindness).

      • Please clarify how long after GlcN addition are phenotypes assessed - ex. Microtubule cumulative length measurements shown in Figure 3.

      We mentioned in the previous Fig. 2 that we add GlcN at the ring stage preceding the schizont stage we analyze but failed to specify that we consistently do so for all experiments. We have added more information in the results (line 221) and to the methods section in more detail.

      • For Figure 3C please provide a separate image for the Slp channel alone. The overlay of the green centrin signal and the magenta from the tubulin staining render a yellow signal. It is difficult to appreciate the level of Slp knockdown in these cells. Moreover, in the inset, the label "zoom in" is on top of the centrin signal in green, precluding the proper assessment/observation of any yellow signal left over.

      Thank you for this remark. We have removed the centrin signal, which is clearly shown in the main panel, from the zoom ins to render the residual PfSlp signal clearly visible.

      • When describing Sf1 in T. gondii, please also cite PMID: 36009009 PMCID: PMC9406199 DOI: 10.3390/biom12081115

      When submitting our manuscript this study was not yet published, but we are happy to now include it in the introduction (line 92).

      The notion of "checkpoint" is mentioned in the introduction and revisited in the discussion. This is a topic under current discussion/evaluation in the field. As mentioned by the authors, demonstration of a checkpoint implies demonstrating reversibility of the putative checkpoint. Though the authors do not make claims about Slp1 or the phenotypes observed activating a specific checkpoint, the manuscript could be further strengthened if the authors showed that the anaphase arrest is reversible upon wash out of GlcN and restored levels of PfSlp expression. I'm including this comment as a "minor points" because it is a only suggestion. I understand that carrying out these experiments is not within the scope of this work. However, if the authors decided to pursue this, it would certainly strengthen the manuscript.

      We highly appreciate the suggestion made by the reviewer and already considered ways to inactivate the putative spindle assembly checkpoint or reverse the phenotype. Wash out of GlcN would theoretically be an option although we are unsure that the kinetics of the subsequent protein synthesis would unfold on a short enough time scale. As suggested by Reviewer 2 we try to remain cautious about directly addressing the checkpoint issue, since e.g. PfSlp due to its localization can’t be a direct component of the checkpoint itself. The mention of “checkpoints” has also been removed from the introduction. We are, however, excited that using our time lapse microscopy protocols there now is a framework to investigate this in more depth in the future.

      Reviewer #3 (Significance (Required)):

      Plasmodium species lack centrioles, and display a divergent mitosis. It is therefore of interest and relevance to understand the peculiarities of the centriolar plaque, as it likely underlies the ability of Plasmodium to upscale its numbers.

      Our molecular understanding of the underpinning factors controlling nuclear and cell division in Plasmodium is limited to a few recent publications. The data presented herein is novel and contributes to the body of work with molecular insight and high resolution microscopy coming on for the malaria field.

      My expertise is in cell division in Apicomplexan parasites

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript reported the CryoEM ring-like structure of the full-length human E3 assembly ligase UBR5, showing its assembly into a tetramer. The authors identified critical determinants for antiparallel homodimer and tetrameric assembly. They further described AKIRIN2 as UBR5 substrate and provided evidences of a preferential interaction and activity of UBR5 towards monoubiquitinated proteins. Based on these findings, they proposed UBR5 as chain-elongating E3 ligase.

      CryoEM data are solid, and the model interpretation of the tetrameric structure provides a precise description of the domain composition of the protein that well fit with biochemical data. Additional experiments are suggested to corroborate few statements of the authors.<br /> We believe they are realistic in terms of time and resource.

      1. Authors should address the importance of tetramerization by mutating SBB2 at the tetramerization interface and comparing the mutant with wild type in mass photometry and ubiquitination assays. In silico analysis of the interaction interfaces (e.g by using PISA software) could be useful to select amino acids to be mutated. The authors suggested a role for oligomerization in catalysis and mutants are needed in order to define the real "functional unit" of the enzyme.
      2. The authors used sucrose gradient sedimentation assay to prove UBR5 and substrate interaction (Fig. 3). Control experiment that showed UBR5 protein sedimentation in presence of GFP only is instead in Supplementary Fig. 3D. Unfortunately, in that panel the signal of UBR5 is not visible. Main figure should be revised showing proper controls of the experiment.
      3. The authors need to better clarify the features of the AKIRIN-UBR5 interaction. According to the data, the enzyme is equally active on both AKIRIN-Ub and Securin-Ub, suggesting a Ub-specific engagement. What would be a correct explanation of these results? Is the UBA domain directly involved in this process? Testing the activity of a UBA-impaired mutant should help to solve this issue.
      4. The authors identified a 25 aa sequence, called Plug loop, preceding the HECT domain. In the structure it is inserted between N and C-lobe subdomains of the HECT and appears to lock the enzyme in an open L-conformation. These structural findings are interesting, but no supported by experimental data. Which is the effect of the Plug loop deletion in a ubiquitination assay? Without further validation the last chapter of the results remains purely speculative and may better fit in the discussion.
      5. The datasets are clearly affected by preferential orientation as showed by the angular distribution and 2D classes (reason why the authors correctly performed data collection with tilt). A comment on this is required in the experimental section. In addition, it is not clear whether the presented maps (Fig 1 and 2) derive from merging of the two datasets or only the model has been built using the two different datasets.
      6. As a general comment, authors should enlarge panels in which structural details are described, highlighting the side chain residues involved in binding interfaces. Fig. 5 and Fig. 6 are particularly small and incomplete. Most of the structural figures miss key labels needed for a proper understanding. E.g. among the others, numbering of the helix composing the armadillo domain.
      7. The overall organization of the figures is quite confusing. Pag. 7 Figure 2C should represent a "box stabilized by three zinc ions mediated by two histidine and seven cysteine residues" according to text citation, but none of these details is highlighted in the corresponding figure. The eye in Figure 1,2,4 does not mean much if a proper box is not linked to the actual site to be seen. In addition, arrows indicating the rotation axis is hard to interpret. Few panels miss the legend. Figure 1A and many other panels miss the reference in the text. More details below.

      Additional points:

      • Mass Photometry data need additional comments and labels. Please comment on the MP concentration used to analyze the samples. Being a dynamic system, you are probably seeing an equilibrium of species at 10 nM in MP. For better completeness of MP figures, labels that includes counts, % of species and sigma should be added to the nice representation of oligomers. Which condition/fraction represent the MP data showed in 1B?
      • If Alphafold models are mentioned and used for model building, it would be nice to provide at least a pLDDTscore and ptm score. Since some details of the AF model are described in the text, an additional superposition of the AF model with the final model derived by EM would be useful to the community.
      • A simple workflow describing the cryoEM data processing that includes how many particles have been used in each step is required, at least in the methods section. The authors need to show the cryoEM 2D classes of the dimer as well.
      • Please add the domain boundaries in Figure 1A and highlight the domains on the alignment included in Supplemental Table 1.
      • Pag. 8 please decide which abbreviation to use, either UBR or Ubr.
      • Page 8, line 192. I found annoying to find the same sentence used by competitors who posted a bioRxiv paper 3 days before the one we are reviewing (doi.org/10.1101/2022.10.31.514604 page 4, line 135).
      • In supp. 1C legend, "high concentration of NaCl" is a bit vague
      • Complementary to Supp Fig 2A, a zoom in of the density map with traced model would be beneficial to show the actual map quality obtained.
      • Pag. 6 lines 133-134, the helix residues involved in homodimerization are cited in the text, but not highlighted in the Figure 1.
      • Figure 1 legend, panels H-I-J description are missing.
      • Figure 3, panel B, meaning of the asterisk is not reported in the figure legend.
      • Figure 4, 5 panels from A to E are cited in the text while figure reported only 4.

      Referees cross-commenting

      I think all the reviews are fairly consistent and agree with the comments raised by my colleagues with the one exception of Point 3 of Reviewer 1. The issue is certainly important yet the experiment suggested is not clear. I personally have troubles designing an informative experimental set-up.

      Significance

      This paper presents the intriguing Cryo-EM structure of the full-length HECT E3 ligase UBR5. As it stands, this work provides evidence of the existence of a tetrameric RING-like conformation that could represent the functional unit of the catalysis. Very little validation of the features identified in the Cryo-EM structure is given, thus the paper remains quite descriptive, but in any case interesting and informative for the ubiquitin field.

      Considering that UBR5 is a quite competitive subject in these days (e.g. at least one additional Cryo-EM structure was posted in BioRxiv, doi.org/10.1101/2022.10.31.514604), I would positively consider this manuscript for publication if the authors reply in full to the issues raised.

      My field of expertise: Ubiquitin regulation and interactions, biochemistry, biophysics and Cryo-EM.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We would like to thank the reviewers for taking time in reviewing and commenting on our paper. The comments were very constructive and conscientious, thanks to their expertise in the field. These comments and the revisions would surely make this paper a better and more robust finding in the field.

      The comments were about clearer explanations, increasing the quality of the data and additional experiments for a stronger conclusion, all of which we are eager to accomplish. Now we have sorted out the problems and planned the experiments required in the revision, as detailed below.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary In this manuscript, Komori et al. examined the role of the LRRK2 substrate and regulator Rab29 in the lysosomal stress response. Briefly, in chloroquine (CQ)-treated HEK293 cells the authors observed an apparent LRRK2-independent increased in Rab29 phosphorylation which was accompanied by translocation of Rab29 to lysosomes. Intriguingly, the authors detected a similar increase in Rab29 phosphorylation when Rab29 was tethered to lysosomes in the absence of CQ treatment. Using mass spectrometry, mutagenesis and a phospho-specific anti-body, the authors mapped the CQ-induced phosphorylation site to S185 and demonstrated its independence from LRRK2. Next, the authors found that PKCa was the kinase responsible for S185 phosphorylation and lysosomal translocation of Rab29. Lastly, the authors showed that in addition to PKCa the lysosomal translocation of Rab29 was also regulated by LRRK2. Overall, Komori and colleagues provide interesting new insights into the phosphorylation-dependent regulation of Rab29. However, there are. Number of technical and conception concerns which should be addressed.

      Major points 1) Figure 1F: the localization of Rab29 to lysosomes is not convincing at all. The authors should either provide more representative image examples or image the cells at a higher resolution. The authors should also confirm the CQ-induced lysosomal localization of Rab29 in a different cell type (e.g., HEK293).

      We will replace Fig 1F pictures with slightly more magnified images with higher resolution. We will also include additional cell types (HEK293, and other cells that are predicted to express endogenous Rab29); Reviewer #2 also raised this point (see Reviewer #2 comment on Significance).

      Moreover, the authors should show that prenylation of Rab29 is required for its CQ-induced phosphorylation.

      We will test the effect of lovastatin, a HMG-CoA reductase inhibitor that causes the depletion of the prenylation precursor geranylgeranyl diphosphate from cells (Binnington et al., Glycobiology 2016, Gomez et al, J Cell Biol 2019), or 3-PEHPC, a GGTase II specific inhibitor that also causes the inhibition of Rab prenylation (Coxon FP et al, Bone 2005).

      2) The rapalog-induced increase in Rab29 phosphorylation in Figure 2D is not convincing since there is at least 2-3-fold more Rab29 in FRB-LAMP1 expressing cells compared to their FRB-FIS1 counterparts. An independent loading control is also missing. This is a key experiment and should be properly controlled and quantified. In addition, can CQ treatment drive 2xFKBP GFP-Rab29 from mitochondria to lysosomes (in the presence of rapalog and FRB-Fis1)?

      We will carefully examine another round of rapalog-induced phosphorylation of Rab29, with an independent loading control such as alpha-tubulin. The immunoblot analysis will be made against the intensity of non-p-Rab29. The response to the latter question was described in the section 4 below.

      3) Figure 4A-C: Are these stable Rab29 expressing cells? If not, the quantification of "the size of largest lysosome in EACH cell" becomes very problematic. This analysis should be repeated with stable Rab29 variant cells in a background lacking endogenous Rab29. Furthermore, the LAMP1 signal is too dim to see any convincing colocalization (e.g., with WT) or the lack thereof (e.g., in the case of S185D).

      The cells shown in Figure 4 are HEK293 cells transiently expressing Rab29, and the issue of quantification was described in the section 3 below. We agree that the signal of LAMP1 was dim, and it turned out that the confocal microscope we used had problems with the sensitivity of the red channels. We will be taking another round of these images with a new confocal microscope.

      Lastly, the authors should corroborate their findings with an ultrastructural analysis since the electron microscopy would definitively be more suitable for this type of measurements.

      We are planning to obtain electron microscopic images, according to this reviewer’s request. We plan to invite an expert in electron microscopy analysis as a co-author.

      4) The lysosomal colocalization of Rab29 in Figure 5C is again not convincing. This analysis needs to be repeated with high resolution imaging.

      Again, we will repeat this experiment with a new confocal microscope, with the hope that it would yield better images.

      5) The authors need to show the level of LRRK2 depletion (Figure 6). Given the role of LRRK2 in driving lysosomal Rab29 translocation, the importance of the LRRK2 independent pS185 for this process remains unclear.

      We will add the level of LRRK2 on its knockdown; we have experienced that LRRK2 knockdown usually occurs with more than 50% efficiency every time. The response to the latter comment was described in the section 3 below.

      6) In general, the authors employ an alternative, biochemical assay (e.g., LysoIP) for the lysosomal translocation of Rab29. This would in particular help to clarify the effect of the Rab29 variants and LRRK2 inhibition.

      We have previously shown that the overexpressed Rab29 (and LRRK2) is enriched in the lysosomal fraction from CQ-treated cells, which was performed using dextran-coated magnetite (Eguchi et al, PNAS 2018). Using the same biochemical method, we will show the enrichment of endogenous Rab29 in the lysosomal fraction.

      Minor points

      9) Figure 2C is lacking the control IF staining for mitochondria (to which 2xFKBP-GFP-Rab29 is assumed be recruited upon co-expression with FRB-FIS1).

      We will stain the cells with MitoTracker to ensure that anchoring away of 2xFKBP-GFP-Rab29 by FRB-Fis1 results in mitochondrial localization.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The data in the manuscript convincingly demonstrates that lysosomal overload by Chloroquine treatment induces Rab29 localisation to the lysosomes and that this membrane association is dependent on PKCalpha-dependent phosphorylation at Ser185.

      We have a number of rather minor comments listed below:

      Figure 2

      The increasing levels of non-phosphorylated Rab29 over the indicated time course of AP21967 treatment in Figure 2B are concerning. First, could you provide an explanation for this clear increase in both non-p-Rab29 and p-Rab29 in the phostag but not the normal gel? Second, could all quantifications of p-Rab29 be made relative to the non-p-Rab29?

      We will try another round of rapalog-induced phosphorylation of Rab29, with an independent loading control. The immunoblot analysis will be made against the intensity of non-p-Rab29. Reviewer #1 raised a similar concern on Figure 2D.

      Figure 5

      To further demonstrate that PKCalpha phosphorylates endogenous Rab29 at Ser185, we recommend reperforming the Go3983/PMA treatment in figure B with the anti-p-Ser185 antibody. It may be sufficient to perform the treatment only at 4 or 8 hours, simply to provide stronger evidence regarding the phosphorylation of endogenous Rab29.

      We will give a try, although the anti-phosphorylated protein antibodies that we tried never worked for phos-tag SDS-PAGE. With the conventional western blot, we will be able to try this experiment.

      It is not clear whether the activity of PMA in the assay is due to inhibition of PKCalpha. Are the effects ablated by PKCalpha KD

      We will test the knockdown of PKCalpha, beta, gamma and delta by siRNAs to further narrow down the effects of PKC-dependent phosphorylation of Rab29.

      Reviewer #2 (Significance (Required)):

      These cell biology findings are important in the field as both Rab29 and LRRK2 are implicated in the pathogenesis of Parkinson disease. The phosphorilation of Ser185 of Rab29 by PKCalpha is novel and contributes to our understanding of Rab29 and LKRR2 regulation. One limitation of the study is that is conducted in only two cell types quite unrelated to the disease, so how general and disease relevant are the findings it is not clear. Most of the data are solid. There are two experiments whose results are difficult to interpret and a few controls missing. Also a few issues with quantifications, all of which is described in details above and will need to be fixed prior to publication. My expertise for this paper is in the cell biology of lysosomal function.

      The issue that only two cell types were analyzed was also raised by reviewer #1, so we will examine additional cell types, especially those that are predicted to express endogenous Rab29. Our responses to other issues raised are described elsewhere. Thank you for these insightful comments.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Figure 4A-C: Are these stable Rab29 expressing cells? If not, the quantification of "the size of largest lysosome in EACH cell" becomes very problematic. This analysis should be repeated with stable Rab29 variant cells in a background lacking endogenous Rab29. (Reviewer #1)

      As described in the section 2 above, the cells shown in Figure 4 are HEK293 cells transiently expressing Rab29. We are sorry that the description “the size of largest lysosome in each cell” was misleading. As we analyzed only cells overexpressing GFP-Rab29 that were marked with GFP fluorescence, we believe that transient expression should not be a problem. To avoid any misunderstandings, we have described in Figure 4 legends that only lysosomes in Rab29-positive cells (and all cells expressing Rab29) were included in the analysis of the largest lysosome of each cell.

      Regarding the effect of endogenous Rab29 in Figure 4 experiments, Reviewer #2 similarly raised the issue on whether Rab29 phosphomimetics are acting as dominant active, preventing lysosomal enlargement. On this point, we have previously reported that knockdown of endogenous Rab29 causes the enhancement of lysosomal enlargement upon CQ treatment (Figure 5I,J of Eguchi et al, PNAS 2018), suggesting that the lysosome-deflating effect by phosphomimetics is a dominant active effect rather than dominant negative suppressing endogenous Rab29. This point is considered significant, and thus has been explained in the results section (page 7, lines 168-171).

      Along similar lines: why not all cells in Figure 5E and Figure 5G show Rab29- and LRRK2-positive structures? How do the authors know which of these phenotypes is the prevalent one? (Reviewer #1)

      As for the ratio of cells with Rab29- and LRRK2-positive structures, it seems reasonable given that different cells have different levels of exposure to lysosomal stress and that the response is transient and does not occur simultaneously. The ratio of these positive cells may also vary depending on the cell culture conditions. Since Rab29- and LRRK2-positive structures are rarely seen in control cells, we think this would be a meaningful phenotype even if only 20-30% of cells show such structures. The result that the ratio of localization changes is not 100% is now noted in the results section explaining Figure 1G (page 4-5, lines 108-110) where the immunocytochemical data first appears.

      Given the role of LRRK2 in driving lysosomal Rab29 translocation, the importance of the LRRK2 independent pS185 for this process remains unclear. (Reviewer #1)

      Our data suggested that Rab29 is stabilized on lysosomes only when LRRK2-mediated phosphorylation and S185 phosphorylation both occur on Rab29 molecule (as shown in Figure 7 scheme), so we believe there is no contradiction. We have now described more clearly about this notion at the end of the results section (page 9, lines 235-236).

      It is not clear what the authors mean by "lysosomal overload stress". Since mature lysosomal incoming pathways such as autophagy or endocytosis are disrupted by CQ, it is difficult to picture an overload. Maybe rephrasing would help to clarify this. (Reviewer #1)

      Chloroquine (CQ) is known as a lysosomotropic agent that accumulates within acidic organelles due to its cationic and amphiphilic nature, causing lysosome overload and osmotic pressure elevation, and this is what we call “lysosomal overload stress”. The well-known effects of CQ to disrupt lysosomal incoming pathways are ultimately caused by the above consequences. Also, we have previously reported that lysosomal recruitment of LRRK2 is caused by CQ but not by bafilomycin A1, the latter being an inducer of lysosomal pH elevation, or by vacuolin-1 that enlarges lysosomes without inducing lysosomal overload/pH elevation (Eguchi et al, PNAS 2018), and further found that not only CQ but also other lysosomotropic agents commonly induced LRRK2 recruitment (Kuwahara et al, Neurobiol Dis 2020). We thus have described the effect of CQ as “overload”. However, it is true that we have not provided a clear explanation for readers, so we have added some notes for lysosomal overload stress in the introduction section (page 3, lines 69-71).

      Which cell type is used for the IF analysis in Figure 2C? This information is in general quite sparse. The authors should clearly state the cell type for each experiment/Figure. (Reviewer #1)

      We have added cell type information that was missing in several places in the manuscript. We are very sorry for the inconveniences. For clarification, HEK293 cells were used in Figure 2C.

      Are the images in figure 1F representative? i.e. does Rab29 always colocalise to such enlarged lysosomes upon CQ treatment and does CQ treatment always drastically alter the cellular distribution of Rab29? (Reviewer #2)

      The images in Figure 1F are representative of when Rab29 is recruited, but it is not seen in all cells, and the ratio of recruitment (~80%) is shown in Figure 1G. Reviewer #1 also asked why Rab29 recruitment is not seen in all cells, and we gave the same answer above. It may be reasonable to speculate that different cells have different levels of exposure to lysosomal stress and that the response is transient and does not occur simultaneously. The ratio of these positive cells may also vary depending on the cell culture conditions. For the readers’ clarity, we have added that the ratio of localization change of Rab29 is not 100% and is comparable to that of LRRK2 previously reported (page 4-5, lines 108-110).

      Considering that the "forced localisation technique" induces a non-physiological colocalization of non-endogenous Rab29 to lysosomes, it may be an overestimation to conclude just from these data that phosphorylation of Rab29 occurs on the lysosomal surface. This is also quite in contrast with the later finding that phosphorylation by PKCalpha promotes lysosome localization of Rab29. It seems more reasonable to conclude that Rab29 can be phosphorylated when localised at the lysosomes (as opposed to other organelles such as mitochondria). If the authors feel strongly about this point they might need to find a less non-physiological assay. (Reviewer #2)

      Yes, it could be an overestimation, and as we do not have better means to conduct a less non-physiological assay, we have modified the description from “occurred on the lysosomal surface” to “could occur on the lysosomal surface” (page 5, line 112 (subtitle) and line 128).

      Regarding the comparison with the later finding that phosphorylation by PKCalpha promotes lysosome localization of Rab29, these data (Figure 2 and 5) could be explained with a single speculation: phosphorylation of Rab29 on lysosomal membranes could retain Rab29 on the membranes for a longer time. It is not easy to decipher which comes first, association with membranes or phosphorylation of Rab29, in a physiological assay, but considering reports that show PKCalpha activation happens on membranes (Prevostel et al., J Cell Sci 2000), at least the data favor our conclusion over the idea of PKCalpha phosphorylating Rab29 in the cytoplasm and then promoting lysosomal localization. This point is now clearly described in the discussion (page 10, lines 248-251).

      It is not clear how the Rab29 phosphomimetics are acting as dominant active preventing lysosomal enlargement. Authors should speculate or repeat the experiments in absence of endogenous Rab29 to clarify the matter. (Reviewer #2)

      A similar concern about the effect of endogenous Rab29 was also raised by Reviewer #1 (see above). We have previously reported that knockdown of endogenous Rab29 causes the enhancement of lysosomal enlargement upon CQ treatment (Figure 5I,J of Eguchi et al, PNAS 2018), suggesting that the lysosome-deflating effect by phosphomimetics is a dominant active effect rather than dominant negative suppressing endogenous Rab29. This point is considered important and thus has been explained in the results section (page 7, lines 168-171).

      Overall, there is some missing information regarding repeats for Western blots, such as those in figure 3C, 3D and 3E. Please add indications about repeats in the figure legend or methods. (Reviewer #2)

      We have added the repeat information to each figure legend where it was missing. We are very sorry for the inconveniences.

      The model in figure 7 however seems to suggest that Rab29 associates to lysosomal membranes independently, and is then stabilised at the membranes by LRRK2 and PKCalpha - a point which is not directly supported by the data. (Reviewer #2)

      As noted earlier, we consider that phosphorylation of Rab29 on lysosomal membranes could retain Rab29 on the membranes for a longer time, given the present data and previous reports that phosphorylation of Rab29 is more likely to happen on the lysosomal membrane than in the cytosol. Also, as inhibition of either of the two phosphorylations ends up in disperse Rab29 localization, we have made this figure as a model of what is plausible right now. This explanation is now added in the discussion (page 10, lines 248-251).

      English proofreading should be improved: "CQ was treated to HEK293" (page 4), "As we assumed that this phosphorylation is independent of LRRK2" as an opening line (page 5) (Reviewer #2)

      Thank you for pointing out these incorrect wordings. They were corrected.

      4. Description of analyses that authors prefer not to carry out

      In addition, can CQ treatment drive 2xFKBP GFP-Rab29 from mitochondria to lysosomes (in the presence of rapalog and FRB-Fis1)? (Reviewer #1)

      We do not think that a comparison between the affinities of FKBP-rapalog-FRB and Rab29-[unknown factor that directs Rab29 to lysosomes] is necessary, as the former has a Kd in the single digit nM range (Banaszynski et al, JACS 2005), whereas the latter (based on estimations from related PPIs) is estimated to be in the μM range, which shows a much weaker affinity than the former (McGrath et al, Small GTPases 2019). Furthermore, even if Rab29 appears to have migrated from mitochondria to lysosomes as a result of this experiment, one cannot rule out the possibility that a small portion of the mitochondrial membrane was incorporated into the lysosomal membrane that was enlarged by CQ treatment.

      Molecular weight markes should be provided for all immunoblot experiments. (Reviewer #1)

      The immunoblot pictures without molecular weight markers in our paper are all Phos-tag SDS-PAGE blot analyses. Phos-tag SDS-PAGE results in band shifts of phosphorylated proteins, and writing in markers would be misleading. Moreover, previous representative studies heavily using Phos-tag (e.g., Kinoshita et al, Proteomics 2011, Ito et al, Biochemical Journal 2016) also did not show the molecular weight markers. Here we performed phos-tag SDS-PAGE analysis only to find differences in the phosphorylation state of Rab proteins.

      The use of the quantification ratio of cells with Rab29-positive lysosomes in figure 1G might be slightly misleading as it does not allow the reader to understand to what extent Rab29 localisation at lysosomes upon CQ treatment. We recommend using a simpler quantification, such as by measuring the average colocalisation of Rab29 and LAMP1 per cell. (Reviewer #2)

      For figure 5D and 5F, As with figure 1G, we recommend using a more straightforward and impartial method of quantification such as simply measuring the colocalisation of Rab29 with LAMP1. (Reviewer #2)

      Popular colocalization analyses using Pearson’s or Mander’s coefficients would be a good choice if the amounts of Rab29 varied greatly between lysosomes. However, this may not apply in this case; the amount of Rab29 or LRRK2 on each lysosome is considered to saturate quickly and a relatively low amount of them may not be detected on immunofluorescence observations, whereas the probability of finding these structures has been shown to exhibit a moderate sigmoid curve (as seen in Figure 1E or 2H of Eguchi et al., PNAS 2018). Therefore, the amount of Rab29 or LRRK2 could be approximated to a Bernoulli distribution in terms of colocalization with lysosomes, and this is the reason why we chose to quantify “the ratio of cells with Rab29-positive lysosomes”.

      We recommend using a more transparent and simple quantification method, such as average size of lysosomes per cell. (Reviewer #2)

      As one can see in the inset of Figure 4B, unenlarged lysosomes are unfortunately too small for the quantification of their size, much less tell two small lysosomes apart in our experimental settings and laboratory resources, so we decided to analyze the largest lysosome in each cell as a representative of the cells to minimize measurement errors. This measurement only includes GFP-Rab29 positive cells, and by comparing against CQ-untreated cells we intended to increase the validity of this analysis. This quantification method was also used in our previous report (Eguchi et al, PNAS 2018).

    1. Author Response

      1) Response to the Editor

      We thank the Editor and the Reviewers for the kind words, the helpful suggestions, and the points of critique, which have all helped us substantially strengthen the manuscript in this revised version. Regarding the 3 general critiques highlighted by the Editor:

      Essential Revisions:

      1) Some hypothesis, and in particular the one that all individuals have the same inter-burst interval distribution should be tested/justified/discussed.

      (a) We have generalized the theory to directly address this point by relaxing the assumption of an identical inter-burst interval for all individuals. In short: the main insights continue to hold and we discuss the nuances in the text.

      (b) Experimentally, the hypothesis that all single fireflies isolated from the group exhibit the same interburst interval (IBI) distribution could not be rigorously tested. The main reason is practical: in order to compare IBI distributions across individuals, we would need to collect a large number of fireflies and track them for long durations, which was not realistic given our experimental setup and the short window of firefly emergence. In addition, external environmental factors might slightly alter behaviors as well, making comparisons even more complex. Thus, due to paucity of field data, we eventually use the assumption that all individual fireflies follow the same IBI distribution.

      2) Comparison between the models and the data must be improved, in particular through a quantification of the differences between distributions and sensitivity analysis of the numerical results.

      (a) Regarding the comparison of the agent-based simulations with experimental data, in Fig. 7, we compare the underlying distributions using the two-sided Kolgomorov-Smirnov statistical test for goodness-of-fit. These appear to us the most straightforward and informative approaches, without over-fitting.

      (b) Regarding sensitivity analysis for the agent-based simulations, for each β value from 0 to 1 we statistically compared simulations to the experimental distributions to find the most well-fitted β.

      (c) Finally, owing to experimental constraints leading to sparsity of available data in characterizing the interburst distribution, we strive to strike a delicate balance between sophisticated statistical tools to compare theoretical and simulation distributions (with unrestricted access to large sample sizes) to the finite samples in the empirical distributions. As such, we think it is the apposite to use the first two moments of respective distributions In Fig. 3 to show the striking similarity of trends.

      3) More discussion of the modeling in connection to past theoretical results and existing literature is necessary to better contextualize the present work and assess its originality.

      We have done this closely following the specific suggestions from reviewers.

      2) Revised terminology: removing usage of “model”

      Since unintended ambiguity may be caused by use of the word “model”, which could refer to either (1) the theoretical framework, principle of emergent periodicity, and attendant analytic calculation , or (2) the agent-based simulation in the computational realization, we have removed all instances of the word “model” from the results presented in the paper, and replaced by the specific meaning (theory or simulation) in each context.

      Similarly, in responding to Reviewers’ comments, we clarify what we understand by their use of the word “model” in each case.

      3) Addressing an error in the agent-based simulation code

      We (OM and OP) have now addressed an inadvertent unit typo in the agent-based simulation code. The discharging time (Td) before the typo was fixed was set to 10000ms. After the fix, the Td value was correctly set to 100ms. This caused very slow discharges, keeping the voltage high until any beta addition was received, resulting in more frequent bursts than we’d actually expect from the model dynamics. This has been fixed, and in our responses to the reviewers, we address the results of this fix by referring to the “unit typo”. We corrected the panels corresponding to agent-based simulation in Figs. 3 and 5 to reflect the new numerical simulation results, as well as the corresponding sections in the text of the paper.

      4) Addressing changes to experimental dataset

      We increased the size of our N=1 dataset (N is number of fireflies) to correctly match what was reported in the original text of 10 samples. Additionally, we have added characterization of the size of the datasets for N=5, 10, 15, and 20 fireflies.

      5) Response to Reviewer 1

      We thank the Reviewer for kind remarks, and the highlights of the strengths of the paper.

      Regarding concerns raised, point by point:

      Reviewer #1 (Public Review):

      Weaknesses:

      The work presented here is an excellent start at understanding the collective behavior of this particular species of firefly. However, the model does not apply to other species in which individual males are intrinsically rhythmic. So the model is less general than it may appear at first.

      We take the Reviewer’s point well. We have added text to the paper to clearly highlight this point.

      The modeling framework is also developed under the very stylized conditions of experiments conducted in a small tent. While that is a natural place to begin, future work should consider the conditions that fireflies encounter in the wild. Swarms that are spread out in space would require a model with a more complicated structure, perhaps with network connectivity and coupling strengths that both change in time as fireflies move around. This is not so much a weakness of the present work as a call to arms for future research.

      We agree with the Reviewer that this is an exciting call to arms for future research!

      Other comments:

      This assumption that all individuals have the same IBI distribution could be directly tested. Has this been done? If not, why not? e.g. Are there difficulties with letting one firefly flash long enough to collect sufficient data to fill out the distribution?

      1. We have generalized the theory to directly address this point by relaxing the assumption that all individuals exhibit the same inter-burst interval distribution. In short: the main insights continue to hold and we discuss the nuances in the text.

      2. Experimentally, hypothesis that all single fireflies isolated from the group exhibit the same interburst interval (IBI) distribution could not be rigorously tested. The main reason is practical: in order to compare IBI distributions across individuals, we would need to collect a large number of fireflies and track them for long durations, which was not realistic given our experimental setup and the short window of firefly emergence. In addition, external environmental factors might slightly alter behaviors as well, making comparisons even more complex. Thus, due to paucity of field data, we eventually use the assumption that all individual fireflies follow the same IBI distribution.

      The derivation given in 6.2.1 is clearer than the approach taken here, which unnecessarily introduces Q, q, and c and then never uses them again.

      We agree with the Reviewer and have accordingly revised the manuscript.

      We have also implemented the suggested edits in the marked up manuscript. We are grateful for the detailed feedback, which helped us substantially extend results, and improve presentation and clarity.

      6) Response to Reviewer 2

      We thank the Reviewer for their thorough feedback. We provide point by point responses below.

      Reviewer #2 (Public Review):

      1) The biological relevance of certain hypotheses is insufficiently discussed. This is important because if the observed behaviour is a universal one, alternative models may explain it as well.

      We thank the reviewer for raising this point. The main hypotheses underlying our models are: 1) individual fireflies in isolation flash at random intervals; 2) these random intervals are drawn from the empirical distribution reported (implicitly: all fireflies follow the same distribution); 3) once a firefly flashes, it triggers all others. Hypothesis 1) is directly supported by the data presented. Hypothesis 2) is comprehensively addressed in the revised manuscript, as discussed previously. Hypothesis 3) is central to the proposed principle, and enables intrinsically non-oscillating individuals to oscillate periodically when in a group. The resulting phenomenon has been compared to experimental data and extensively discussed in the manuscript. Further, we have also simulated the effect of changing the strength of coupling between fireflies based on this hypothesis in the revised section on agent-based simulation.

      2) Comparison between the models and the data could be improved, in particular through quantification of the differences between distributions and sensitivity analysis of the numerical results.

      1. Regarding the comparison of the agent-based simulations with experimental data, in Fig. 7, we compare the underlying distributions using the two-sided Kolgomorov-Smirnov statistical test for goodness-of fit. These appear to us the most straightforward and informative approaches, without over-fitting.

      2. Regarding sensitivity analysis for the agent-based simulations, for each β value from 0 to 1 we statistically compared simulations to the experimental distributions to find the most well-fitted β.

      3. Finally, owing to experimental constraints leading to sparsity of available data in characterizing the interburst distribution, we strive to strike a delicate balance between sophisticated statistical tools to compare theoretical and simulation distributions (with unrestricted access to large sample sizes) to the finite samples in the empirical distributions. As such, we think it is the apposite to use the first two moments of respective distributions In Fig. 3 to show the striking similarity of trends.

      Reviewer #2 (Recommendations for the authors):

      A. The assumption that single-firefly spikes obey the same distribution (there is no individual variation in the frequency, or even of the composing number of bursts, of the flash) does not seem to have been verified on the data, that are instead pulled together in one single distribution (Fig. 1D). Moreover, the main feature of such distribution is that it has a minimum at 12 secs (discarding the faster bursts that are not considered in the model) and that it is sufficiently skewed so that it takes a minimal coupling for collective synchrony to emerge. I think that the agreement between the distributions for different N would be more meaningfully discussed having previous work as a reference, whereas now this is relegated to the discussion, so that it is unclear how much of the theoretical results are novel and/or unexpected. Quantification of the distance between distributions would also be interesting: it looks like the two models (analytical and simulations) disagree more among themselves than with the data.

      Regarding the hypothesis that all individual fireflies exhibit the same interflash interval, please see our response to Main Point 1. Regarding comparing the analytical theory and numerical simulation analysis, Figs. 3 and 5 have been revised after a unit typo was found in the code (see Section 2). Following the update, the analytical and numerical models agree in (1) the location of the peak in Fig. 3 for all N values, and (2) the peak approaches the minimum of the input distribution as N increases.

      B. If I understand correctly, simulations are introduced as a way to get a dependence on the intensity of the coupling (\beta). There are several issues here. First, I do not see how the coupling constant could change in the present experimental setup, where all fireflies presumably see each other (different from when there is vegetation). Second, looking at Fig. 3, the critical coupling strength appears to depend very weakly from N, and it is not clear how the 'detailed comparison' that leads to the fit is realized (in fact, the fitted \betas look larger that those at which the transition occurs in Fig. 3A). I think a sensitivity analysis is needed in order to understand how do results change when \beta is changed, and also what is the effect of the natural Tb distribution (Fig. 2 F). Results of the simulations might be clearer if instead of using the envelope of the experimental results, the authors tried to fit it to a standard distribution (ex. Poisson) so that it can be regularized. This should allow to trace with higher resolution the boundary between asynchronous and synchronous firing.

      We have included agent-based numerical simulations as a way to provide a concrete instantiation of the theory principle and analytical results in the preceding section. While the analytic theory results are fitting parameters free, in the agent-based simulations, we introduce an additional fitting parameter, to see what happens when we relax one hypothesis of the analytical theory: the instantaneous triggering of all fireflies upon an initial flasher. Additionally, the agent-based simulations pave the way for future work, allowing for convenient exploration of the connectivity between individuals and analysis of the behavior of individual fireflies. in this context, please note that Fig. 5 has been corrected (see above), leading to a stronger co-dependence of β and N. In addition to the envelopes, we also report the trends in the first empirical moments (mean and STD) for comparison and tracking of the transition to synchrony.

      C. More care should be put in explaining what are the initial conditions hypothesized for the different models. For instance, the results of paragraph 3 are understandable if all fireflies are initialized just after firing, something that is only learnt at the end of the paragraph. I also wonder whether initial conditions may be involved with T_bs in the low-coupling region of Fig. 3A not being uniformly distributed, as I would have expected for a desynchronized population.

      We have clarified that, indeed, all fireflies are re-initialized after firing. The initial conditions then become a new random vector of interflash intervals. Importantly, we found after receiving the reviews that, due to inconsistent units in our numerical simulation code, Fig. 5 was incorrect. With proper units, the new results show a much more widespread distribution at low coupling, as expected by the Reviewer.

      D. I found that equations were hard to understand either because one of the variables was not precisely (or at all) defined, or because some information was missing: Eq. 1: q is not defined Eq. 2: explain what it means: the prob. that others have not flashed times that that one flashes. Also, say explicitly what is the 'corresponding PDF. Eq. 3: the equation for \epsilon(t) to which this is coupled is missing Why introduce \beta_{i,j} and T_bi if they are then taken independent of the indexes? Definitions of collective and group burst interval should be provided. It would be clearer if t_b0 was defined in the first paragraph of the results, so as to clarify as well its relation with T_b. Define T^i_b in the caption of Fig. 3 (they are defined later than the figure is first discussed). The definition of 'the vertical axis label' (maybe find a word for that...) is pretty cumbersome. I could imagine that other definitions would allow the lines in Fig. 3 E to converge to the same line for large betas, which would make more sense, considering that in the strong coupling limit I see no reason why the collective spiking should not be the same for different N (the analytical model could help here).

      Thank you for these comments; we have incorporated these and related changes.

      E. I think that the author's reading of the two 'dynamical quorum sensing' papers they cite is incorrect: De Monte et al. was not about the Kuramoto model, but the same limit cycle oscillators as in Strogatz; Taylor et al. considers excitable systems, potentially closer to noisy integrate-and-fire, at least in that they do not have self-sustained oscillations. Both papers show that oscillations appear above a certain density threshold, and that the frequency of oscillations increases with density, as found in this work. A more accurate link to previous publications in the field of synchronization theory, including the models by Kurths and colleagues for fireflies, would be useful both in the introduction and in the discussion, and would help the reader to position this work and appreciate its original contributions.

      1. Thank you for pointing out an inaccuracy in our literature citations regarding synchronization. We have now made corrections to address this point.

      2. While we take the Reviewer’s points well, our theory framework (“model”), building off of the principle of emergent periodicity we propose here, is fundamentally different in the nature of individuals from extant “models”. The reference in question has individuals as oscillators, and the fastest frequency is the frequency of the fastest individual oscillator. In contrast, in our work there is no fastest individual oscillator and the “fastest frequency” has a completely different meaning, since individuals do not have a particular frequency associated with them. In this sense, our work is not inspired by theirs. That said, we have included citations as suggested by the Reviewer.

      F. The authors say that part of the data is unpublished. I guess they mean that the whole data set will be published with this manuscript. I think the formulation is ambiguous.

      Thank you for this comment. We have now clarified that the data will indeed be published with the manuscript.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Basier and Nurse revisit the fundamental question of how the rates of RNA and protein synthesis scale with cell size. The strong null hypothesis is that synthesis scales linearly with cell size: cells that are twice as big should make stuff twice as fast. This hypothesis has been tested many times, in many systems, using many approaches over the past century and, in general, the null hypothesis has been sustained. However, there have been many examples of evidence for more complicated synthetic patterns. Whether these results indicate that biosynthesis rates vary across the cell cycle, or in response to other factors, in addition to increasing with cell size, or whether observed deviations from the predictions of the null hypothesis has been due to artifacts of cell synchronization and labeling, is thus an open, interesting and, because biosynthesis rates have critical implications in cellular function and metabolic robustness, important question.

      The authors address the question in fission yeast using metabolic pulse labeling with a ribonucleoside or amino acid analog in asynchronous cells and single cell analysis to directly compare incorporation levels with cell size and cell cycle stage. The experiments are well designed, well executed and well controlled. Furthermore, the data is well presented and appropriately interpreted. In particular, the presentation of the size-v.-label data in Figures 2A and D, with the averages and variances in 2B and E and the normalized data in 2C and F are easy understand and interpret. It is thus notable that the size-v.-label data for the longer (cdc22-22) cells is omitted in favor of just the average (2H,J) and normalized (2I,K) data. This size-v.-label data should be added to Figure 2.

      We added two panels to the Figure supplementary 2 showing the requested data, the size-v.-global translation (S2E) and size-v.-global transcription (S2F).

      The authors should also explicitly state how they chose 15 µm as the inflection point in 2H; 16-17 µm seems like it would give a horizontal plateau, which would better fit their saturation explanation.

      This comment relates to the second comment of reviewer 4, see below for the detailed answer.

      The authors measure DNA content with a DNA-binding dye, the signal from which should linearly scale with DNA content. However, instead of reporting and analyzing total signal from the DNA-binding dye (or better yet, total signal in the nucleus, which they could do, having segmented the nucleus in their images), they report max signal. Using max signal is complicated because, as cells and thus nuclei increase in size the concentration of DNA and thus the max (but not total) DNA-binding-dye signal in in the nucleus decreases, requiring two-dimensional dye/size analysis (such as shown in Figure 3B) to distinguish G1 and G2 cells. The authors should use the more straight forward measure of total nuclear DNA-binding dye signal, or explicitly explain why they can't or prefer not to do so.

      The total fluorescence intensity signal of the DNA-binding dye is noisy because we had to use a low concentration of the dye. This was necessary as it allows a clearer distinction between cells with a one 1C DNA content and cells with a 2C DNA content that higher concentrations did not. The maximum signal per cell-v.-cell length produces distinct populations of cells in G1, or G2/M phase (see Figure 3H, and Figure 4B), and populations identified in this way have the distributions of total fluorescence intensity expected from cells in G1 and G2 or M phase (see Figure 3I and Figure S4D). We added one extra panel to Supplementary Figure 4 showing the distributions of the total fluorescence intensity signal of the DNA-binding dye for the G1, S, and G2 or M populations (S4D) for comparison.

      The authors should state in figure legends the strain numbers used for all experiments.

      We have modified all the figure legends to include the strain numbers.

      They should also cite the source of all the constituent parts (e.g. hENT1, hsvTK, EGFP-pcn1, and synCut3-mCherry) of their strains.

      The missing reference for the source of hENT1 and hsvTK (Sivakumar et al. 2004) has been added, the references for EGFP-pcn1 (Meister et al., 2003) and synCut3-mCherry (Patterson et al., 2021) were already present.

      CROSS-CONSULTATION COMMENTS My colleagues make constructive points. I agree with all of them, although I am less concerned about the use of cdc2-22 and CCP∆ to alter cell length and cell cycle distribution. Although these mutations alter CDK specific activity (and thus length and distribution) and could alter specific patterns of translation, the fact that they double at normal rates makes it seem unlikely that they could be significantly changing bulk synthesis rates.

      Reviewer #1 (Significance (Required)):

      As noted above, this work addresses an open, interesting and important question. Moreover it provides useful data in a specific system and a useful example of a general experimental approach to the problem. However, it does not settle the question of how biosynthesis scales with size, even in the specific case of fission yeast. In particular, it shows that protein synthesis plateaus just above normal cell size, whereas RNA synthesis scales up to twice normal cell size. This observation is striking, because there is no obvious mechanism that would (and the authors offer no suggestion of how to) explain how protein synthesis could be limited if RNA synthesis is not. Therefore, the strength of the paper is that it identifies an intriguing phenomena and its limitation is that it does not provide any testable hypotheses to explain that phenomena.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: Basier and Nurse investigate how "cell size, the amount of DNA, and cell cycle events affect the global cellular production of proteins and RNA molecules". Both transcription and translation, driving the production of biomass, have been shown to increase as a function of cell size in various systems. However, whilst cell size generally correlates with cell cycle progression there are inconsistent results in the literature if global cellular translation and transcription is affected by cell cycle state. They argue that this might be due to perturbations induced by different synchronisation methods used in the various studies.

      Therefore, in this study, to avoid potential perturbation from synchronisation methods, they developed a system that allows to assay unperturbed exponentially growing populations of fission yeast cells. The assay is based on single-(fixed)cell measurements of cell size, cell cycle stage, and the levels of global cellular translation and transcription. This allows them to correlate cell cycle state, cell size and global cellular translation and transcription levels at the single cell level under unperturbed conditions.

      Their results show that translation and transcription steadily increase with cell size, but that the rate of translation, but not transcription, becomes rapidly restricted when cells become larger than wild type dividing cells. This suggests that it is unlikely that the synthesis of RNA is the limiting factor for translation rate in large cells. In addition, their data indicates that translation scales with size, but that the rate increases faster at late S-phase/early G2 and even faster in early in mitosis before decreasing in mitosis and return to interphase. Transcription, on the other hand, increases as a combination of size and the amount of DNA. Overall, this suggests that cell cycle control affects global cellular translation and transcription, which is in line with some studies, but not others. As far as I can tell the assays and data analysis are robust and the data supports the general conclusions.

      Major comments I agree that inconsistent results published on this topic might be due to perturbations induced by different synchronisation methods used in the various studies. However, but much less emphasised in the paper, it also likely depends on the model system used. For example, in budding yeast there is strong evidence for gene expression homeostasis, i.e. gene expression increases as a function of size, independent of gene copy number. Do the authors believe this is a budding yeast specific phenomenon or is this a consequence of specific synchronisation methods used in budding yeast?

      Gene expression homeostasis has been suggested for budding yeast, but in contrast recent work in budding yeast also suggests that gene expression increases with the genome copy number and therefore the gene copy number in addition to cell size (Swaffer et al., 2022 – currently on bioRxiv). The differences that have been reported might be due to perturbations such as synchronisation methods as well as differences between yeast species.

      Whether growth rate increases linearly or exponentially has been the topic of decade long debates. Their data indicates that the translation rate increases faster at late S-phase/early G2 and even faster early in mitosis before decreasing in mitosis and return to interphase, 'resetting' the growth rate. This suggest an exponential, rather than linear, increase in biomass (i.e. growth rate?), but this is not explicitly pointed out. It would be good to get the authors opinion on this in the discussion.

      Assuming that protein degradation remains constant throughout growth, the increase of translation with cell size suggests that the growth rate increases as cells grow in size, possibly exponentially. In addition, our data showing that the translation rate increases from G1 to G2 for the same cell size, suggests that for cells of a given size the growth rate is faster in G2 than in G1. Thus, growth could be basically exponential but the speed of increase accelerates at the transition between S and G2, and early in mitosis, slowing down later in mitosis. We added the following sentence to the discussion section “Global transcription and translation increase with cell size possibly exponentially, but the changes in global translation during transitions through cell cycle stages suggest that the speed of growth is modulated by cell cycle progression, increasing between S and G2, and early in mitosis, and slowing down later in mitosis.”.

      The authors state that their approach has allowed them to determine how cellular changes are arising from progression through the cell cycle. However, they use fixed cells, rather than live cell imaging, so can't claim to have established changes during cell cycle progression, but only a correlation with cell cycle state/phase. Whilst this could be used as a proxy for progression it should be clearly stated in the abstract and elsewhere to prevent confusion. I for one, based on the abstract, thought they developed a live cell imaging strategy to look at this.

      We have modified the abstract to reflect the fact that the cells were fixed in our assays (line 36).

      In reference to the Stonyte, et al., study, in addition to different conditions (temperature shift and isoleucine medium), why do the authors think their findings are different? Is it the lack of correlation to cell size in the Stonyte paper or something else? For example, would using different growth conditions (as in the Stonyte paper). where fission yeast cells spend more time in G1, be used instead of the CCP mutant? Can the authors exclude that the lack of G1-S/cyclin-CDKs is not at the basis of a lower rate of translation in G1 and S phase cells? Either these experiments should be carried out or this should be discussed in more detail.

      In the study carried out by Stonyte et al., the relative translation rate per cell (a measurement related to our measurement of translation normalised per unit of length) of wild type fission yeast cells grown asynchronously in isoleucine minimal medium is constant between the G1 and the S phase cell populations, and is higher in the G2 population compared to the S phase population (Figure 2D of Stonyte et al., 2018). This is consistent with the lack of increase that we observe for a given cell size from G1 to G2, and the increase we observe from S to G2 in Figure 3K. In the same figure, Stonyte at al., find no difference between the G2 and the M-G1 populations but are not able to distinguish cells at different stages of mitosis or in early G1. Our study suggests that translation increases early in mitosis before decreasing after anaphase A, thus in the Stonyte et al study, pooling all stages of mitosis and early G1 cells might mask the dynamics of what is happening during mitosis. The lack of G1-S/cyclin-CDK could be the basis for the lower rate of translation in G1 and S-phase. We discuss this further in a reply to the first question of the significance part of reviewer 2 and have added a section to the discussion of the paper (see below for details).

      If the signal to noise signal is reduced by 20 minutes EU incubation (rather than 10 minutes) why wasn't it used in all experiments?

      To measure RNA production as closed as possible to the instantaneous rate of RNA synthesis, we sought to use the shortest pulse possible. We did this because the half-lives of some RNA species are short, in particular, the half-life of the pool of mRNA has been reported to be around 13.1 minutes in budding yeast (Chan et al., 2017). In longer pulses, some RNA molecules that have been synthesised after addition of EU will therefore have been degraded before cells are fixed, producing a measurement that underestimates the rate of RNA synthesis. We chose to incubate cells for 10 minutes as we estimated it to be the shortest time generating a signal to noise ration above 1 (Figure 1F). The one exception to this was with the pulsing of the CCP∆ EGFP-pcn1 hENT1 hsvTK mutant cells which incorporates less EU during the same time frame so we incubated this strain for 20 minutes to generate enough signal to be quantifiable (see line 237, “we assayed CCP∆ EGFP-pcn1 hENT1 hsvTK cells for global transcription using a 20-minute EU incubation to compensate for their lower signal production”).

      And the conclusion that the increase in transcription is not showing any discontinuities, are they referring to the triplicates in the supplementary figure 2?

      We think there might be a misunderstanding. We conclude that the increase in transcription shows no discontinuity because the median transcription increases steadily with cell length in Figure 2E. We have added “since global transcription increases smoothly with cell length (Figure 2E)” to clarify the text.

      Minor comments Lines 168-169: should be Figure 2F, S2C, S2D rather than Figure 2C, S2A, S2B.

      The figure numbers have been corrected in the manuscript.

      Line 179: doubling time instead of growth rate?

      The mention of “growth rate” has been changed to “doubling time” in the manuscript.

      Lines 184-186: There is an overall trend of slight decrease in transcription per length in cdc25-22 cells but a slight increase in wild-type cells. How does this differ to wild-type cells? Are these non-significant changes and could these be attributed to the low signal to noise ratio?

      These changes may be due to the low signal to noise ratio in the cdc25-22 transcription assay. We have added “The decrease with cell length in transcription that we observe in the cdc25-22 hENT1 hsvTK (Figure 2K) cells but not in the hENT1 hsvTK cells (Figure 2F) may be due to the low signal to noise ratio”.

      There is no cell size that is specific to S phase, it falls within the range of G1 and G2 cells. Since this strain has a variable onset of S phase, the phase durations could differ. Therefore, could time spent in each phase affect the translation rate (live cell imaging, i.e. progression, could address this, but not fix cell correlation)?

      It is possible that the phase duration of G1 and G2 could differ from one cell to another. There is no evidence that the length of S-phase varies in these cells. It would be interesting to measure how the phase length influences translation, but our techniques do not allow for the measurement of global translation in living cells.

      The data reflects translation/transcription in single cells at a specific cell cycle phase, not during the transition between cell cycle phases. Therefore, it would be more appropriate to only use G1, S, G2 and M rather than S/G2 transition or early G2.

      Our data represents cells at fixed cell cycle phases and we do not monitor the transition themselves directly. However, the discontinuity in signal for cells of the same size in consecutive stages of the cell cycle (for instance the discontinuity in translation between S and G2 cells of the same size in Figure 3J) is indicative that the transition between the two cell cycle phases is a consequence of a rate change.

      In figure 4C, there is a decrease in global transcription after 13 um (black line showing all cells), which they don't see in cdc25-22 mutants. Their conclusion that global transcription is constantly increasing with cell size is based on cdc-25 cells but the experiment in CCP mutant cells shows a decrease in the median of transcription. Are there replicates for these experiments as in figure 2 and supplementary figure S2? Maybe an average trend can be plotted too? Apart from the first set of experiments (figure 2 and supplementary figure 2), they don't show replicates for other strains. Maybe they can include another graph as in figure 3D and 3K of average replicate values?

      The apparent decrease in transcription on Figure 4C in long cells is seen in only one length bin (13.5 µm), which has a smaller number of cells compared to the ones directly before (89 cells, compared to 216 cells for the 12.5 µm bin and 316 cells for the 11.5 µm bin). This might have resulted in a higher variability in the measurement of the population median. We do not see the same decrease at 13.5 µm in the wild type (Fig 5G), the cdc25-22 mutant (Fig 2J), or the CCP∆ strain (Fig S4B) so on balance we favour the interpretation that the decrease observed in the longer length bin of Figure 3J is due to variability caused by the lower number of cells in that bin.

      CROSS-CONSULTATION COMMENTS I believe that since the whole premise of this study is that by using unperturbed conditions their findings are different from previously published work they should either clearly point out that this difference might be due to using mutations affecting CDK activity or carry out an experiment in media that induces a G1 population. CDK has been strongly implicated in promoting translation. Using a strain that lacks the G1 and S cyclin CDKs or compromised M-CDK is therefore likely to have an effect on translation, which could be at the basis of the increase in translation during the G2 (and S) phase of the cell cycle.

      This is addressed in the next comment.

      Reviewer #2 (Significance (Required)):

      As far as I can tell the assays and data analysis are robust and the data supports the general conclusions. However, whilst the cells are assessed in unperturbed conditions, they do use CDK mutants and the cdc25ts mutant to establsih gene expression during the different phases of the cell cycle, which could affect translation/transcription rates. This should either be clearly pointed out or complemented with an experiment where WT cells are grown in conditions that induces distinct G1-S-G2 populations of cells.

      The cell cycle stage and CDK activity are intrinsically linked. CDK activity defines the cell cycle stage so that an increase in CDK activity through the cell cycle is responsible for cells progressing through G1, S, G2, and mitosis (Coudreuse and Nurse, 2010, Swaffer et al., 2016). Nutritional conditions that induce a G1 also rely on repression of CDK activity through increased production of the Rum1 inhibitor (Rubio et al., 2018) to generate a G1 population. Therefore, uncoupling CDK activity from the cell cycle would not be possible in an unperturbed cell population. We have added the following paragraph to the discussion to address the comment “The cell cycle stage of a cell and the activity of its CDK molecules are intrinsically linked since CDK activity defines the cell cycle stage of a cell. CDK activity increases through the cell cycle and is responsible for cells progressing through G1, S, G2, and mitosis [44,53] so that an unperturbed asynchronous population of cells in G1 is achieved by a low CDK activity. Thus our results reflect changes happening through the cell cycle as the CDK regulation network undergoes modifications, and an unperturbed cell cycle therefore cannot be uncoupled from CDK activity.”.

      Overall, the work presented suggests that cell cycle control affects global cellular translation and transcription, which is in line with some studies, but not others. Whilst the study falls short of testing/establishing the (potential) mechanisms involved, these are important findings, which can be used to guide new studies into how the production of biomass is controlled as cells proceed through the cell cycle.

      The cell size field, which is considerable and growing, will be interested in this work.

      I have expertise in cell cycle control and genome stability, with a focus on the G1-to-S transition and cell cycle checkpoints during interphase.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary Basier and Nurse use fission yeast as a model system to investigate how transcription and translation are coupled to cell-cycle progression. They use metabolic labeling in exponentially growing cells and analyze single cells by microscopy. They find that translation scales with size and increases at S/G2 and early mitosis while transcription increases with both size and the amount of DNA. They suggest that changes in CDK activity regulate changes in global translation rates.

      Major comments: 1) The paper addresses a much-disputed question in the field. The approach makes the most of the fission-yeast model system and the experiments are beautifully performed. The conclusions are well supported by the data. The experiments are replicated adequately and the statistical analyses are appropriate.

      2) The use of cdc25 and in particular the cig1Δ cig2Δ puc1Δ mutants to manipulate cell size is not without challenges when monitoring translation rates. A number of reports in different model organisms suggest that CDK activity can regulate translation. Work from the Nurse lab identified translation factors as CDK substrates (Swaffer et al, 2016), RNApolIII activity and thus tRNA levels are regulated in the cell cycle by CDK in budding yeast (Herrera et al, 2018), phosphorylation of the ribosomal protein RPL12 by CDK1 is required for translation of at least some proteins in mitosis in human cells (Imami et al, 2018), as is phosphorylation of DENR (Clemm von Hohenberg et al, 2022). The authors also suggest that changes in CDK activity might be responsible for the observed changes in global translation rates. It is important to consider whether using mutants impinging on CDK activity might lead to under- or overestimating cell-cycle dependent translation. The authors should either discuss this issue and tune down the hypothesis that CDK activity regulates changes in global translation rates, or use another approach to address the issue. One could use a replication mutant such as cdc17 or cdc20 to alter cell size without interfering with CDK activity. These experiments would strengthen the conclusions and might support the idea that CDK activity regulates changes in global translation rates. References Clemm von Hohenberg K, Müller S, Schleich S, Meister M, Bohlen J, Hofmann TG, Teleman AA (2022) Cyclin B/CDK1 and Cyclin A/CDK2 phosphorylate DENR to promote mitotic protein translation and faithful cell division. Nat Commun 13: 668 Herrera MC, Chymkowitch P, Robertson JM, Eriksson J, Bøe SO, Alseth I, Enserink JM (2018) Cdk1 gates cell cycle-dependent tRNA synthesis by regulating RNA polymerase III activity. Nucleic Acids Res 46: 11698-11711 Imami K, Milek M, Bogdanow B, Yasuda T, Kastelic N, Zauber H, Ishihama Y, Landthaler M, Selbach M (2018) Phosphorylation of the Ribosomal Protein RPL12/uL11 Affects Translation during Mitosis. Mol Cell 72: 84-98 e89 Swaffer MP, Jones AW, Flynn HR, Snijders AP, Nurse P (2016) CDK Substrate Phosphorylation and Ordering the Cell Cycle. Cell 167: 1750-1761 e1716

      As discussed above in the reply to reviewer 2, the cell cycle stage and CDK activity are intrinsically linked, CDK activity defines the cell cycle stage so that an increase in CDK activity through the cell cycle is responsible for cells progressing through G1, S, G2, and mitosis (Coudreuse and Nurse, 2010, Swaffer et al., 2016). Therefore, uncoupling CDK activity from the cell cycle is not possible in an unperturbed population. Temperature sensitive mutants of cdc20 (Ramirez et al., 2015, Win et al., 2002) and cdc17 (Jimenez et al., 1992) cause loss of viability when cells are shifted to the restrictive temperature so it cannot be assumed that they are in unperturbed conditions which makes results hard to interpret. It should be noted as far as possible in these experiments we have tried to avoid perturbations. In addition, the fraction of cells permeabilised in our assay decreases significantly when cells are grown above 30 °C, making it difficult to assay such temperature shifts.

      Minor comments: 1) The figures are beautifully presented, easy to understand and the cartoons present the experimental strategies very clearly.

      2) A major feature of the approach is that translation and transcription are monitored in exponentially growing cells, which are not exposed to any stress such as cell-cycle synchronization. However, one could argue that the analogues used for labeling impose some kind of stress, even if this is not very likely at the labeling times employed. A simple control experiment where the growth rates of labeled and unlabeled cells are compared would strengthen the claim that these are indeed happily growing cells.

      It is possible that incubating cells with the analogues could impose some kind of stress on the cell although that could be said about almost any experimental procedure. We have added two supplementary figures with the suggested experiments, showing that incubating cells with EU has little or no impact on their doubling time (we see at most a 2.4 % increase in doubling time in hENT1 hsvTK cells incubated with 20 µM EU, Figure S1I) and that incubating cells with HPG has little impact on their doubling time (we see a 8.6 % increase in doubling time in wild type cells incubated with 10 µM HPG, Figure S1H). Considering the small impact of analogue incubation on the doubling time of the population, and the fact that cells are only exposed to the analogue for a short time in our assays (compared to continuous growth in the presence of the analogue in the growth curves presented in Figure S1H and I), we conclude that the stress imposed is low.

      3) Please comment why the length of the EU labeling differs from figure to figure. In fig 2C, S2C and S2D the labeling on the y axes states 10 min, in Fig 4C it says 20 min.

      Please refer to the reply to reviewer 2 on the same topic.

      4) Lines 118-119 "The pulse signal was five times the background signal." Figure S2A,B show large variation in signal intensity after 5 min labelling. It is not clear how the pulse signal was estimated to be five times the background signal.

      We have added two panels for the supplementary figure 2 showing how the signal to noise ratio was computed for the HPG assay after 5 minutes of incubation (Figure S2G) and for the EU assay after 10 minutes of incubation (Figure S2H).

      5) In Fig S4C transcription is up by ca 60 % from G1 to G2, while in Fig 4D transcription is up by ca 25-30%, also from G1 to G2. The only difference I can see is the use of PCNA-GFP. Please comment what the reason might be.

      In Figure 4D, transcription is up 33 % from G1 to G2 and in Figure S4C, transcription is up 62 % from the 1C to the 2C population. It is possible that the EGFP-pcn1 strain might have a small growth defect which could possibly explain its lower signal production, the slower growth rate might mean that the concentration of RNA polymerase could be lower in this strain and the dynamic equilibrium model predicts that this would results in a smaller increase from G1 to G2 compared to cells with a higher concentration of RNA polymerase. But obviously this is speculative.

      6) Fig 1 B images of unlabeled control cells should also be shown.

      We have added 2 panels to the supplementary figure 1 showing the background controls in which cells are fixed immediately after addition of the analogue for the HPG assay (Figure S1F) and for the EU assay (Figure S1G).

      7) Lines 156 "to investigate how global cellular translation and transcription are affected by cell size, and by progression through the cell cycle" should be amended. Throughout the description of data in figure 2 binucleated and septated cells were excluded from the analyses, meaning that the data only represent cells in G2. The text should make this clear.

      "to investigate how global cellular translation and transcription are affected by cell size, and by progression through the cell cycle" has been changed to "to investigate how global cellular translation and transcription are affected by cell size and by progression through G2" to reflect the fact that binucleated and septated cells are excluded from the analysis on this figure.

      8) Lines 241-243 "the S-phase subpopulation was found to have an intermediary global transcription value between the G1 and G2/M subpopulations of around 20-25 %." And Lines 310-313 "the rate of transcription is increased in cells undergoing S-phase by 20 % and is 35 % higher in G2 cells which have completed S-phase, indicating that DNA content is limiting the global rate of transcription." It is unclear what the percentage values refer to and which populations exactly are being compared.

      "the S-phase subpopulation was found to have an intermediary global transcription value between the G1 and G2/M subpopulations of around 20-25 %" has been changed to “the S-phase subpopulation was found to have an intermediary global transcription value between the G1 and G2/M subpopulations with an increase of around 20-25 % compared to the G1 subpopulation” and “the rate of transcription is increased in cells undergoing S-phase by 20 % and is 35 % higher in G2 cells which have completed S-phase, indicating that DNA content is limiting the global rate of transcription” has been changed to “the rate of transcription is increased in cells undergoing S-phase by 20 % compared to G1 cells and is 35 % higher in G2 cells which have completed S-phase compared to G1 cells, indicating that DNA content is limiting the global rate of transcription”. These changes hopefully will clarify what populations comparisons the percentage values are referring to.

      9) Line 85 "Asynchronous cultures ... have not detected" rephrase; change detected to displayed or similar.

      “detected” has been changed to “displayed”

      10) Line 243 Figure 4J, K should read Figure 4C, D.

      “Figure 4j, K” has been changed to “Figure 4D, C”

      CROSS-CONSULTATION COMMENTS

      I also agree with the comments made by the colleagues. As for the use of the cyclin and cdc25 mutants: I agree with Reviewer #1 that it is unlikely that bulk synthesis rates are conisedarably different, since these strains are going at more or lass normal rates. However, I also agree with reviewer #2 that these mutants cannot be considered as unperturbed conditions. I suspect subtle regulation and in particular cell-cycle dependent regulation might well be lost. At the very least the focus of the interpretation should be on translation/transcription as a function of size, rather than in terms of cell-cycle regulation.

      Reviewer #3 (Significance (Required)):

      Basier and Nurse address a long-standing question in the cell-cycle field, namely how/whether transcription and translation are coupled to cell-cycle progression. This is technically challenging to address, and many previous studies were hampered by the necessity to synchronize the cells in the cell cycle. The approach of this study of using metabolic labeling in non-synchronized cells is not novel in itself. However, the analysis by microscopy is superior to previous flow-cytometry based strategies in that it allows the use of cell-cycle markers and thereby precise identification of cells in each cell-cycle phase. In addition, it allows accurate measurements of cell size and thus addressing questions of correlations between cell size and transcription / translation rates. A further strength of the study design is that they investigate both transcription and translation in parallel. The authors very nicely review the existing literature and point out the likely reasons for conflicting conclusions (synchronization methods, choice of model system). The advantages of their approach, such as single-cell analyses in non-synchronized cells and the use of cell-cycle markers make their conclusions less likely to be flawed and thus represent an important advance in the field. These findings are of interest for researchers working on the cell-cycle field and on the translation field. There have been significant technical advances in the translation field in recent years, allowing studying not only global translation but also translation of specific mRNAs. I expect that the old questions of coupling cell cycle and cell growth will be revisited also by others, exploiting these new approaches. My field of expertise extends to the cell-cycle field and the regulation of translation and the use of fission yeast.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Summary Single cell measurements (flow cytometry and imaging) from unperturbed cells are obtained to investigate scaling of transcription and translation in fission yeast. A key finding is that translation and transcription are somewhat differentially responding to changes in cell size and cell cycle. Perhaps the most central finding of this manuscript is that transcription is not a limiting factor to translation and suggests that transcription is not limiting growth (increase in biomass).

      Major comments: What I like in this manuscript is that the translation and transcription measurements have been carefully checked to reflect the initial rates before the HPG and EU signals lose their linearity. More generally, experiments have been conducted with appropriate controls, and the analysis of unperturbed cells in each cell cycle phase is likely to be highly relevant for resolving some of the controversies in the field. Most claims and the conclusions are well supported by the data. Although it is encouraging that the results for translation match the single cell mass measurements in mammalian cells (e.g., ref 18), I would have liked to see some more discussion about the potential caveats of the performed analyses such as the low signal to noise ratio in EU incorporation and other potential technical issues, which might have confounded the results. As an example, looking at Figs 1B and E, most of the protein and RNA synthesis signal is nuclear localized. Is this due to nucleolar staining and incorporation of the labels into nascent ribosomes? Yet the manuscript mentions that roughly half of RNA is for rRNA and for ribosomal proteins the fraction of HPG incorporation might be even lower. This statement does not sound entirely consistent with the experimental images shown in Fig 1. Please clarify.

      We had initially performed modelling to estimate the proportion of rRNA in transcription but after reconsideration we agree that is difficult to assess whether the special pattern we observe is consistent with the statement that roughly half of the nascent RNA is rRNA. There is signal in the cytoplasm indicating that within the pulse time some RNA are exported from the nucleus, thus the localisation of the RNA signal is not necessarily an accurate indication of the fractions of the different RNA types in global transcription. We have removed the statement “Although the precise fractions of the different types of RNA in global transcription have not been fully characterised, recent work indicated that only half of the newly synthesised RNA consists of ribosomal RNA molecules, suggesting that a significant portion of transcription is dedicated to the production of messenger and other RNA molecules [27].” It cannot be concluded that most of the protein synthesis is nuclear located in Figure 1B. As mentioned in the text we cannot differentiate between proteins being synthesised in the nucleus and proteins being rapidly imported, we also cannot say what fraction of the proteins synthesised are related to ribosome biogenesis.

      A curious thing that has been glossed over is that the transcription and translation seem not to be completely linear but to display opposite patterns (translation slightly reducing, transcription slightly overshooting with cell size compared to a linear model). It remains possible that this could be experimental noise and a visual pattern that is not real, but it could also be relevant for growth control. For example, my interpretation from Fig. 2B is that the signal is not linear and starts to saturate around 10.5 um cell length as seen from the upper IQR. Related to this, I think it is oversimplification to force the data to appear as a discontinuous linear trend by splitting the data in 2H into two segments. Such a treatment will obviously match the data better than a single linear regression, but perhaps some nonlinear model would be actually much more accurate, unless you can point out some kind of regulatory event at the intersection of these two linear segments. In my opinion the current data looks more like a typical (logarithmic) growth curve of the cell population reaching saturation. Please comment.

      We agree that fitting two linear regressions for cells shorter and longer than 15 µm is in Figure 2H and 2I was an oversimplification which could result in a false discontinuity in the data. This echoes a comment from reviewer 1 pointing out that 15 µm might not be the length at which the transition occurs. We have removed the linear regressions and added a locally estimated scatterplot smoothing (LOESS) function which capture the nonlinear transition between the increase of translation with size and the saturation, and we have changed the cell length at the estimated saturation from 18 to 19 µm in the text to better reflect the trend.

      The main conclusion presented in the abstract is that scaling in transcription may result from dynamic equilibrium between RNA polymerases and available DNA template. This is a bit of speculative part, which I was not too fond of. The dynamic equilibrium idea has been suggested also elsewhere (refs 47) and is not well developed in this manuscript. There is a lack of mechanistic understanding and no formal (mathematical) model to support this idea. For example, global transcription increases much less (1.3-1.4x) than expected based on the increase in DNA content from G1 to G2 (2x). Is this expected based on the dynamic equilibrium model?

      The dynamic equilibrium model has been proposed and developed by Swaffer et al. (2022 – currently on bioRxiv) based on mass action kinetics describing the interaction between RNA polymerases and DNA. The model predicts that transcription increases with cell size and with the amount of DNA. With this model, the increase in transcription with DNA for a given cell size is also a function of cell size. Smaller cells are predicted to have a smaller relative increase in transcription from 1C to 2C DNA content than larger cells. This implies that depending on the cell size to DNA ratio of a cell, the span increase in transcription produced by a doubling in the amount of DNA goes from a small increase (at small cell size to DNA ratio) to a doubling (at large cell size to DNA ratio). Thus, in our view the 1.3-1.4x increase in transcription we observe from G1 to G2 is consistent with the dynamic equilibrium model.

      I am somewhat concerned about the interpretation of the S phase data in the global transcription measurement. The quantification in Fig. 4D shows S phase being intermediate between G1 and G2. Yet, when you look at the data in Fig. 4C, the S phase median is clearly discontinuous, with higher transcription in smaller S cells. I believe this could affect the normalized data in Fig. 4D and result in the apparent increase in transcription in S phase cells. Having said that, I am not sure if this small S phase transcription is noise (low cell counts?) or a real S phase specific regulation of transcription which is not DNA content dependent. This results is one of the most central ones in this paper to differentiate between transcriptional and translational scaling. Therefore, additional data or insights would be highly appreciated.

      It is possible that the discontinuity in the medians of the S phase population in Figure 4C could be the result of noise due to the low cell count in the short size bins (115 cells at 6.5 µm, 404 at 7.5 µm). In addition, because we cannot measure DNA with a degree of accuracy high enough to identify how advanced in S-phase each cell is, we do not know the distribution of the advancement into S-phase of cells for each length bin. This is complicated by the fact that some cells of the CCP∆ mutant start S-phase whilst still septated and might be in a late S-phase stage by the time the cell splits so the median global transcription of the shorter length bin does not necessary reflect the median of early S-phase cells. Hence the discontinuity observed with cell length does not necessarily suggest that there is a discontinuity happening through S-phase. We suggest that since the mean global transcription per cell length of cells in S-phase is in between the mean global transcription per cell length of cells in G1 and in G2, the increase happens through S-phase. To reflect this possibility we have added “It is also possible that the increase happens at a certain stage of S-phase independent of the amount of DNA since we do not know the extent of S-phase of each cell.”

      Minor comments: Line 61: "patterns of protein RNA". I guess this refers to patterns of protein/RNA synthesis?

      “patterns of protein and RNA” has been changed to “patterns of protein and RNA synthesis”.

      Line 248: typo "Tanslation"

      “Tanslation” has been changed to “Translation”.

      Line 410 and 416: Move interquartile ranges from line 416 to line 410 as this is the first occurrence of the IQR abbreviation.

      “Interquartile ranges” has been moved from line 416 to line 410.

      Line 473: "Almost linear". This is a subjective expression, please provide some measure such as the R2 value to quantitatively evaluate linearity in this strain.

      We have added a measure of the deviation from linearity in the text “, 15 % deviation from the OLS linear regression shown in Figure 1F”. Line 547: Is there a reason to stress in this experiment that the AREA of the fluorescence signal was measured as the area indicates the total fluorescence intensity?

      “area of the” has been replaced by “total” so the sentence refers to the total fluorescence intensity signal of Sytox Green. Fig1A: The schematic mentions peptides, shouldn't it be more accurate to use "polypeptides" or "proteins" when discussing protein synthesis?

      “Peptides has been changed to Polypeptides”

      Fig 5G: Y axis scale has a typo in the word transcription.

      “Trancription” has been changed to “Transcription”

      CROSS-CONSULTATION COMMENTS I also agree with the points raised by the colleagues. There will always be some technical or interpretation issues related to every experimental technique, every model system and every mutant strain used. I believe after addressing these limitations as pointed out in the reviews, most of those issues have been clarified for the readers.

      Reviewer #4 (Significance (Required)):

      Basier and Nurse revisit the classic question regarding growth and cell size control by examining scaling of global translation and transcription in fission yeast. Knowing how cells alter their transcription and translation has important consequences in cellular functions during proliferation and cellular aging and is of broad general interest. The main driver for this current work is that previous experiments both in fission yeast and other model organisms have yielded conflicting results, possibly due to different cell cycle synchronization methods. The strength of the paper is indeed in the single cell analyses of well defined yeast strains which allow accurate assessment of the cell cycle dependent changes and accurate measurements of cell size using the cell length.

      Reassuringly, the single cell analyses from unperturbed yeast cells resemble those recently obtained from unperturbed growth of individual mammalian cells. The main conclusion that transcription is not limiting translation, and consequently not limiting growth of the cells, is interesting as it is not consistent with some of the prevailing ideas in the cell size field. These ideas include ploidy dependent gene expression where DNA content is thought to be limiting growth or the model for minimal gene expression which assumes RNA polymerases are limiting gene expression and growth. In this regard, this manuscript provides important insights for future thinking of how growth is controlled.

      keywords: cell cycle, cell size control

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewers for their critical analysis of our manuscript. We have addressed all reviewer concerns and questions in our revised version. Along with other improvements requested by the reviewers, we added an MTT assay to validate our flow cytometry assays, normalized binding to surface area to better compare toxin binding between Leishmania and HeLa cells, and revised the discussion. We believe the revised contribution provides important novel insights into membrane integrity in a non-standard organism that will appeal to a broad audience.

      Reviewer comments below are in italics.

      Point-by-point description of the revisions

      Reviewer 1

      *Major Comments. The experimental work has been carried out carefully, including multiple biological replicates, convincing statistical analysis. Data presentation is extensive, including 6 supplementary figures. It is likely that the experiments could be reproduced by others, as the approaches do not seem to be especially difficult, and the methods are well documented. *

      We thank the reviewer for this assessment.

      *My major comment regarding revision is that this paper is quite long and extensive given the relatively restricted body of experiments and discrete conclusions. The principal discovery is that sphingolipids protect Leishmania parasites against somewhat artificial treatment with bacterial sterol-binding pore forming toxins, but they do not do so by obstructing toxin binding to sterols. A similar effect is seen for the antileishmanial drug amphotericin B, the most important agent studied. No further mechanistic insights are provided regarding the process whereby sphingolipids blunt toxicity of either the CDCs or amphotericin B. In addition, the experimental approach relies largely upon one methodology, dose-response curves. A report with such highly focused scope should be presentable with considerably more economy. In particular, the Discussion is long and diffuse, obscuring the presentation of the major conclusions. It could probably be cut in half and would in the process present the major deliverables of the paper with higher impact. *

      We have condensed the discussion as requested, and to address Reviewer 2’s concerns, we provided a summary articulating the significance.

      Significance

      *The most notable advance is the observation that sphingolipids protect Leishmania parasites from the cytotoxic activity of the first line antileishmanial drug amphotericin B that binds to the major sterol in the parasite plasma membrane, ergosterol, and induces pore formation. This discovery suggests that parallel treatments with agents that selectively reduce sphingolipid levels in the parasite might act synergistically with amphotericin B, potentially allowing treatment with lower doses of this inherently toxic drug. This work will likely be of most interest to those with a focus on pharmacology and drug development for this and related parasites, but it will also be of some interest to those working on the basic biochemistry of these organisms. The senior authors are major workers in sphingolipid biochemistry in Leishmania parasites and thus are well positioned to address the relevant background in the field, much of which has come out of their laboratories.

      The major limitation of this study is its relatively circumscribed scope, resulting in one principal conclusion: Leishmania sphingolipids blunt the potency of toxins or drugs that target sterols for pore formation, but they do not do so by impairing binding of these agents to sterols, as they do in mammalian cells. The work would be of higher impact if it addressed mechanistically how sphingolipids do decrease toxicity, e.g., do they prevent these agents from oligomerizing or from intercalating into the membrane to form pores. Such studies would require the application of an expanded repertoire of experimental methodologies going beyond the measurement of dose-response curves with various mutants and drugs.*

      We agree with the reviewer that next steps include determining if Leishmania sphingolipids interfere with oligomerization or pore-insertion. One challenge is that these tools need to first be validated in Leishmania.

      To address the reviewer concern about the limited range of experimental methodologies, we added an MTT assay (Supplementary Fig S2E) as validation of our flow cytometry assays. We have better summarized the significance and broad impact of our work in lines 466-476.

      Reviewer 2

      *In the abstract the authors describe that the pore-forming toxins engage with ceramide and other lipids and while it's clear that the levels of sphingolipids are important for the effect of these toxins there is limited evidence to show they physically interact as the word engage suggests. *

      We agree with the reviewer that we do not show physical interaction. In the abstract, we are careful to only use the word “engage” in association with our proposed model. Our proposed model both explains our data, and uses those data to open new horizons by making falsifiable predictions that can be tested in the future. Direct engagement of toxins with lipids is one such prediction. For these reasons, we prefer to retain the word “engage” in the abstract.

      *The authors conclude that the ergosterol on the Leishmania cell membrane is less accessible to the CDCs as it does not bind as much CDCs as a HeLa cell. What is the relative abundance of sterols in the HeLa membrane in comparison to a Leishmania cell. A HeLa cell is much bigger than a Leishmania cell and will therefore be able to bind a lot more CDC, was the MFI normalised for cell size? This would be important to know as the difference in intensity may be purely related to the difference in cell size. *

      We thank the reviewer for this insight. We had not normalized MFI by cell surface area. We added MFI normalized to cell size (described on lines 573-577) and found that when area was accounted for, the promastigotes bound more toxin than HeLa cells. These data are now included as Supplementary Fig S1A, and discussed on lines 187-189.

      *The authors are keen to prosecute that ceramide is important for differences between PFO and SLO action as the inhibitor has a much greater effect on the PFO treatment of ipcs- cells than SLO, as ceramide will accumulate in these cells. But for the SLO analysis they stated that the treatment of spt2- with myriocin had no change on the LC50 as the target of myriocin was spt2 while they noted was there a drop in the LC50 with PFO. Based on this I think the importance of ceramide is being overstated here, as spt2- cells have little ceramide in them. Moreover the authors also suggest that changes to the lipid environment rather than a single species might be important. Are there alternative targets the myriocin might inhibit when there is no spt2-, it is intriguing that there is a decrease in LC50 for PFO on spt2- myriocin treated cells. *

      Clearly, IPC is very important for determining the cytotoxicity for the CDCs in Leishmania but I think the evidence for the role of ceramide and the sensing of it is less clear cut and the strength of the conclusions about this should be modified. In the results the authors conclude that the L3 loop is sensing ceramide and the data shows that the L3 loop is important but in the discussion they are more circumspect about the moieties L3 can detect. The authors should qualify these conclusions in the results a bit more.

      As requested by the reviewer, we have qualified our statements in the results, lines 282, 297, 315.

      *Minor comments *

      *It would be helpful for the review process to include line and page numbers to highlight areas that I have concerns about. *

      We agree with the reviewer and have added line numbers.

      *In the first paragraph of the results is there a reference for the spt2- cell line that was used here. *

      We have added the Zhang 2003 reference to the first paragraph of the results, line 161.

      *In the second paragraph there is a disconnect between the statements about the phenotype of the ipcs- cells and the reference/evidence for it. *

      We have added the reference to the earlier mention of the ipcs cells, and in the introduction, lines 118-120 and 167-169.

      *On many of the graphs the letters a, b, c are alongside many of the symbols but it was unclear what they represented. *

      The letters represent statistically distinct groups. These are used instead of stars and bars to reduce clutter on the figure. We have now explained the difference in the first figure legend in which they are used, lines 818-823.

      *The colour scheme for figure 4 was confusing - yellow diamonds in A/B are spt2-/+spt2 but in C/D are iscl-, this makes it hard to compare between them. *

      We have changed the color and symbols for the iscl- mutant in Fig 4 and Fig S6.

      *The methodology states that various tests were used to define whether differences were significant but it was not clear from the figures when these were being applied only a few graphs had '*' associated with them. *

      We have clarified this in the figure legends.

      *There is no overall conclusion to the study at the end of the discussion just a series of limitations of the study, which is good to acknowledge but feels an odd way to finish the manuscript. *

      We have revised the discussion in response to Reviewer 1, and included a summary to tie everything together, lines 466-476.

      *Significance: *

      Overall this is a strong manuscript with a set of experiments that have a clear strategy and purpose that was well written. This paper outlines the importance of the lipid composition for the cytotoxicity of both sterol specific toxins and amphotericin B in Leishmania, which will have significant implications for their study for other pathogens but also for the development of combination therapies to enhance the potency of amphotericin B, as such I think this will be of interest to both researchers interested in drug discovery and those interested in lipid metabolism.

      We thank the reviewer for this assessment.

      Reviewer 3

      Major comments: 1) The idea that sphingolipids do not block toxin access relies on the work of CDC-based probes binding the accessible pool of cholesterol in mammalian membranes. The authors make the observation that ergosterol is not shielded by sphingolipids because the presence of them does not prevent CDC binding. Is it possible to show that Leishmania sphingolipids are able to actually sequester ergosterol or would it all be considered free and available to toxin binding?

      Our interpretation of the binding data is that the Leishmania sphingolipids fail to sequester ergosterol from toxins, so ergosterol accessibility is independent of sphingolipids. Similar to mammalian cells, there could be an “essential” pool of ergosterol bound to other proteins/lipids that is inaccessible to toxins. However, detecting that pool is technically challenging.

      We have revised the manuscript to clarify this, lines 454-456.

      * 2) The statistical analysis applied to each experiment, while defined in the figure legends, are presented mostly using uncommon methods of presentation, making it difficult to determine if the correct analysis was applied.*

      We have clarified the statistics and use of letters. The letters represent statistically distinct groups. These are used instead of stars and bars to reduce clutter on the figure. We have now explained the difference in the first figure legend in which they are used, lines 818-823.

      * 3) The binding of these toxins to Leishmania cells appears to be independent of their lipid composition, but Figure 1A-D suggests that these toxins do not bind very well to Leishmania; a ~65 fold increase in toxin added only results in a maximal 3 fold change in amount of toxin bound. Therefore, the authors need to demonstrate that this increase in binding is not simply the result of adding more ug of each CDC. *

      Leishmania are smaller than HeLa cells, which accounts for the apparent reduced binding. We added Supplementary Fig S1A, which normalized MFI to estimated surface area. When normalized to surface area, Leishmania bound to toxin better than HeLa cells. We further note that the dose-dependent increase in cytotoxicity argues against non-specificity of increased toxin.

      * 4) The authors use HeLa cells to compare the ability of these toxins to bind to sterol containing membranes, but it is unclear how a mammalian cell line, which lacks ergosterol, can inform upon the differences in binding to Leishmania membranes when their data shows almost no cholesterol is found in the Leishmania membrane. The use of HeLa cells to compare the toxicity of these CDCs is simply a control experiment for the lytic activity of these proteins, and should not be used as a direct comparison of their LC50s, as a mammalian plasma membrane lipid composition is significantly different from that of Leishmania. If the authors want to use HeLa cells as a direct comparison to show that sphingolipids in mammalian cells also protect them from CDC pore formation, they must demonstrate the HeLa cells which have genetic defects in sphingolipid biology or which have been treated with sphingomyelinases are more sensitive to these CDCs. *

      We agree with the reviewer that to argue sphingolipids in mammalian cells are protective would require additional data beyond the scope of this manuscript. We are not making any statements about the role of sphingolipids in mammalian cells, which have a controversial role in CDC damage and membrane repair (see e.g. Schoenauer et al 2019. PMID: 29979630). Since the head group of sphingomyelin interacts with cholesterol (Endapally et al 2019), but the IPC head group is not expected to interact similarly with ergosterol, we choose to remain focused on Leishmania sphingolipids.

      Given our focus on Leishmania, why include HeLa cells at all? We think including HeLa cells provides an important and relevant point of reference because there are situations where both human cells and Leishmania promastigotes could encounter pore-forming toxins. This comparison provides insight to the following question: “In a mix of promastigotes and human cells (for example during a blood meal), which cells would die first from the bacterial PFT?” Comparing cytotoxicity to HeLa cells provides a point of reference in judging how cytotoxic CDCs are to Leishmania promastigotes, and how sensitive the spt- promastigotes become.

      We have rephrased the manuscript (lines 208-209) to better clarify that HeLa cells are a reference point so readers can evaluate the relative sensitivity of sphingolipid-deficient promastigotes.

      * 5) The authors need to demonstrate that the mutant cholesterol recognition motif (CRM) and the glycan binding mutant proteins can still bind to both Leishmania and Hela cell membranes to serve as controls for their lack of lytic activities. Without this, they cannot conclude that "Leishmania membranes engage the same binding determinants used by CDCs to target mammalian cells". *

      The glycan binding and ΔCRM mutants are unable to bind to HeLa cells. These toxin mutations were previously characterized (Mozola & Caparon, 2015 and Farrand et al 2010), showing that their defect lies in binding to cells, but not oligomerization or pore-formation. Since their defect lies solely in binding, if these toxins were able to bind to spt2- cells, they would kill the spt2- cells. This enables us to use these toxin mutants to ask if the CRM or glycan-binding is essential for toxin binding to Leishmania. Since the only defect in these mutant toxins is binding (either to glycans or cholesterol), the failure of these mutants to kill allows us to conclude that both of these binding surfaces on the toxin are essential for cytotoxicity in L. major.

      We have clarified the manuscript, lines 236-240. *

      Minor comments: 6) Multiple figures lack adequately defined axes. Examples include, but are not limited to: Figure 1A-D where the X-axis is plotted as logarithmic based 2 but this is not defined. Figure 2 the Y axis is plotted as logarithmic based 10 but is not defined. *

      We have updated the figure legends to indicate where log axes are used.

      7) The authors state that "Promastigotes with inactivated de novo sphingomyelin synthesis has a significant increase in total sterols" in reference to Figure 1E. Not only is there no significance indicated for the spt2-/-, the authors only indicate a significance point for the Myr (not yet defined) + WT sample in "Other sterols".

      We have rephrased this to indicate a trend, line 181.

      8) The authors use increases in membrane permeability as a read out for specific lysis using PI uptake, however, they then refer to this read out as killing of Leishmania, without measuring the viability of these cells. Therefore, the authors should provide additional experiments that demonstrate the death of the different Leishmania strains treated with the cytolysins.

      As requested, we have now provided an additional experiment to validate Leishmania death. We have now added MTT assay as Fig S2E, and discussed in the results, lines 202-205.

      9) It is not clear how the authors calculated their LC50 values in Figure 2. According to the figure legends, the authors used HU/ml ranges that would be sub lethal or not completely lysed within this range to most of the Leishmania strains tested. The data presented in Figure are not clear that the correct LC50 calculations were used as none of the Specific Lysis curves do not reach saturation with the concentrations presented, and one does not even reach 50% Lysis.

      We thank the reviewer for catching this discrepancy. The legend in Fig 2 did not include the correct ranges of toxin dose used for PFO. We have corrected the legend to indicate the toxin range used. To calculate LC50, we used linear regression on the linear portion of the death curve to determine the concentration at 50% lysis. This gives us a way to determine LC50 even without the use of very large (and costly) amounts of toxin to get extensive saturation on the kill curve.

      * 10) Figure 4 and Figure S6 are very difficult to interpret. Figure S6 would benefit by breaking up each graph into multiple graphs that would allow the reader to see more of the curves individually. Additionally, there are multiple conditions were it appears that a different number of experiments (2-4 totals) were preformed but statistical analysis was applied to these data. *

      We updated the labels on Fig 4 for improved readability. We broke Fig S6 up into multiple graphs. We have removed unpaired data (eg the n of 4 noted by the reviewer), and re-checked our stats. This change did not alter our conclusions. The apparent n of 2 was overlap of data points due to poor jittering of the datapoints. We have increased the jitter on the data points to make all three reps more distinct.

      * 11) The authors state "In contrast to myriocin-treated ipcs- L. major, which contain low levels of ceramide, myriocin treated iscl- L. major contain low levels of IPC" but do not provide a reference or point to data to support this claim. *

      We have qualified these statements to say ‘are expected to’ on lines 306-307.

      * 12) Figure 5 E would benefit in presentation by being broken up into 4 separate graphs based on the toxin used, as it is difficult to determine which data points are being compared. *

      We compare by toxin used in Fig 5A-D. The purpose of Fig 5E is to compare between toxins. We included all of the data points (including resistant control strains) for completeness. The main focus is the spt2- and ipcs- parts of Fig 5E.

      * 13) The authors state that "myriocin did not inhibit growth more than 25% promastigotes at 10 μM" but this data is not presented. *

      We have now added these data as Fig 6A.

      14) Multiple graphs lack legends or have axis that are not defined.

      In order to improve readability and avoid cluttering the figures, where the legends and axes are the same across multiple graphs, they are included only once for a given row and/or column.*

      Significance:

      Overall, the experiments presented were conducted to analyze each question, but many of the results are observational, without considering the impact of altered lipid species on the findings. The data suggests an existence of a protective mechanism for the parasite from CDCs, but it unclear how these finding inform upon the CDC or Leishmania fields. CDCs have been known to target sterols within membranes and that altered local membrane environments can have substantial impacts on CDC binding. This work suggests that the altered lipid species of Leishmania membranes, compared to a mammalian membrane, could dramatically effect the sequestering power of sphingolipids or other lipids, and thus change how CDCs bind to them. This work advances is likely to have specialized audience of Leishmania researchers looking at the dynamics of their membranes.*

      We believe this work will be valuable to a broad audience because it will be of interest to researchers studying membranes in general, pathogenic eukaryotes and pore-forming toxins. Most membrane biology work is done either in opisthokonts or in model liposomes, so there are few studies on biomembranes in other taxonomic groups, including many different human pathogens. We provide a blueprint for examining the membranes of non-standard organisms, establish L. major as a pathogenically relevant model system, and report on key differences in sterol sequestration compared to mammalian cells. These findings provide important perspectives for the generalization of biomembranes, especially when compared to prior work in opisthokonts.

      We have clarified our significance in lines 466-476.

    1. Author Response

      Reviewer #1 (Public Review):

      This paper tests whether people vary their reliance on episodic memory vs. incremental learning as a function of the uncertainty of the environment. The authors posit that higher uncertainty environments should lead to more reliance on episodic memory, and they find evidence for this effect across several kinds of analyses and across two independent samples.

      The paper is beautifully written and motivated, and the results and figures are clear and compelling. The replication in an independent sample is especially useful. I think this will be an important paper of interest to a broad group of learning, memory, and decisionmaking researchers. I have only two points of concern about the interpretation of the results:

      1) My main concern regards the indirect indicator of participants' use of episodic memory on a given trial. The authors assume that episodic memory is used if the value of the chosen object (as determined by its value the last time it was presented) does not match the current value of the deck it is presented in. They find that these mismatch choices happen more often in the high-volatility environment. But if participants simply choose in a more noisy/exploratory way in the high volatility environment, I believe that would also result in more mismatched judgments. What proportion of the trials labeled as episodic should we expect to be a result of noise or exploration? It seems conceivable that a judgment to explore could take longer, and result in the observed RT effects. Perhaps it could be useful to match up putative episodic trials with later recognition memory for those particular items. The across-subjects correlations are an indirect version of this, but could potentially be subject to a related concern if participants who explore more (and are then judged as more episodic) also simply have a better memory.

      Thank you for this important suggestion. We agree that noisy/exploratory choices could potentially masquerade as episodic on the episodic-based choice index used as one of our behavioral measures. As pointed out, this is because participants may be more likely to make noisier incremental value-based decisions in the high volatility compared to the low volatility environment. In our revision, we provided a new analysis that shows that, as the reviewer predicted, choices are indeed more noisy in the high volatility environment. We answer this concern in two ways. First, we took this noise into account in our analysis of the episodic/incremental tradeoff and show that it does not account for the main findings. And second, we provided a new analysis of subsequent memory that shows that choices that are defined as episodic during the decision-task are also associated with better recognition memory later on. These new analyses are described below as well.

      We used a mixed-effects logistic regression model to test for an interaction effect of environment and model-estimated deck value on whether the orange deck was chosen. We fit this model only to trials without the presence of a previously seen object in order to achieve a more accurate measure of noise specific to incremental learning. In both the main and replication samples, participants did indeed make noisier incremental decisions in the high compared to the low volatility environment (Main: 𝛽 = −1.589, 95% 𝐶𝐼 = [−2.091, −1.096], Replication: 𝛽 = −1.255, 95% 𝐶𝐼 = [−1.824, −0.675]). To account for the possibility that the measured difference between environments in our episodic-based choice index may be related to this difference in incremental noise between the environments, we included each participant’s random effect of the environment by deck value interaction from this model as a covariate in our analysis of the effect of environment on the episodic-based choice index. While each participants’ propensity to choose with greater noise in the high volatility environment did have an effect on the episodic-based choice index (Main: 𝛽 = 0.042, 95% 𝐶𝐼 = [0.012, 0.072], Replication: 𝛽 = 0.055, 95% 𝐶𝐼 = [0.027, 0.082]), the effect of environment was similar to that originally reported in the manuscript for both samples following this adjustment. The reported effects (lines 178 and Appendix 1) and methods (lines 643-655) have been updated to reflect these changes.

      We applied a similar logic to the reaction time analysis, to address the possibility that decisions based on exploration may take longer compared to decisions based on exploitation of learned deck value. We included a covariate in the analysis of the effect of episodic-based choices on reaction time that captured possible slowing due to switching from choosing one deck to the other (lines 656-662) and found that the slower reaction times on episodic choices are not fully explained by exploration. Because in this task a decision to explore is captured by switching from one deck to another, the effect of episodic-based choices on reaction time reported in the manuscript should account for this behavior. We have clarified this reasoning in the methods (lines 661-662).

      Finally, thank you for the idea to sort objects in the recognition memory test by whether they were from episodic- or incremental-based choice trials to provide a further test of whether our approach for sorting episodic decisions withstands an independent test. We performed this analysis and found that, in both samples, participants had better memory for objects from episodic-based choice trials. This result provides further support for the putative episodic nature of these trials and is now reported in the Results (lines 300-304 and Appendix 1), Methods (lines 737-742) and appears as a new panel in Figure 5 (Figure 5A).

      2) The paper is framed as tapping into a trade-off between the use of episodic memory vs. incremental learning, but it is not clear why participants would not use episodic memory in this particular task setup whenever it is available to them. The authors mention that there is "computational expense" to episodic memory, but retrieval of an already-established strong episodic memory could be quite effortless and even automatic. Why not always use it, since it is guaranteed in this task to be a better source of information for the decision? If it is true that RT is higher when using episodic memory, that is helpful toward establishing the trade-off, so this links to the concern above about how confident we can be about the use of episodic memory in particular trials.

      Thank you for raising this important point and for giving us the opportunity to clarify. We now address this point in two ways: first, we provide a new analysis of episodic memory and choice behavior and we address this point explicitly in the discussion.

      As now emphasized in the paper (lines 118-122 and lines 384-388), in this task, it is true that an observer with perfect episodic memory should always make use of it whenever available (i.e. on trials featuring previously seen objects). However, human memory is fallible and resourcelimited, and we find that participants with less reliable episodic memory overall actually relied less on this strategy and more on incremental learning throughout the task (Figure 5C and 5D). In other words, there is noise and uncertainty also in the episodic memory trace. While it is not the main focus of our study, the noise in episodic memory is indeed another reason why trading off between episodic memory and incremental learning is advantageous for behavior. We further agree that while the RT effects show that, relative to using incremental value, episodic memory retrieval takes longer, we cannot make strong statements about effort or “computational expense” per se from our data. Accordingly, we have removed the “computational expense” phrase (line 491), as well as our suggestion that episodic retrieval is “perhaps more effortful overall” (line 181), from the paper.

      Reviewer #2 (Public Review):

      This manuscript addresses the broad question of when humans use different learning and memory systems in the service of decision-making. Previous studies have shown that, even in tasks that can be performed well using incremental trial-and-error learning, choices can sometimes be based on memories of individual past episodes. This manuscript asks what determines the balance between incremental learning and episodic memory, and specifically tests the idea that the uncertainty associated with each alters the balance between them in a rational way. Using a task that can separate the influence of incremental learning and episodic memory on choice in two large online samples, several lines of evidence supporting this hypothesis are reported. People are more likely to rely on episodic memory in more volatile environments when incremental learning is more uncertain and during periods of increased uncertainty within a given environment. Individuals with more accurate episodic memories are also more likely to rely on episodic memory and less likely to rely on incremental learning. These data are compelling, even more so because all of the main findings are directly replicated in a second sample. These data extend the notion of uncertainty-based arbitration between different forms of learning/memory, which has been proposed and evaluated in other contexts, to the case of episodic memory versus incremental learning.

      The weaknesses in the paper are mostly minor. One potential weakness is the nature of the online sample. Many participants apparently did not respond to the volatility manipulation, making it impossible to test whether this altered their choices. It is unclear whether this is a feature of online samples (where people can be distracted, unmotivated, etc.) or of human performance more generally.

      Thank you for your comments. Indeed, we also found it interesting that many participants were insensitive to the manipulation of volatility in our study, as assessed and filtered based on the initial deck learning task. As you note, our study is not positioned to determine the cause and whether this is due to the online population or human performance more generally, and we added a discussion of this point to the paper (lines 477-485). Also, fractions exceeding 1/3 apparently inattentive participants are very much the norm in our experience with other online studies across many tasks. While there is much to say about the implications of this (see e.g. Zorowitz, Niv & Bennett PsyArXiv 2021), our basic philosophy (which we follow here) is that it is best practice, and conservative, to exclude aggressively so as to focus analyses on those participants for whom the experimental questions can meaningfully be asked.

      Reviewer #3 (Public Review):

      The purpose of this work is to test the hypothesis that uncertainty modulates the relative contributions of episodic and incremental learning to decisions. The authors test this using a "deck learning and card memory task" featuring a 2-alternative forced choice between two cards, each showing a color and an object. The cards are drawn from different colored decks with different average values that stochastically reverse with fixed volatility, and also feature objects that can be unfamiliar or familiar. Objects are not shown more than twice, and familiar objects have the same value as they did when shown previously. This allows the authors to construct an index of episodic contributions to decision-making: in cases where the previous value of the object is incongruous with the incrementally observed value, the subject's choice reveals which strategy they are relying on.

      The key manipulation is to introduce high- and low- volatility conditions, as high volatility has been shown to induce uncertainty in incremental learning by causing subjects to adopt an optimal low learning rate. The authors find that the subjects show a higher episodic choice index in the high-volatility condition, and in particular immediately after reversals when the model predicts uncertainty is at a maximum. The authors also construct a trial-wise index of uncertainty and show that episodic index correlates with this measure. The authors also find that at the subject level, the overall episodic choice index correlates with the ability to accurately identify familiar objects, and the reason that this indicates higher certainty in episodic memory is predicting the usage of episodic strategies. The authors replicate all of their findings in a second subject population.

      This is a very interesting study with compelling results on an important topic. The task design was a clever way to disentangle and measure different learning strategies, which could be adopted by others seeking to further understand the contributions of different strategies to decision-making and its neural underpinnings. The article is also very clearly written and the results clearly communicated.

      A number of questions remain regarding the interpretation of the results that I think would be addressed with further analysis and modeling.

      At a conceptual level, I was unsure about the equivalence drawn between volatility and uncertainty: the main experiments and analyses all regard reversals and comparisons of volatility conditions, but the conclusions are more broadly about uncertainty. Volatility, as the authors note, is only one way to induce uncertainty. It also doesn't seem like the most obvious way to intervene on uncertainty (eg manipulated trial-wise variance seems more obvious). The trial-wise relative uncertainty measurements in Fig 4 speak a bit more to the question of uncertainty more generally, but these were not the main focus and also do not disambiguate between trial-wise uncertainty derived from reversals versus within block variation.

      Thank you for your comments. We agree that this distinction was unclear and appreciate the opportunity to clarify. We hope the manuscript is now clear about the conceptual distinction between uncertainty as the construct of theoretical interest vs. volatility as the operational manipulation being used to access it. We have adjusted the presentation and added discussion to clarify this, and also enhanced the trial-wise analyses to strengthen the interpretation of results in terms of uncertainty more generally. Regarding obviousness, we think perhaps there is a difference between areas of study on this point. While trial-wise outcome variance (which we call stochasticity) has been widely used to manipulate uncertainty in perceptual and sensorimotor studies, it has been more rarely manipulated in reward learning studies, where instead the volatility manipulation we use has predominated. We have a recent paper reviewing examples of both and arguing that the field has underemphasized the importance of stochasticity, so we are sympathetic here (Piray and Daw, Nature Communications 2021).

      In any case, to address these points on revision, we have reframed the first section of the results, where we look at effects of environment on episodic-based choice, to focus primarily on volatility. Specifically, we have expanded on our explanation of how volatility induces uncertainty, changed the subtitle of the section from ‘uncertainty’ to ‘volatility’, and have specified that the prediction in this section is primarily about volatility (lines 97 and 116-123). We also reframed the second section of the results to be primarily about the uncertainty induced by volatility: while differences between the environments capture coarse effects of volatility, trialwise uncertainty should be present following reversals across both environments. We have now focused our explanation in this section on trial-wise uncertainty within the environments rather than volatility between the environments (lines 184-192). Further, we agree that there are other sources of uncertainty besides volatility that we did not manipulate in the paper, and that it remains for future work whether their manipulation would produce similar results. To amend this, we have added a new paragraph to the discussion covering these alternative sources and further qualifying the scope of our conclusions (lines 434-446).

      We also agree that our analyses in Figure 4 did not yet speak to differences in episodic-based choice that may arise due to blockwise volatility (as captured by the categorical effect of environment) vs. trial-to-trial fluctuations in uncertainty (as captured by relative uncertainty, over and above the blockwise effect). We have addressed this by adding an additional, separate effect of the interaction between environment and episodic value to our combined choice models which is explained in more detail in the recommendations for the authors portion of our response. These changes and results are described in the Methods (lines 686-694) and Results (lines 276-277; Figure 4C).

      Another key question I had about design choice was the decision to use binary rather than drifting values. Because of this, the subjects could be inferring context rather than continuously incrementing value estimates (eg Gershman et al 2012, Akam et al 2015): the subjects could be inferring which context they are in rather than tracking the instantaneous value + uncertainty. I am not sure this would qualitatively affect the results, as volatility would also affect context confidence, but it is a rather different interpretation and could invoke different quantitative predictions. And it might also have some qualitative bearing on results: the subjects have expectations about how long they will stay in a particular environment, and they might start anticipating a context change after a certain amount of time which would lead to an increase in uncertainty not just immediately after switches, but also after having stayed in the environment for a long period of time. Moreover, depending on the variance within context, there may be little uncertainty following context shifts.

      Thank you for raising this important point. To address the possibility that the task structure could have encouraged participants to infer context rather than engage in incremental learning, we added an alternative contextual inference (CI) model, based on a hidden Markov model with two hidden states (e.g. that either the red deck is lucky and the blue deck unlucky or vice versa). This model is now described in the Results of the main text (lines 226-228), listed in the Methods (line 674), and explained in detail in Appendix 3 alongside the computational models of incremental learning. Following model comparison, we found that this model provided a worse fit than the incremental learning models we previously presented in both samples, suggesting that incremental learning is a better descriptor of participants’ choices in this task than contextual inference. The results of this comparison are reflected in an updated Figure 3A.

    1. Author Response

      Reviewer #1 (Public Review):

      The study by Xie et al., investigates whether the entorhinal-DG/CA3 pathway is involved in working memory maintenance. The main findings include a correlation between stimulus and neural similarities that was specific for cued stimulus and entorhinal-DG/CA3 locations. The authors observed similar results (cuing and region specificity) using inverted encoding modeling approach. Finally, they also showed that trials in which participants made a smaller error showed a better reconstruction fidelity on the cued side (compared to un-cued). This effect was absent for larger-error trials.

      The study challenges a widely held traditional view that working memory and episodic memory have largely independent neural implementations with the MTL being critical for episodic memory but not for working memory. The study adds to a large body of evidence showing involvement of the hippocampus across a range of different working memory tasks and stimuli. Nevertheless, it still remains unclear what functions may hippocampus play in working memory.

      We thank the reviewer’s positive appraisal of the current research, which adds to the growing research interest in the MTL’s contribution to WM.

      Reviewer #2 (Public Review):

      Xie et al. investigated the medial temporal lobe (MTL) circuitry contributions to pattern separation, a neurocomputational operation to distinguish neutral representations of similar information. This presumably engages both long-term memory (LTM) and working memory (WM), bridging the gap between the working memory (WM) and long-term memory (LTM) distinction. Specifically, the authors combined an established retro-cue orientation WM task with high-resolution fMRI to test the hypothesis that the entorhinal-DG/CA3 pathway retains visual WM for a simple surface feature. They found that the anterior-lateral entorhinal cortex (aLEC) and the hippocampal DG/CA3 subfield both retained item-specific WM information that is associated with fidelity of subsequent recall. These findings highlight the contribution of MTL circuitry to item-specific WM representation, against the classic memory models.

      I am a long-term memory researcher with expertise in representational similarity analysis, but not in inverted encoding modeling (IEM). Therefore, I cannot verify the correctness of these models and I will leave it to the other reviewers and editors. However, after an in-depth reading of the manuscript, I could evaluate the significance of the present findings and the strength of evidence supporting these findings. The conclusions of this paper are mostly well supported by data, but some aspects of image acquisition and data analysis need to be clarified.

      We thank the reviewer for positive appraisal of the current study.

      I would like to list several strengths and weaknesses of this manuscript:

      Strengths:

      • Methodologically, the authors addressed uncertainty in previous research resulting from several challenges. Namely, they used a high-resolution fMRI protocol to infer signals from the MTL substructures and an established retro-cue orientation WM task to minimize the task load.

      • The authors selected a control ROI - amygdala - irrelevant for the experimental task, and at the same time adjacent to the other MTL ROIs, thus possibly having a similar signal-to-noise ratio. The reported effects were observed in the aLEC and DG/CA3, but not in the amygdala.

      • Memory performance, quantified as recall errors, was at ceiling - an average recall error of 12 degrees was only marginally away from the correct grating towards the closest incorrect grating (predefined with min. 20 degrees increments). However, the authors controlled for the effects of recall fidelity on MTL representations by comparing the IEM reconstructions between precise recall trials and imprecise recall trails (resampled to an equal number of trials). The authors found that precise recall trails have yielded better IEM reconstruction quality.

      • The author performed a control analysis of time-varying IEM to exclude a possibility that the mid-delay period activity in the aLEC-DG/CA3 contains item-specific information that could be attributed to perceptual processing. This analysis showed that the earlier TR in the delay period contains information for both cued and uncued items, whereas the mid-delay period activity contains the most information related to the cued, compared to uncued, item.

      We thank the reviewer for highlighting the multiple strengths of the current study.

      Weaknesses:

      • The authors formulate their main hypothesis building on an assumption related to the experimental task. This task requires correctly selecting the cued grating orientation while resisting the interference from internal representations of the other orientation gratings. The authors hypothesize that if this post-encoding information selection function is supported by the MTL-s entorhinal-DG/CA3 pathway, the recorded delay-period activity should contain more information about the cued item that the uncued item (even if both are similarly remembered). Thus, the assumption here is that resolving the interference would be reflected by a more distinct representation in MTL for the cued item. Could it be the opposite, namely the MTL could better represent the unresolved interference, for example by the mechanism of hippocampal repulsion (Chanales et al., 2017). It could strengthen the findings if the authors comment on the contrary hypothesis as well.

      We thank the reviewer for pointing out this interesting alternative hypothesis. Because of the different task design (e.g., over the course of learning vs. WM) and stimuli (e.g., spatial memory vs. orientation grating), it is hard to directly compare Chanales et al.’s findings with the current results. That said, we think the idea that the representation of similar information would lead to greater task demand on the MTL is consistent with our intuition regarding the role of the MTL in supporting the qualitative aspect of WM representation. We have now further discussed this issue in our revised manuscript to invite further consideration of the suggested alternative hypothesis,

      “Our data suggest that this process would result in more similar and stable representations for the same remembered item across trials, as detected by multivariate correlational and decoding analyses in the current study. However, under certain task conditions (e.g., learning spatial routes in a naturalistic task over many repetitions), the MTL may maximally orthogonalize overlapping information to opposite representational patterns (hence “repulsion”) to minimize mnemonic interference (Chanales et al., 2017). It remains to be determined how these learning-related mechanisms in a more complex setting are related to MTL’s contributions to WM of simple stimulus features.”

      • It is not clear for me why the authors chose the inverted encoding modelling approach and what is its advantage over the others multivoxel pattern analysis approaches, for example representational similarity analysis also used in this study. How are these two complementary? Since the IEM is still a relatively new approach, maybe a little comment in the manuscript could help emphasizing the strength of the paper? Especially that this paper is of interest to researchers in the fields of both working memory and long-term memory, the latter being possibly not familiar with the IEM.

      We thank the reviewer for this suggestion. In principle, the IEM is a multivariate pattern classification analysis based on an encoding model. There is no fundamental difference between this approach and other machine-learning or classification approaches, except that the IEM is a more model-based approach and therefore can be more computationally efficient (see Xie et al., 2023 for a conceptual overview for multivariate analysis of high-dimensional neural data). The relationship between IEM and representational similarity is grounded in item-specific information that could lead to shared neural variance. How these two analyses are complimented each other is well characterized by a recent theoretical review (Kriegeskorte & Wei, 2021). The rationale is that trial-wise RSA reveals shared neural variance between items, implying the presence of item-specific information in the recorded neural data. And the IEM approach or other classification algorithms can more directly test this item-specific information under a prediction-based framework (e.g., train the data and test on a hold-out set). As a result, the findings of these two methods are correlated at the subject-level (Figure S4), which is important to note for the purpose of analytical reliability. Furthermore, using the IEM also allows us to compare our current findings with that from the previous research (Figure S3), addressing some replicability questions in the field (e.g., Ester et al., 2015).

      We have clarified more on this issue in the paragraph when we first introduce IEM,

      “To directly reveal the item-specific WM content, we next modeled the multivoxel patterns in subject-specific ROIs using an established inverted encoding modeling (IEM) method (Ester et al., 2015). This method assumes that the multivoxel pattern in each ROI can be considered as a weighted summation of a set of orientation information channels (Figure 3A). By using partial data to train the weights of the orientation information channels and applying these weights to an independent hold-out test set, we reconstruct the assumed orientation information channels to infer item-specific information for the remembered item – operationalized the resultant vector length of the reconstructed orientation information channel normalized at 0° reconstruction error (Figure S2). As this approach verifies the assumed information content based on observed neural data, its results can be efficiently computed and interpreted within the assumed model even when the underlying neuronal tuning properties are unknown (Ester et al., 2015; Sprague et al., 2018). This approach, therefore, complements the model-free similarity-based analysis by linking representational geometry embedded in the neural data with item-specific information under a prediction-based framework (Kriegeskorte and Wei, 2021; Xie et al., 2023). Based on this method, previous research has revealed item-specific WM information in distributed neocortical areas, including the parietal, frontal, and occipital-temporal areas (Bettencourt and Xu, 2015; Ester et al., 2015; Rademaker et al., 2019; Sprague et al., 2016), which are similar to those revealed by other multivariate classification methods (e.g., support vector machine, SVM, Ester et al., 2015). We have also replicated these IEM effects in the current dataset (Figure S3).”

      Overall, this work can have a substantial impact of the field due to its theoretical and conceptual novelty. Namely, the authors leveraged an established retro-cue task to demonstrate that a neurocomputational operation of pattern separation engages both working-memory and long-term memory, both mediated by the MTL circuitry, beyond the distinction in classic memory models. Moreover, on the methodological side, using the multivariate pattern analyses (especially the IEM) to study neural computations engaged in WM and LTM seems to be a novel and promising direction for the field.

      Thanks for the reviewer for this positive appraisal of the current study.

      Reviewer #3 (Public Review):

      This work addresses a long-standing gap in the literature, showing that the medial temporal lobe (MTL) is involved in representing simple feature information during a low-load working memory (WM) delay period. Previously, this area was suggested to be relevant for episodic long-term memory, and only implicated in working memory under conditions of high memory load or conjunction features. Using well-rounded analyses of task-dependent fMRI data in connection with a straightforward behavioural experiment, this paper suggests a more general role of the medial temporal lobe in working memory delay activity. It also provides a replication of previous findings on item-specific information during working memory delay in neocortical areas.

      We thank the reviewer for highlighting the contribution of the current study to fill a gap in the literature.

      Strengths:

      The study has strengths in its methods and analyses. Firstly, choosing a well-established cueing paradigm allows for straightforward comparison with past and future studies using similar paradigms. The authors themselves show this by replicating previous findings on delay-period activity in parietal, frontal, and occipito-temporal areas, strengthening their own and previous findings. Secondly, they use a template with relatively fine-grained MTL-subregions and choose the amygdala as a control area within the MTL. This increases confidence in the finding that the hippocampus in particular is involved in WM delay-period activity. Thirdly, their combined use stimulus-based representational similarity analysis as well as Inverted Encoding Modeling and the convergence on the same result is encouraging. Finally, despite focusing on the delay period in their main findings, extensive supplementary materials give insight into the time-course of processing (encoding) which will be helpful for future studies.

      We thank the reviewer for highlighting multiple strengths of this current study.

      Weaknesses:

      While the evidence generally supports the conclusions, there are some weaknesses in behavioural data analysis. The authors demonstrated fine stimulus discrimination in the neural data using Inverted Encoding Modeling (IEM), however the same standard is not applied in the behavioural data analysis. In this analysis, trials below 20 degrees and trials above 20 degrees of memory error are collapsed to compare IEM decoding error between them. As a result, the "small recall error" group encompasses a total range of 40 degrees and includes neighbouring stimuli. While this is enough to demonstrate that there was information about the remembered stimulus, it does not clarify whether aLEC/CA3 activity is associated with target selection only or also with reproduction fidelity. It leaves open whether fine-grained neural information in MTL is related to memory fidelity.

      We thank the reviewer for this cautious note. As the current task is optimized to reveal the neural representation during visual WM and as our participants are cognitively normal college students, participants’ behavioral performance in the current experiment tends to be very good (Figure 1). This leaves us relatively small variation to further probe the behavioral outcomes of the task. We have recently generalized our findings using intracranial EEG and confirmed that trial-by-trial mnemonic discrimination during a short delay is indeed associated with the fidelity of item-specific WM representation (Xie, Chapeton, et al., in press).

      We have further discussed this issue in the revised Discussion,

      “… These two approaches are therefore complementary to each other. Nevertheless, these analyses are correlational in nature. Hence, although fine-grained neural representations revealed by these analyses are associated with participants’ behavioral outcomes (Figure 4), it remains to be determined whether the entorhinal-DG/CA3 pathway contributes to the fidelity of the selected WM representation or also to the selection of task-relevant information. Strategies for resolving this issue can involve generalizing the current findings to other WM tasks without an explicit requirement of information selection (e.g., intracranial stimulation of the MTL in a regular WM task without a retro-cue manipulation, Xie et al., in press) and/or further exploring how the frontal-parietal mechanisms related to visual selection and attention interact with the MTL system (Panichello and Buschman, 2021).”

      Moreover, the authors could be more precise about the limitations of the study and their conclusions. In particular, the paper at times suggests that the results contribute to elucidating common roles of the MTL in long-term memory and WM, potentially implementing a process called pattern separation. However, while the paper convincingly shows MTL-involvement in WM, there is no comparison to an episodic memory condition. It therefore remains an open question whether it fulfils the same role in both scenarios. Moreover, the paradigm might not place adequate pattern separation demands on the system since information about the un-cued item may be discarded after the cue.

      We thank the reviewer for this cautious note. We have now included a more detailed discussion on this issue.

      In the Discussion,

      “To more precisely reveal the MTL mechanisms that are shared across WM and long-term memory, future research should examine the extent to which MTL voxels evoked by a long-term memory task (e.g., mnemonic similarity task, Bakker et al., 2008) can be directly used to directly decode mnemonic content in visual WM tasks using different simple stimulus features.”

    1. Author Response

      Reviewer #2 (Public Review):

      In the current manuscript, Feng et al. investigate the mechanisms used by acute leukemia to get an advantage for the access to the hematopoietic niches at the expense of normal hematopoietic cells. They propose that B-ALLs hijack the niche by inducing the downmodulation of IL7 and CXCL12 by stimulating LepR+ MSCs through LTab/LTbR signaling. In order to prove the importance of LTab expression in B-ALL growth, they block LTab/LTbR signaling either through ligand/receptor inactivation or by using a LTbR-Ig decoy. They also show that CXCL12 and the DNA damage response induce LTab expression by B-ALL. They finally propose that similar mechanisms also favor the growth of acute myeloid leukemia.

      Although the proposed mechanism is of particular interest, further experiments and controls are needed to strongly support the conclusions.

      1/ Globally, statistics have to be revised. The authors have to include a "statistical analysis" section in the Material and Methods to explain how they proceeded and specify for each panel in the figure legend which tests they used according to the general rules of statistics.

      We apologize for the lack of details. This has been corrected in the revised manuscript.

      2/ The setup of each experiment is confusing and needs to be detailed. Cell numbers are not coherent from one experiment to the other. As an example, there are discrepancies between Fig1 and Fig2. Based on the setup of the experiment in Fig.2 (Injection of B-ALL to mice followed by 2 injections of treatment every 5 days), mice have probably been sacrificed 12-14 days post leukemic cell injection. However, according to Fig.1, B cells and erythroid cells at this time point should be decreased >10 times while they are only decreased 2-4 times in Fig.2. This is also the case in Fig.4B-J or Fig.5D with even a lower decrease in B cells and erythroid cells despite a high number of leukemic cells. Please explain and give the end point for each experiment in each figure (main and supplemental).

      We understand the reviewer concern but we’d like point out the following: kinetic experiments such as these were reproduced multiple times in the laboratory. However, when comparing side-by-side experiments performed over the course of several months discrepancies in the exact days when leukemia shuts-down hematopoiesis are bound to happen. This is because there are numerous variables at play that we can minimize to the extent possible, but we cannot completely eliminate. For example, we took all possible steps to work with stable batches of preB-ALL cells. However, it is impossible to be absolutely certain that the batch in one experiment is identical to another experiment. Cells have to be expanded for adoptive transfer, which inevitably carries some variability (all biological systems undergo random mutations, including purchased C57Bl6/J from reputable vendors); slight differences in ALL engraftment (i.e. injection variability) can occur such that kinetics may change by a couple of days, etc. The findings we reported here are highly reproducible: ALL shuts down lymphopoiesis and erythropoiesis acutely, less so myelopoiesis; that LTbR signaling is the major mechanism shutting down lymphopoiesis but not erythropoiesis; that ALLs up-regulate LTbR ligands when compared to non-leukemic cells of the same lineage and at a similar developmental stage; that CXCR4 and DSB pathways both promote lymphotoxin a1b2 expression. The exact kinetics of these experiments will vary, or at least carry a margin of error that is to the best of our capability impossible to eliminate.

      3/ To formally prove that the observed effect is really due to LTab/LTbR signaling, the authors must perform further control experiments. LTbR signaling is better known for its positive role on lymphocyte migration. They cannot rule out by blocking LTbR signaling, that they inhibit homing of leukemic cells into the bone marrow through a systemic/peripheral effect, more than through an impaired crosstalk with BM LepR+ cells. They must confirm for inhibited/deficient LTbR signaling conditions, as compared to control, that similar B-ALL numbers home to the BM parenchyma at an early time point after injection. Furthermore, they cannot exclude that the effect on the expression of IL7 (and other genes), and consequently the effect on B cell numbers, is not simply due to the tumor burden. Indeed, B-ALL numbers/frequencies are different between control and inhibited/deficient signaling conditions at the time of analysis. The analyses should thus be performed at similar low and high tumor burden in the BM for both control and inhibited/deficient LTbR signaling conditions.

      We performed ALL homing experiments into control and LTbR∆ and found no significant differences in ALL frequency or number in BM 24h after transplantation. These data have been included in Figure 4A.

      We also performed experiments to control for the number of ALL cells in the bone marrow. Briefly, we compared the impact of 3 million WT ALLs with that of 3 and 9 million Ltb-deficient ALLs on Il7-GFP expression in BM MSCs. The number of Ltb-deficient ALLs in the BM of mice recipient of 9 million ALLs was equivalent to that of mice that received 3 million WT ALLs 7 days after transplantation. Importantly, Il7 was only downregulated in mice transplanted with WT ALLs. These data have been included in Figure 4R and 4S.

      4/ LT/LTbR signaling is particularly known for its capacity to stimulate Cxcl12 expression. How do the authors explain that they see the opposite?

      The reviewer is alluding to a well-known role of LTbR signaling as an organizer of immune cells in secondary lymphoid organs such as spleen and lymph nodes, and particularly its role in promoting CXCL13, CCL19, CCL21 production by fibroblastic reticular cells of these organs. Both the B cell follicle and the T-zone do not express CXCL12 abundantly. Furthermore, in the B cell follicle niche, LTbR signaling is critical for the maturation of Follicular Dendritic Cells, yet FDCs hardly produce CXCL12 as well. So, while LTbR is a well-known regulator of cell organization through the production of homeostatic chemokines and lipid chemoattractants, CXCL12 itself is not one of the major chemokines controlled by this pathway. In summary, we do not think our data is in any way incompatible with prior studies on the LTbR pathway, and even if it was, to our knowledge this is the first study on cell-intrinsic effects of LTbR signaling in BM MSCs.

      5/ The authors show that CXCL12 stimulates LTa expression in their cell line. They then propose that CXCR4 signaling in leukemic cells potentiates ALL lethality by showing that a CXCR4 antagonist reverses the decrease in IL7 and improves survival of the mice. This experiment is difficult to interpret. CXCL12 has been shown to be important for migration/retention of B-ALL in the BM and the decreased tumor burden is probably linked to a decreased migration more than an impaired crosstalk with LepR+ cells (see also point 3). If CXCL12 increases LTab expression, CXCR4 blockade should do the opposite. This result should be presented. The contradiction is that if B-ALLs induce a decrease in CXCL12 in the BM (in addition to IL7) and that CXCL12 regulates LTab levels, leukemic cells should be exhausted. Similarly, IL7 has been previously shown to stimulate LTab expression and B-ALL cells express the IL7R. Again, a decrease in IL7 should be unfavorable to B-ALL. How do they explain these discrepancies?

      We thank the reviewer suggestion of testing the impact of CXCR4 blocking in vivo on LTa1b2 expression. We performed these experiments which have now been included in the revised manuscript (Fig. 5C and 5D). In summary, we observed reduced LTa1b2 on ALLs transplanted into mice treated with AMD3100, a well-known CXCR4 antagonist. These data also show that CXCR4 signaling is not the only mechanism driving LTa1b2. These results further strengthen the main conclusions of the manuscript. Finally, to our knowledge no study has reported Lymphotoxin a1b2 upregulation in B-ALLs by IL-7.

      6/ In Supp 4A, RAG-/- mice are blocked at the pro-B cell stage and do not have pre-B cells. Please compare LTa and LTb expression by Artemis deficient pre-B cell to wt pre-B cells. In this experiment, the authors show that similarly to B-ALL artemis-/- pre-leukemic pre-B cells express high levels of LTab and induce IL7 downmodulation. Using mice deficient for LTbR in LepR+ cells, they show that IL7 expression is increased. However, in opposition to leukemic cells (see Figure 4F), pre-leukemic cells are increased in absence of LTab/LTbR signaling. Please explain this discrepancy. The authors use only one B-ALL model cell line for their demonstration (BCR-ABL expressing B-ALL). Another model should be used to confirm whether LTab/LTbR signaling does favor leukemic/pre-leukemic B cell growth.

      We apologize for the confusion. The mice that were used in this study were initially described by Barry Sleckman and colleagues (Bredemeyer et al. Nature 2008). Briefly, they crossed Artemis-deficient mice with VH147 IgH transgenic and EμBcl-2 transgenic mice to generate mice in which B cell development is arrested at the preB cell stage. The Vh147 heavy chain allows their development to the pre-BCR+ preB cell stage but Artemis deficiency prevents Rag protein re-expression and hence B cell can’t recombine light chain genes. The EμBcl-2 transgene allows preB cells to survive despite carrying unrepaired double-strand DNA breaks (DSB).

      Regarding the discrepancy noted by the reviewer we argue that this is not a discrepancy. While ALLs can grow in vitro and in vivo in the absence of IL7, non-leukemic developing B cells are strictly IL7 dependent. PreB cells carrying unrepaired DSBs still express IL7 receptor and although no data is currently available on whether these cells are also IL7-dependent, we speculate that they are. Because up-regulation of Lymphotoxin a1b2 in preB cells carrying unrepaired DSBs promotes IL7 downregulation we speculate that this mechanism may contribute to the efficient elimination of pre-leukemic preB cells in vivo. We revised the manuscript to include this explanation of the mouse model and discussion on how we think the LTbR pathway may play a role in pre-leukemic states.

      Finally, the data presented in this study includes two distinct leukemia mouse models. It also includes data from human B-ALL and AML samples that is in agreement with the mouse data presented here. We respectfully disagree with the reviewer that a third model is needed to confirm a role for the LTa1b2/LTbR pathway in leukemia.

      7/ Pre-B cells are composed of large pre-B cells (pre-BCR+) and small pre-B cells (pre-BCR-). BCR-ABL B-ALL cells express the pre-BCR. What is the level of expression of LTa and LTb by each of these 2 subsets as compared to BCR-ABL B-ALL?

      This is a misconception. The difference between large and small preB cells is simply that large preB cells are in S/G2 phase of the cell cycle. Their increased size is a mere consequence of doubling DNA, protein, membrane content, etc.

    1. Author Response

      Reviewer #1 (Public Review):

      It is a strength of the current manuscript that it provides a near-complete picture of how the metamorphosis of a higher brain centre comes about at the cellular level. The visualization of the data and analyses is a weakness.

      I do not see any point where the conclusions of the authors need to be doubted, in particular as speculations are expressly defined as such whenever they are presented.

      The fact that molecular or genetic analyses of how the described metamorphic processes are organized are not presented should, I think, not compromise enthusiasm about what is provided at the cellular level.

      We appreciate the comments and guidance that Reviewer #1 has given us on data presentation. We have tried to simplify figures and make the images larger. For the developmental figures, a couple of illustrative examples are provided in the main figure with the remainder given in “figure supplements”

      Reviewer #2 (Public Review):

      This very nice piece of work describes and discusses the developmental progression of larval neurons of the mushroom body into those in the adult Drosophila brain. There are many surprising findings that reveal a number of strategies for how brain development has evolved to serve both the early functions specific to the larval brain and then their eventual roles in the adult brain. I think it is fascinating biology and I was educated while reviewing the paper.

      Line 115-116. 'Output from PPL1 compartments direct avoidance behavior, while that from PAM compartments results in attraction'. This is not correct and is actually reversed. The learning rule is depression so that aversive learning reduces the drive to approach pathways whereas appetitive learning reduces the drive to avoidance pathways. This should be corrected and reference made to studies demonstrating learning-directed depression.

      Line 222. It provides feed-forward inhibition from y4>2>1. I could be wrong but I'm not aware that there is functional evidence for this glutamatergic neuron being inhibitory. It's currently speculation.

      We have noted that this function was proposed by Aso et al.

      Line 242. I think it would be nice if the authors focused on extreme changes and showed larger and nicer images. The rest can be summarized but why not pick a few of the best examples to illustrate the strategies they consider in the discussion?

      We have reduced the number of neurons shown in the new Figs 5 and 6. Hopefully, the images are now large enough to appreciate. Data for the remaining neurons are now in Figure Supplements for Figs 5 and 6.

      Line 249 'became sexually dimorphic'. I may have missed it somewhere but this immediately made me think about the sex of all the images that are shown. Is this explicitly stated somewhere? Was it tracked in all larvae, pupae, and adults?

      We now begin the Methods addressing this point. We did an initial screen and found sex-specific differences only in MBIN-b1 and -b2. After this time, we kept no records as to the sex of the fly that was used except for the latter cells.

      Reviewer #3 (Public Review):

      Truman et al. investigated the contribution and remodeling of individual larval neurons that provide input and output to the Drosophila mushroom body through metamorphosis. Hereto, they used a collection of split-GAL4 lines targeting specific larval mushroom body input and output neurons, in combination with a conditional flip-switch and imaging, to follow the fates of these cells.

      Interestingly, most of these larval neurons survive metamorphosis and persist in the adult brain and only a small percentage of neurons die. The authors also elegantly show that a substantial number of neurons actually trans-differentiate and exert a different role in the larval brain, compared to their final adult functionality (similar to their role in hemimetabolous insects). This process is relatively understudied in neuroscience and of great interest.

      Using the ventral nerve cord as a proxy, the authors claim that the larval state of the neuron would be their derived state, while their adult identity is ancestral. While the authors did not show this directly for the mushroom body neurons under study, it is a very compelling hypothesis. However, writing the manuscript from this perspective and not from the perspective of the neuron (which first goes through a larval state, metamorphosis, and finally adult state), results in confusing language and I would suggest the authors adjust the manuscript to the 'lifeline' of the neuron.

      We have tried to be more “linear” in our presentation. This should make the text less confusing.

      In general, this manuscript does not explain how the larval brain has evolved as the title suggests but instead describes how the larval brain is remodeled during metamorphosis. It thus generates perspectives on the evolution of metamorphosis, rather than the larval state. Additionally, this manuscript would benefit from major rearrangements in both text and figures for the story to be better comprehended.

      We think that the end of the Discussion does relate to how a larval brain evolves. The evolution of the larval brain is faced with constraints related to the shortened period of embryonic development and the highly conserved temporal and spatial mechanisms that insects use to generate their neuronal phenotypes. These constraints result in a potential mismatch between the neurons that are needed and those that are actually made (revealed by the adult phenotypes of these neurons). The larva then turns to trans-differentiation to temporarily transform unneeded (or dead) neurons into the missing cell types to build its larval circuits.

      We think that these ideas provide some new insights into how a larval brain may have evolved and that our title is appropriate.

      The introduction is very focused on the temporal patterning of the insect nervous system, while none of the data collected incorporate this temporal code. Temporal patterning comes back in the discussion but is purely speculative.

      The Speculation about the importance of temporal patterning is now brought in late in the Discussion in reference to Figure 12

      Furthermore, the second part of the introduction describes one strategy for remodeling and why that strategy is not likely but does not present an alternative hypothesis. The first section of the results might serve as a better introduction to the paper instead, as it places the results of the paper better and concludes with the main findings. The accompanying Figure 1 would also benefit from a schematic overview of the larval and adult mushroom bodies as presented in Fig. 2A (left).

      This has been revised in the spirit of these comments

      In the second results section, the authors show the post-metamorphic fates of mushroom body input and output neurons and introduce the concept of trans-differentiation. Readers might benefit from a short explanation of this process. I also encourage the authors to revisit this part of the text since it gives the impression that the neurons themselves undergo active migration (instead of axon remodeling).

      We have tried to make it clear that there is no cell migration. Rather there is retraction/fragmentation of larval arbors followed by outgrowth to new, adult targets

      The discussion starts with a very comprehensive overview of the different strategies that neurons could use during metamorphosis (here too, re-writing the text from the neurons' perspective would increase the reflection of what actually happens to them).

      The Discussion now begins by dealing with gross changes in the MB, with reference to the compartments and eventually moves to changes in individual cells. We have reduced our discussion of the metamorphic strategies of cells and no longer have Fig 8A

      The discussion covers multiple topics concerning trans-differentiation, metamorphosis, memory, and evolution and is often disconnected from the results. It could be significantly shortened to discuss the results of the paper and place them in current literature. Generally, the figures supporting the discussion are hard to comprehend and often do not reflect what the text is saying they are showing.

      The Discussion is still long, but, hopefully, our organization now makes it much easier to read and comprehend.

    1. Author Response

      Reviewer #1 (Public Review):

      Junctophilin is mostly known as a structural anchor to keep excitation-contraction (E-C) proteins in place for healthy contractile function of skeletal muscle. Here the authors provide a new interesting role in skeletal muscle for Junctophilin (44 kD segment, JPh44), where it translocates to the nuclei and influences gene transcription. Also, the authors have shown that Calpain 1 can digest junctophilin to generate the 44 kDa segment. The field of skeletal muscle generally knows little about how E-C coupling proteins have dual role and influence gene regulation that subsequently may alter the muscle function and metabolism. This part of the manuscript is solid, informative, and novel. The authors use advanced imaging and genetic manipulations of junctophilin etc to support their hypothesis. The authors then also aim to link this mechanism to hyperglycemia in individuals susceptible for malignant hyperthermia as they have elevated levels of the 44kDa segment. However, the power of the analyses are low and the included data comparisons complicates the possibility to interpret the results and its relevance. Nevertheless, the data supporting the novel dual role of junctophilin would likely be appreciated and gain attention to the muscle field.

      Thanks for your constructive reading. We agreed (in our answer to Item 1) to your concern regarding power of the tests. To improve it we would need many more individual patients (which, after the pandemic peaks, are starting to be recruited again, although at a pace of no more than 2 per month). We are committed to updating the present report as soon as we obtain, say, 20 more MHS and MHN patients –a minimum to impact power of the tests. In any case, we claim that power is not an acute concern, as this communication deals mainly with positive results, where significance is of the essence.

      We have established significance in most of the observations communicated here; in the few cases where p is marginal, significance is inferred by correlations.

      Reviewer #2 (Public Review):

      Skeletal muscle is the main regulator of glycemia in mammals and a major puzzle in the field of diabetes is the mechanism by which skeletal muscle (as well as other tissues) become insensitive to insulin or decrease glucose intake. the authors had proposed in a previous publication that high intracellular calcium, by means of calpain activation, could cleave and decrease the availability of GLUT4 glucose transporters. In this manuscript, the authors identify two additional targets of calpain activation. One of them is GSK3β, a specialized kinase that when cleaved, inhibits glycogen synthase and impairs glucose utilization. The second target is junctophilin 1, a protein involved in the structure of the complex responsible for E-C coupling in skeletal muscle. The authors succeeded in showing that a fragment of junctophilin1 (JPh44) moves from the triad to other cytosolic regions including the nuclei and they show changes in gene expression under these conditions, some of them linked to glucose metabolism.

      Overall, the manuscript shows a novel and audacious approach with a careful treatment of the data (that was not always easy nor obvious) that allow sensible conclusions and definitively constitutes a step forward in this field.

      Thanks for the generous report.

      Reviewer #3 (Public Review):

      First, we express utmost gratitude for your critical work on our manuscript. Your concerns made us perform additional experiments and validations, eventually forcing us to abandon a couple of erroneous notions and therefore improving our understanding and interpretations. Because your concerns were already in the “Essentials” list assembled by the Editor, our responses here will mostly refer to our earlier answers to the items in that list.

      1) Figure 1 A and B show a western blot of proteins isolated from muscles of MHN and MHS individuals decorated with two different antibodies directed against JPH1. According to the manufacturer, antibody A is directed against the JPH1 protein sequence encompassing amino acids 387 to 512 while antibody B is directed against a no better specified C-terminal region of JPH1. Surprisingly, antibody B appears not to detect the full-length protein in lysates from human muscles, but recognizes only the 44 kDa fragment of JPH1. However, to the best of the reviewer's knowledge, antibody B has been reported by other laboratories to recognize the full-length JPH1 protein.

      The reason for the failure of ab B to recognize the full human protein may be that it was raised against a murine immunogen (this interpretation was communicated to us by G.D. Lamb, who co-authored the 2013 paper by Murphy et al. where the failure was noted). It recognizes both JPh1 and JPh44 of murine muscle in our hands.

      Thus, is not obvious why here this antibody should recognize only the shorter fragment.

      We agree entirely. In spite of the difficulties in interpretation, the recognition of human JPh44 by the ab is, however, a fact, repeatedly demonstrated in the present study, which can be used to advantage.

      In addition, in MHS individuals there is no direct correlation between reduction in the content of the full-length JPH1 protein and appearance of the 44 kDa JPH1fragment, since, as also reported by the authors, no significant difference between MHN and MHS can be observed concerning the amount of the 44 kDa JPH1.

      Tentative interpretations of the lack of correlation have been presented in the response to Item 14, above.

      Based on the data presented, it is very difficult to accept that antibody A and B have specific selectivity for JPH1 and the 44 kDa fragment of JPH1.

      Indeed, we now acknowledge that Ab A reacts equally with JPh1 and the 44 kDa fragment (and provide quantitative evidence for it in Supplement 1 to Fig. 8). We also provide conclusive evidence of the specificity of ab B (e.g., Supplement 2 to Fig. 1).

      2) In Figure 2B staining of a nucleus is shown only with antibody B against the 44 kDa JPH1 fragment, while no nucleus stained with antibody A is shown in Fig 2A. Images should all be at the same level of magnification and nuclear staining of nuclei with antibody A should be reported. In Figure 2Db labeling of JPH1 covers both the nucleus and the cytoplasm, does it mean that JPH1 also goes to the nucleus? One would rather think that background immunofluorescence may provide a confounding staining and authors should be more cautious in interpreting these data.

      These items are fully covered in our response to Item 16.

      Images in 2D and 2E refer to primary myotubes derived from patients. The authors show that RyR1 signals co-localizes with full-length JPH1, but not with the 44 kDa fragment, recognized by antibody B. How do the authors establish myotube differentiation?

      Myotubes are studied 5-10 days after switching cells to differentiation medium, which is DMEM-F12 supplemented with 2.5% horse serum, as explained in Figueroa et al 2019. Cells with more than 3 nuclei were considered myotubes. Myotubes with similar degree of maturation (number of nuclei) were selected for experimental comparisons.

      3) Figure 3 A-C. The authors show images of a full-length JPH1 tagged with GFP at the N-terminus and FLAG at the C- terminus. In Figure 3Ad and Cd the Flag signal is all over the cytoplasm and the nuclei: since these are normal mouse cells and fibers, it is surprising that the FLAG signal is in the nuclei with an intensity of signal higher than in patient's muscle.

      Can the authors supply images of entire myotubes, possibly captured in different Z planes? How can they distinguish between the cleaved and uncleaved JPH1 signals, especially in mouse myofibers, where calpain is supposed not to be so active as in MHS muscle fibers?

      Answer fully provided to Items 16b and 17 in Essentials list.

      4) If the 44 kDa JPH1 fragment contains a transmembrane domain, it is difficult to understand the dual sarcoplasmic reticulum and nuclear localization. To justify this the authors, in the Discussion session, mention a hypothetical vesicular transport of the 44 kDa JPH1 fragment by vesicles. Traffic of proteins to the nucleus usually occurs through the nuclear pores and does not require vesicles. Even if diffusion from the SR membrane to the nuclear envelope occurs, the protein should remain in the compartment of the membrane envelope. There is no established evidence to support such an unusual movement inside the cells.

      In agreement with the criticism, we have removed the speculation from the Discussion.

      5) In Figure 5, the authors show the effect of Calpain1 on the full-length and 44 kDa JPH1 fragment in muscles from MHS patients. Can the authors repeat the same analysis on recombinant JPH1 tagged with GFP and FLAG?

      We agree that confirmatory evidence of the calpain effect on dual-tagged recombinant JPh1 would be desirable. However, we think an in-depth study is required to follow up on the number of JPh1 fragments generated by calpain (or by different calpain isoforms) and their positions, similar to the detailed study of JPh2 fragmentation Wang et al. in 2021 (5).

      Can the authors provide images from MHN muscle fibers stained with JPH1 and Calpain1.

      We complied with the request.

      6) In Figure 6, the authors show images of MHS derived myotubes transfected with FLAG Calpain1 and compare the distribution of endogenous JPH1 and RYR1 in two cells, one expressing FLAG Calpain1 (cell1) and one not expressing the recombinant protein. They state that cell1 shows a strong signal of JPH1 in the nucleus, while this is not observed in cell2. Nevertheless, it is not clear where the nucleus is located within cell2 since the distribution of JPH1 is homogeneous across the cell. Can the authors show a different cell?

      In agreement, we now show a comparison between cultures with and without transfection in Supplement 1 to Fig. 6.

      7) In Figure 7, panels Bb and Db: nuclei appear to stain positive for JPH1. It is not clear why in panels Ac, Bc they show a RYR1 staining while in panels Cc and Dc they show N-myc staining. The differential localization to nuclei appears rather poor also in these panels.

      We have entirely removed from the manuscript the description of experiments of exposure to extracellular calpain, including Fig. 7 and three associated tables.

      8) The strong nuclear staining in Figure 8, panels C and D is very different from the staining observed in Fig. 2 and Fig. 3. Transfection should not change the ratio between nuclear and cytoplasmic distribution.

      Transfection is an intrusive procedure, which requires production and trafficking of an exogenous protein. This protein, furthermore, is an artificial construct (in this case, a “stand-in”, which adds to the native protein and therefore is akin to overexpression). For the above reasons, we believe that differences in intensity of nuclear staining may obey to multiple causes and should not be especially concerning.

    1. Author Response

      Reviewer #1 (Public Review):

      1) This study performs an interesting analysis of evolutionary variation and integration in forelimb/hand bone shapes in relation to functional and developmental variation along the proximo-distal axis. They found expected patterns of evolutionary shape variation along the proximo-distal axis but less expected patterns of shape integration. This study provides a strong follow-up to previous studies on mammal forelimb variation, adding and testing interesting hypotheses with an impressive dataset. However, this study could better highlight the relevance of this work beyond mammalian forelimbs. The study primarily cites and discusses mammalian limb studies, despite the relevance of the suggested findings beyond mammals and forelimbs. Furthermore, relevant work exists in other tetrapod clades and structures related to later-developing traits and proximo-distal variation. Finally, variations in bone size and shape along the proximo-distal axis could be affecting evolutionary patterns found here and it would be great to make sure they are not influencing the analysis/results.

      We appreciate the reviewer’s comments, and we acknowledge the importance of including examples of non-mammalian lineages in our study. We attended to the recommendation and included more examples of other tetrapod taxa in our text and in our references, providing a more inclusive discussion of limb bone diversity beyond mammals. We also explain below why the results obtained are not inflated by variation of bigger versus smaller sizes of bones.

      Reviewer #2 (Public Review):

      10) Congratulations on producing a very nice study. Your study aims to examine the morphological diversity of different mammalian limb elements, with the ultimate goal seemingly to test expectations based on the different timing of development of the limb bones. There's a lot to like: the sample size is impressive, the methods seem appropriate and sound, the results are interesting, the figures are clear, and the paper is very well written. You find greater diversity and integration in distal limb segments compared to proximal elements, and this may be due to the developmental timing and/or functional specialization of the limb segments. These are interesting results and conclusions that will be of interest to a broad readership. And the large dataset will likely be valuable to future researchers who are interested in mammalian limb morphology and evolution. I have one major concern with how you frame your discussion and conclusions, which I explain below. But I think you can address this issue with some text edits.

      We sincerely thank the reviewer for his constructive recommendations and for his appreciation of our work. We addressed the issue raised as detailed below.

      11) Major concern - is developmental timing the best hypothesis?

      You discuss two potential drivers for the relatively greater diversity in distal elements: 1) later development and 2) greater functional specialization. Your data doesn't allow you to fully test these two hypotheses (e.g. you don't have detailed evo-devo data to infer developmental constraints), and I think you realize this - you use phrases like "consistent with the hypothesis that ...". You seem to compromise and conclude that both factors (development + function) are likely driving greater autopod diversity (e.g. Lines 302-306). Being unable to fully test these hypotheses weakens the impact of your conclusions, making them a bit more speculative, but otherwise, it isn't a critical issue.

      But my concern is that you seem to favor developmental factors over functional factors as the primary drivers of your results, and that seems backwards to me. For instance, early in the Abstract (Line 32) and early in the Discussion (Line 201) you mention that your results are consistent with the developmental timing hypothesis, but it's not until later in the Abstract or Discussion that you mention the role of functional diversity/specialization/selection. The problem with favoring the development hypothesis is that your integration results seem to contradict that hypothesis, at least based on your prediction in the Introduction (Line 126; although you spend some of the Discussion trying to make them compatible). Later in the paper, you acknowledge that functional specialization (rather than developmental factors) might be a better explanation for the integration results (Lines 282-284, 345-347), but, again, this is only after discussions about developmental factors.

      When you first start discussing functional diversity, you say, "high integration in the phalanx and metacarpus, possibly favoured the evolution of functionally specialized autopod structures, contributing to the high variation observed in mammalian hand bones." (Line 282). This implies that integration led to functional diversity in the autopod. But I'd flip that: I think the functional specialization of the hand led to greater integration. Integration does not result solely from genetic/developmental factors. It can also result from traits evolving together because they are linked to the same function. From Zelditch & Goswami (2021, Evol. & Dev.): "Within individuals, integration is customarily ascribed to developmental and/or functional interdependencies among traits (Bissell & Diggle, 2010; Cheverud, 1982; Wagner, 1996) and modularity is thus due to their developmental and/or functional independence."

      In sum, I think your results capture evidence of greater functional specialization in hands relative to other segments. You're seeing greater 1) disparity and 2) integration in hands, and both of those are expected outcomes of greater functional specialization. In contrast, I think it's harder to fit your results to the developmental timing hypothesis. Thus, I recommend that throughout the paper (Abstract, Intro, Discussion) you flip your discussion of the two hypotheses and start with a discussion on how functional specialization is likely driving your results, and then you can also note that some results are consistent with the development hypothesis. You could maintain most of your current text, but I'd simply rearrange it, and maybe add more discussion on functional diversity to the Intro.

      Or, if you disagree and think that there's more support for the development hypothesis, then you need to make a better case for it in the paper. Right now, it feels like you're trying to force a conclusion about development without much evidence to back it up.

      We thank the reviewer for his thoughtful and thorough comment. We agree that the results provided, particularly those of integration, support the hypothesis that functional specialization contributes to the uneven diversity of limb bones. We addressed the concerns by substantially changing our discussion, particularly moderating (and removing) sections on the developmental constraints and adding new arguments for other possible drivers for the diversity of limb bones, such as function. However, the goal of the paper was to test whether the data corroborate - or not - the predictions derived from the developmental hypothesis, and they largely do. Therefore, we decided to keep the developmental hypothesis presented first in the introduction and in the discussion section, as we believe this sequence provides more coherence considering the hypothesis tested (we believe that detailing the role of functional specialization particularly in the introduction would mislead the reader to think that we directly tested for these parameters). Following the discussion of the integration results, we then go on to discuss the possible role of functional specialization on the results obtained (lines 262-285, see also lines 216-234). Yet, these are not tested in this paper and remain to be tested in a future analysis focusing specifically on the role of ecology and function in driving variation in the mammalian limb.

      12) Limitations of the dataset

      Using linear measurements is fine, but they mainly just capture simple aspects of the elements (lengths and widths). You should acknowledge in your paper the limitations of that type of data. For example, the deltoid tuberosity of the humerus can vary considerably in size and shape among mammals, but you don’t measure that structure. The autopod elements don’t have a comparable process, meaning that if you were to measure the deltoid tuberosity then you’d likely see a relative increase in humerus disparity (although my guess is that it’d still be well below that of the autopod). And you omit the ulna from your study, and its olecranon process varies considerably among taxa and its length is a very strong correlate of locomotor mode. In other words, your finding of the greatest disparity in the hand might be due in part to your choice of measurements and the omission of measurements of specific processes/elements. I recommend that you add to your paper a brief discussion of the limitations of using linear measurements and how you might expect the results to change if you were to include more detailed measurements and/or more elements.

      We followed the recommendation and included a discussion about the dataset limitations, acknowledging for the possible impact of the measurements and the bones chosen in the results obtained (Lines 235-260).

      Reviewer #3 (Public Review):

      32) This paper uses a large (638 species representing 598 genera in 138 families) extant sample of osteologically adult mammals to address the question of proximodistal patterns of cross-taxonomic diversity in forelimb bony elements. The paper concludes, based on a solid phylogenetically controlled multivariate analysis of liner measurements, that proximal forelimb elements are less morphologically diverse and evolutionarily flexible than distal forelimb elements, which the paper concludes is consistent with a developmental constraint axis tied to limb bud growth and development. This paper is of interest to researchers working on macroevolutionary patterns and sources of morphological diversity.

      Methodological review Strengths:

      The taxonomic dataset is very comprehensive for this sort of study and the authors have given consideration to how to identify bony elements present in all mammalian taxa (no small task with this level of taxonomic breadth). Multivariate approaches as used in this study are the gold standard for addressing questions of morphological variations.

      The authors give consideration to two significant confounders of analyses operating at this scale: phylogeny and body size. The methods they use to address these are appropriate, although as I note below body size itself may merit more consideration.

      We sincerely thank the reviewer for his appreciation of our study. We addressed the main concerns pointed out below.

      Weaknesses:

      33) The authors assume a lot of knowledge on the part of the reader regarding their methods. Given that one of their key metrics (stationary variance) is largely a property as I understand it of OU models, more explanation on the authors' biological interpretation of stationary variance would help assess the strength of their conclusions, especially as OU models are not as straightforward as they first appear in their biological interpretation (Cooper et al., 2016).

      We acknowledge that this may not be straightforward and now include a more extensive explanation of the approach and the metrics used. We detailed the explanation about the stationary variances in the methods, contextualizing the biological meaning (lines 456-469).

      34) It is unclear what the authors mean when they say they "simulated the trait evolution under OU processes on 100 datasets". Are the 100 datasets 100 different tree topologies (as seems to be the case later "we replicated the body mass linear regressions with 100 trees from Upham et al (2019)." If that is so, what is the rationale for choosing 100 topologies and what criteria were used to select the 100 topologies?

      We understand the explanation may have been confusing. Globally, we used a parametric bootstrap approach to assess the uncertainty around point estimates for morphological diversity and integration. That is, we first simulated 100 datasets on the maximum clade credibility tree (MCC tree, that summarizes 10,000 trees from Upham et al. 2019) – using the best fit model on our original data (i.e., an OU process) with parameters estimates from this model fit. The model (an OU process) was then fit to these 100 simulated traits, and the distribution of parameters estimates obtained was used to assess the variability around the point estimate (for the determinant, the trace, and the measure of integration) obtained on empirical data. We did not used the simulated dataset to estimate the significance of the stationary variances. We fitted the empirical datasets with 100 trees randomly sampled from the credible set of 10,00 trees of Upham et al (2019) – instead of using the MCC – to further assess the variability due to the tree topology and branching times uncertainties. We included this expanded explanation in the methods in lines 421-428 and 471.

      35) The way the authors approach body mass and allometry, while mathematically correct, ignores the potential contribution of body mass to the questions the authors are interested in. Jenkins (1974) for example argued that small mammals would converge on similar body posture and functional morphology because, at small sizes, all mammals are scansorial if they are not volant. Similarly, Biewener (1989) argued that many traits we view as cursorial adaptations are actually necessary for stability at large body sizes. Thus size may actually be important in determining patterns of variation in limb bone morphology.

      We agree with the observation. We believe that categorizing the groups according to size would provide a meaningful overview on the effect of size on the diversity and evolution of limb bones. Although insightful and worthy of investigation, we were particularly interested in understanding whether developmental timing corresponds to bone diversification more broadly across Mammalia and thus considered only the size residual values. This issue will be addressed in our future works. We discussed in the lines 329-341 the potential contribution of body size to limb segment diversification and the importance of considering this aspect in future studies.

      36) Review of interpretation.

      The authors conclude that their result, in showing a proximo-distal gradient of increasing disparity and stationary variance in forelimb bone morphology, supports the idea that proximo-distal patterning of limb bone development constrains the range of morphological diversity of the proximal limb elements. However, this correlation ignores two important considerations. The first is that the stylopod connects to the pectoral girdle and the axial skeleton, and so is feasibly more constrained functionally, not developmentally in its morphological evolution. The second, related, issue arises from the authors' study itself, which shows that the lowest morphological integration is found in the stylopod and zeugopod, whereas the autopod elements are highly integrated. This suggests a greater tendency towards modularity in the stylopod and zeugopod, which is itself a measure of evolutionary lability (Klingenberg, 2008). And indeed the mammalian stylopod is developmentally comprised of multiple elements (the epiphyses and diaphysis) that are responding to very different developmental and biomechanical signals. Thus, for example, the functional signal in stylopod (Gould, 2016) and zeugopod (MacLeod and Rose, 1993) articular surface specifically is very high. What is missing to fully resolve the question posed by the authors is developmental data indicating whether or not the degree of morphological disparity in the hard tissues of the forelimb change over the course of ontogeny throughout the mammalian tree, and whether changing functional constraints over ontogeny (as is the case in marsupials) affect these patterns.

      We thank the reviewer for sharing such an interesting reinterpretation of the results. Combined to the recommendations from the other two reviewers, we substantially changed our discussion, specially modifying the interpretation of results concerning trait integration. We discussed the possible role of the functional variation at the articulations on element integration in lines 263-285.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      [Reviewer's comments]

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary In this article Roure et al address the role of BMP during formation of the ascidian palps, using Ciona intestinalis. Overexpression of BMP (specifically ADMP) from early stages of development results in complete suppression of palp formation, and early loss of the palp forming region (also called anterior neural border ANB). Using p-Smad1/5/8 antibody staining they show a marker of the ANB (FoxC) is expressed in a region negative for BMP signals. Inhibition of BMP signals is not sufficient to produce ectopic ANB. However, treatment with FGF protein from very early stages (8-cell stage) plus inhibition of BMP signaling (from 8-cell stage) increased FoxC expression. Looking at later stages of development the authors show that in a U-shaped expression domain of Foxg, Smad1/5/8 is active in the ventral-most part, which is expected to form the ventral-most palp. BMP2 treatment from gastrula stages results in loss of the ventral most palp expression of Isl and repression of ventral Foxg expression. Inhibition of BMP signaling from gastrula or neurula stages results in failure of a U-shaped pattern of Isl expression to resolve into the three palp expression domains, and by late tailbud stages, Sp6/7/8/9 (proposed as a repressor of Foxg in the inter-palp territory) expression is reduced and the numbers of specific cell-types making up the palps is increased. These cells are present in a single large palp of dorsal identity. Thus, inhibition of BMP from early gastrula stages results in a single palp made of more cells than the three palps of control larvae, presumably due to recruitment of cells usually present between the palps. The authors then show a similar phenotype in another ascidian species Phallusia mammillata. Using their previous RNA-Seq data of embryos treated with BMP4, they looked for potential novel palp markers and identify a further eight novel markers of the palps. Looking further into this data and at a list of 68 genes expressed in palps (but not exclusively) they find that in whole embryo RNA-Seq data 70% were regulated by BMP signaling, mostly repressed, but some activated by BMP. 30 of these genes were regulated by Notch. Apart from the confusion I explained in my comments below, the data seems to be carefully presented and interpreted. Overall, this manuscript presents a more detailed analysis of the role of BMP signaling during ascidian palp formation, but it remains to be precisely understood.

      [Response]

      We thank the reviewer for the evaluation of our work.

      Major comments

      1) I am a little confused about the timing of the protein treatments. In Figure 2, the authors show nicely that at the neurula stages, P-Smad1/5/8 staining abuts the FoxC ANB territory. Then at late neurula P-Smad1/5/8 is detected in the ventral-most part of the Foxg U-shaped part of the palp forming region, presumably the ventral most palp. However, the protein treatments with BMP (and FGF) are carried out from the 8-cell stage, which seems a bit drastic and embryos look difficult to orientate (e.g. Fig. 3D).

      [Response]

      We first would like to clarify the issue raised from Figure 3. Actually, Figure 3D was the only case where the embryo was shown from the side (the description as a lateral view was inadvertently omitted in the legend). We have now modified Figure 3 by properly showing only dorsal (neural plate) views and lateral views in insets when necessary. In addition, we have added schemes of embryos depicting the main tissues we have examined (palps, CNS and epidermis) and their localization depending on the treatments.

      Regarding the timing of treatments, we performed them at the 8-cell stage to make them manageable to perform. At the latest, bFGF treatment should be performed at the 16-cell stage (before neural induction at the 32-cell stage), while BMP2 treatment should be performed at the 64-cell stage (before the onset of Foxc/partial effect at early gastrula (St. 10)). In principle, sequential treatment (first bFGF, then BMP2) could thus be performed. Since earlier treatments, produce the same effects, we reasoned that combined treatments from the 8-cell stage should be equivalent and would avoid fastidious repeated manipulation of the embryos that could negatively impact their development. We are convinced that the way we performed the treatment has no impact on our results (except for the treatment by bFGF alone on Foxc as already discussed in the text) and conclusions.

      While BMP-treatment from early stages inhibits all palp gene expression and any sign of palp formation (Figure 1), treatment with BMP from the early gastrula stage, when Smad1/5/8 is detected only in mesendoderm cells and before it is detected in any ectoderm, is sufficient only to block ventral palp formation and cause a partial down-regulation of FoxC expression in the ANB. Thus, there seems to be a discrepancy between the roles proposed for BMP during ANB and palp formation as judged by P-Smad1/5/8 staining and the temporal evidence from BMP- and BMP-inhibitor treatment. Do the authors have some explanation for why they need to treat at least one hour before the BMP-mediated patterning mechanism (as indicated from the P-Smad1/5/8 staining) is taking place? For example, could the authors check how long it takes DMH1 to inhibit P-Smad1/5/8 positive staining? Or BMP to strongly induce P-Smad1/5/8? This seems to be a simple experiment and might go some way to explaining why they need to treat embryos much earlier than I would have thought necessary.

      [Response]

      We understand the reviewer's concerns, but we do not think that there are major discrepancies in the timing of events. The main rationale is to consider the onset of expression for the main genes of interest. We have examined their dynamics of expression in details, but we do not show them since our conclusions are in agreement with a previous report (Figure 1 from Liu and Satou, 2019). We have summarized the data in the modified Figure 2. Foxc can be detected from early gastrula stages (St. 10) when the palp precursors consist of a single row of 4 cells. This is the exact developmental time when the treatment with BMP2 has partial effects (Figure 4). Once the cells divide to make 2 rows of 4 cells robustly expressing Foxc (St. 12), BMP2 treatment has no effect on Foxc. Similarly, DMH1 treatment has no effect from late neurula stage (St. 16) (Figure 4) that corresponds to the onset of Sp6/7/8/9 expression. We thus consider that modulating BMP pathway has no effect once key regulatory genes have acquired a robust expression in their normal domains. We have enhanced these points in the main text (lines 205-208, lines 228-229).

      We think the above discussion should address the points raised by the reviewer. In the contrary, we are willing to perform the suggested experiments.

      2) It does not make sense to me that BMP treatment from gastrula stage blocks only ventral palp formation (Figure 4) and ventral Foxg expression (Fig. 5G). In particular, it is the ventral palp region which is positive for P-Smad1/5/8 (Fig.2I,J) so I would not expect the ventral palp to be the most sensitive to BMP-treatment.

      [Response]

      We were, like the reviewer, surprised by the phenotype. The time window to obtain this phenotype is quite narrow, and most likely deals with the full acquisition of the palp fate ('consolidation' of Foxc expression, onset of Foxg). This is actually a phenotype that we have not characterized in details. And such a characterization may help clarify the role of BMP: does BMP regulate papilla/inter-papilla fates only for the ventral palp or for all three palps? Does BMP 'only' regulate the dorso-ventral identities of the palps?

      To better understand the role of BMP in palp formation, we propose to describe this specific phenotype: loss of ventral palp induced by BMP2 treatment at St. 10. We propose to test the following hypotheses. What is the fate of the ventral palp? Conversion into epidermis (more ventral fate)? Conversion into inter-papillar fate? What is the identity of the 2 remaining presumptive palps? Do they still have a dorsal identity? Are they converted into ventral palps? This is part of the proposed experiments for a revision.

      Minor comments line 185 I see what the authors are trying to say but I don't agree that BMP limits the domain of FoxC expression as inhibition of BMP has no effect on FoxC. Rather BMP has to be kept out of the ANB in order to allow ANB formation.

      [Response]

      We have modified the sentence (lines 195-196).

      The relationship between Foxg and Sp6/7/8/9 expression is not really clear and it would be better to do this with double ISH if the authors want to show mutually exclusive expression domains, or at least provide a summary figure.

      [Response]

      We have modified Figure 5 by adding schematic representations of our understanding of the expression patterns in relation to the different precursors of the palp lineage.

      In case the reviewer does not find this clarification sufficient, we propose to perform the double fluorescent in situ hybridizations as part of the revision plan.

      Line 218, I do not see the data showing that Isl is expressed at a U-shape at st. 23, it seems to be expressed in three dots, unless embryos are treated with DMH1.

      [Response]

      We apologize for the misunderstanding since the sentence was not clear. We referred to the U-shaped Isl expression under BMP inhibition. Indeed, Isl starts to be expressed in 3 separate domains in the palp forming region, and not following a U-shape as its upstream regulator Foxg (Liu and Satou, 2019). We amended the sentence (lines 234-235).

      Figure 6B, G. It could be nice to show a close up of the palps to see elongated cells.

      [Response]

      The close up pictures have now been added in the modified Figure 6.

      Figure 6K. It is better to use a statistical test to support the authors conclusions.

      [Response]

      As suggested, we have performed a statistical evaluation (Mann-Whitney U test) of the cell counts. The p-values are presented in Figure 6Q. The slight increase of Celf3/4/5/6 is not statistically significant, but it does not impact our conclusion that the number of papilla cells increases following BMP inhibition.

      It could be nice to provide a timeline for Smad1/5/8 signaling and the role for BMP signals that are proposed in this manuscript as a summary diagram.

      [Response]

      Following the suggestion, we have added summary diagrams in Figure 2 for BMP signaling in relation to lineages and gene expression.

      lines 66-74 is lacking references.

      [Response]

      This is now corrected (lines 70-80).

      Reviewer #1 (Significance (Required)):

      Significance While it is still not clear how BMP signals are established (which ligands for example) and their precise role in palp formation, this manuscript adds more information to our current understanding of the role of BMP signaling during palp formation. In particular it shows that BMP signals need to be kept out of the ANB for its formation and that it is required to resolve the later forming palp territory into three discrete palp regions. However, there is some way to go before this is fully understood. This article will certainly be of interest to ascidian developmental biologists trying to understand the formation and patterning of the larval PNS. It may also be of some interest to evolutionary biologists trying to understand the relationship between the telencephalon territory of vertebrates and the palp forming territory of ascidians as some links have been proposed between these two developmental territories (e.g. line 78).

      [Reviewer's comments]

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary. The manuscript presents a detailed examination of how dynamic changes in BMP signaling during the development of the ascidian larval palps. Early in development BMP inhibition is responsible for the formation of a large field within the neuroectoderm that includes, among other fates, the presumptive palps. As development progresses, the territories of BMP activity/inhibition appear to be spatially refined within the palp-forming territory to specify palp versus interpalp fate. The experiments are presented with sufficient replication and statistical rigor.

      [Response]

      We thank the reviewer for the evaluation of our work.

      Major Comments.

      1. The researchers should look at otx expression in pFOG>Admp overexpressing embryos. It is difficult to assess from Figure 1, but it appears possible the the entire anterior sensory vesicle (not just the palps) are absent in the pFOG>Admp embryos (can the authors say briefly whether other ectodermal structures such as the atrial primordia or the oral siphon are still present?). Thus, is it possible that the entire a-lineage is disrupted? This would be an important distinction to make: are the defects attributed to experimental BMP activation specific to the palps, or are they more widespread in the anterior neuroectoderm? If the entire a-lineage is mis-fated, might this change the interpretation of the role of BMP inhibition? For example, might the formation of the palps depend on the proper development of the neighboring anterior neural plate? To address this concern, the authors should use a different driver to restrict Admp overexpression only to the palp forming region.

      [Response]

      In Figure 1, we show that Celf3/4/5/6, a general neural marker was still expressed in pFog>Admp embryos. We explain, in the Figure 1 legend, that this most likely corresponds to the CNS. It does not demonstrate that the anterior sensory vesicle (a-line induced CNS lineage) is still present. Unfortunately, Otx cannot be used as a suitable marker since it is also expressed in the posterior sensory vesicle (A-line lineage) (Hudson et al., 2003). Other a-line markers do exist. However, determining their expression at tailbud stages may not be conclusive since it is most likely that the patterning of the sensory vesicle (hence the expression of these markers) is modified after BMP activation. We have presented in former Figure 3 and Figure S1, strong evidence that the a-line neural lineage is intact at the neural plate stage. To better communicate these data, we have combined then in a modified Figure 3 that includes all markers examined and interpretative embryonic schemes. We show that, following BMP2 treatment, Otx and Celf3/4/5/6 were downregulated in the palp lineage but otherwise normal. Consequently, the a-line CNS lineage is most likely not affected by BMP pathway activation. This does not mean that its later derivatives form normally, but this is an issue that we have not addressed. A previous report indicates that BMP activation leads to Six1/2 repression and, possibly, the absence of oral siphon primordium (based on the images, no description in this paper) (Figure 1 from Abitua et al., 2015).

      We think that we have addressed the concern of the reviewer, but would like to comment on the suggested experiment. It is very difficult to find a driver that would allow BMP activation only in the palp lineage (by overexpressing a constitutive active BMP receptor for example). a-line neural linage and palp lineage are intimately linked and separate at gastrula stages (St. 10). The regulatory sequences of Foxc, the first palp specific gene that we know, would thus be interesting. But it is most likely too late according to our whole embryo protein treatments (Figure 4). In agreement with this assumption, overexpressing Bmp2/4 (another BMP ligand) using the regulatory sequences of Dmrt (a master regulator of the palp+a-line CNS lineage expressed just before Foxc) does not apparently abolish palp formation (Extended Data Figure 5 from Abitua et al., 2015).

      1. The authors hypothesize that papilla versus inter-papilla fate is controlled by differential BMP signaling. Is it possible to show differential P-Smad staining in papilla versus inter-papilla territories, as in Figure 2 for earlier gastrula-stage embryos? This data would make the authors hypothesis much more compelling. It appears that the authors have the necessary reagents.

      [Response]

      The actual lineage and fate segregation of papilla and inter-papilla lineage has not been determined as far as we know. Our current understanding comes from indirect evidence from gene expression and gene function, in particular from the study of Foxg and Sp6/7/8/9 by Liu and Satou (2009). Papillae originate from the 3 Foxg/Isl positive spots that are visible at very early tailbud stages. At earlier stages, Isl is not expressed and Foxg is expressed with a U-shape (Figure 5). Within this U, it is most likely that the segregation of papilla and inter-papilla fates takes place when Sp6/7/8/9 starts being expressed at late neurula stages. It is thought that Sp6/7/8/9+/Foxg+ cells will become inter-papilla cells while Sp6/7/8/9-/Foxg+ will become papilla. Our data indicate that BMP signaling is active in the future ventral papilla. We have mapped these data on schematics in the modified Figure 2.

      Minor Comments.

      1. There is no mention of panels Figure 1 U and V in the text. In the figure legend they are misidentified as panels S and T.

      [Response]

      This has been corrected.

      Very small issue with English usage that occurs throughout the manuscript. The authors should check the use of "palps" versus "palp", particularly when expressions such as the following are used: "palps formation", "palps network", "palps lineage", "palps differentiation", "palps molecular markers", "palps neuronal markers", "palps phenotypes", etc . For example, the sentence, "Here, we show that BMP signaling regulates two phases of palps formation in Ciona intestinalis", should read instead "Here, we show that BMP signaling regulates two phases of palp formation in Ciona intestinalis".

      [Response]

      Thank you, we have corrected these mistakes.

      It would be worth mentioning possible relationships between the tunicate palps and the adhesive glands for larval fish and amphibians. Are there common mechanisms? All of these are anterior ectoderm derivatives.

      [Response]

      Thank you for the suggestion. We have added a section on that topic in the discussion (line 358).

      Please consider providing references in the Introduction for the sentences which end on the following lines of text: 36 ( . . . is the sister group of vertebrates), 46 ( . . . and sensory properties), 48 ( . . . the secretion of adhesive materials), 57 ( . . . on the nervous system in chordates), 68 ( . . . also known as Ap2-like), 74 ( . . . anterior neural territories)

      [Response]

      References have now been added.

      To provide extra emphasis and to help the figures to stand alone with their respective legends, can you mention in the legend for Fig. 2 that D and E are controls? Also, can a brief legend be provided for S2 to give overall indication of staging, scale, orientation, etc.?

      [Response]

      Actually, the original Fig 2D and 2E correspond to treated embryos as explained in the legend. For clarity, these embryos have been separated from control embryos in the modified Figure 2.

      Figure S2 has modified and a legend has been added.

      Reviewer #2 (Significance (Required)):

      Significance.

      This study presents an advance in our understanding of the fine-structure regulation of BMP signaling in sculpting neuroectoderm derivatives. While this study is potentially of broad interest, the authors fail to fully discuss the comparative aspects of this study in the context of conserved chordate developmental mechanisms. This could be remedied without too much difficulty in the Discussion section.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: This paper explores the role of BMP signaling for palp formation in ascidians using gain and loss of function approaches. The paper shows that while BMP at early (gastrula) stages prevents formation of the Foxc-positive palp ectoderm in Ciona, at later stages it appears to be essential for separation of the palps (possibly by promoting differentiation of interpapillary cells). The paper further shows that BMP plays similar roles in a different ascidian, Phallusia mammillata. Using previously published RNA-Seq results for the latter species after BMP up-regulation, the authors were able to identify additional BMP-responsive genes expressed in the palp region of ascidians.

      [Response]

      We thank the reviewer for the evaluation of our work.

      Major comments: However, while the effect of BMP overexpression at early stages has been confirmed by two independent strategies (electroporation of the BMP agonist ADMP and BMP2 treatment), the effects of late BMP activation as well as the effects of BMP inhibition at both early and late stages have been studied exclusively by pharmacological treatments with a single BMP signaling agonist (BMP2) and antagonist (DMH1). To substantiate these findings and rule out unspecific side effects, it would have been desirable to verify them with alternative strategies.

      [Response]

      The reviewer may have missed some of our data. We have shown that BMP inhibition through overexpression of the secreted antagonist Noggin via electroporation using the early ectodermal driver pFog gives the same phenotypes as DMH1 treatment. The effects on Foxc * were presented in Figure S1, and are now presented in the modified Figure 3 (line 170). We also showed that the morphological Cyrano phenotype was observed with Noggin overexpression (modified Figure 6H). We now present a novel Figure S1 with expression of Isl and Celf3/4/5/6* following Noggin overexpression, and stress the use of this independent way of inhibiting BMP (lines 260-264). Given that early or late BMP inhibition lead to the same phenotype, we do not consider that overexpressing Noggin at gastrula stages is necessary.

      Regarding BMP activation from gastrula stages, we have only used BMP2 treatment. It may be possible to overexpress Admp using promoters active in the palp lineage such as the ones of Dmrt, Foxc or Foxg. However, it may be difficult to phenocopy the phenotype obtained using BMP2 protein (loss of ventral palp), for two reasons. First, the precise timing to reach high BMP activation is not tightly controlled using such a method. Hence, all drivers should be tested. Second, the different promoters are active progressively later in development and in more and more restricted regions. Consequently, we consider that this requires a huge effort to validate a method (BMP protein treatment) that we already validated for the early effects and that has been used in several publications.

      Therefore, while this study provides some new insights into the role of BMP in the specification of the palp forming region and subsequent palp development in ascidians, the evidence provided is relatively weak. Moreover, the scope of the study is quite limited. While identifying some BMP-responsive genes expressed in the palp region and describing the effects of BMP dysregulation on palp morphology, the study does not provide further insights into the underlying mechanisms how BMP patterns this region or affects subsequent palp formation.

      [Response]

      We are surprised by the appreciation of the reviewer describing our work as 'some new insights'. To our knowledge, this is the first report addressing the role of BMP signaling in palp formation at the molecular level. The only previous report by Darras and Nishida (2001) describes solely the morphology of the palps following overexpression of Bmp2/4 and Chordin overexpression by mRNA injection. We have brought significant novel findings 1) two important steps in palp formation with a precise description of the cellular and molecular actors, and a proposed function for BMP at each step, 2) evidence for conservation of this process in different ascidian species and 3) significant enrichment in the molecular description of this process. Moreover, the reviewer does not ask for specific items, we thus feel in the impossibility to offer satisfaction.

      Minor comments:

      • 63: ...as the anterior...

      [Response]

      Corrected.

      • 68, 71, 74: references missing

      [Response]

      References have now been added.

      • 73: better: anterior neural territories and placodes

      [Response]

      Corrected.

      • 76: palp territories also share molecular signature with anterior (eg. olfactory) placodes, not only telencephalon

      [Response]

      Corrected.

      • 106: awkward sentence

      [Response]

      Corrected.

      • 114: at what stage was ADMP electroporated?

      [Response]

      Electroporation of plasmid DNA is performed in the fertilized egg. Transcription of the transgene is controlled by the driver. In this case, with pFog, it occurs from the 16-cell stage. This precision has been added in line 121.

      • 134: to facilitate comparison between stages it would be useful to label cells in Fig. 2(eg. which are a-line and b-line cells? Where is the border between them?)

      [Response]

      As suggested by the reviewer, we have modified Figure 2 with embryo outlines and schemes to better appreciate where BMP signaling is active.

      • 152: since Foxc and Foxg overlap with pSMAD1/5/8 at neurula but not gastrula stages, do you know whether this is due to a dorsal expansion of BMP activity or a ventral expansion of Foxc/Foxg expression? Again, labeling of the nuclei would help

      [Response]

      The change corresponds to a dorsal expansion of P-Smad1/5/8. Our conclusion comes from combining nuclear staining (not shown for simplicity) and available fate maps. The results are presented in schematic diagrams of embryos in frontal views in the modified Figure 2.

      • 174: the description is not clear here; what proportion of embryos did show reduction versus expansion of expression?. Why is the reduction shown in Fig.3 D asymmetrical?

      [Response]

      The proportions are now indicated in line 184.

      We apologize for the impression led by Fig 3D. Actually, it was the only case where the embryo was shown from the side (the description as a lateral view was inadvertently omitted in the legend). It did not show an asymmetric repression but an ectopic expression. We have now modified Figure 3 by properly showing only dorsal (neural plate) views and lateral views in insets when necessary. In addition, we have added schemes of embryos depicting the main tissues we have examined (palps, CNS and epidermis) and their localization depending on the treatments. We hope that the results are now clearly presented.

      • 198: ... of endogenous...

      [Response]

      Corrected (line 213).

      • 208: I suggest to highlight the regions of changes in Fig. with asterisks/arrows etc.

      [Response]

      We have added schematic embryos to highlight expression changes in the modified Figure 5.

      • 218: contrary to what is stated here, there is no depiction of u-shaped Isl1 expression in control embryos of Fig. 4

      [Response]

      As also pointed by reviewer 1, we apologize for the misunderstanding since the sentence was not clear. We referred to the U-shaped Isl expression under BMP inhibition. Indeed, Isl starts to be expressed in 3 separate domains in the palp forming region, and not following a U-shape as its upstream regulator Foxg (Liu and Satou, 2019). We amended the sentence (lines 234-235).

      • 220: the cell shapes referred to here cannot be seen in Fig. 4 (too small)

      [Response]

      We have modified Figure 6 to include close up of the palps.

      • 271: the description here is confusing: first you talk about 53 genes and the mention palp expression of 12/26. Where does number 26 come from? And why was in situ done then for 27 additional genes? Also, while the comparison with previously published RNA-Seq data was valuable in uncovering additional BMP-sensitive palp markers, it does not provide any substantial new insights into how BMP patterns this territory.

      [Response]

      We have amended the sentence to make it clearer (lines 291-295).

      • line 624: where

      [Response]

      Thank you. Corrected line 731.

      • Fig. 2: to facilitate comparison between stages it would be useful to label cells (eg. which are a-line and b-line cells? Where is the border between them?)

      [Response]

      Already responded above.

      -Fig. 3: Why is the expression in D asymmetrical? In the main text you write that expression is expanded in some embryos but reduced in others - Please show examples also of the expanded phenotype and give numbers

      [Response]

      Already responded above.

      • Fig. 6: small panels in I, L, N need to be explained (single channels), white signal needs to be explained (overlap ?)

      [Response]

      We used white for better display of separate single channels. Given the confusion and the good quality of the 2 color fluorescent in situ images, we removed these panels in the modified Figure 6.

      White in K and L correspond to overlap (explained in the legend).

      • Fig. S2: legend is missing

      [Response]

      This has been amended.

      Reviewer #3 (Significance (Required)):

      Since the study does not provide substantial new insights into the mechanisms how BMP patterns the palp forming region or affects subsequent palp formation in ascidians, it will be of interest mostly for a specialized audience in the field of developmental biology.

      [Response]

      We do not agree with the reviewer as discussed above. The description of the role of BMP signaling in the specification of the ANB and its subsequent patterning in ascidians has interesting evolutionary implications and should be of interest for a broader audience.

    1. Author Response:

      Reviewer #1 (Public Review):

      This paper reports an analysis of the inhibition of the serotonin transporter, SERT, by a novel compound, ECSI#6. The authors perform a comprehensive analysis of SERT transport inhibition for the new agent and compare its properties to those of other well-characterized agents: cocaine and noribogaine, with the data pointing to an unusual noncompetitive mechanism of inhibition, a model also supported by electrophysiological recordings of transport currents. Based on the results of these experiments the authors conclude that ESCI#6 binds essentially exclusively to the inward-facing state of the transporter. The authors further present experiments suggesting that ESCI#6 can stabilize the folded form of an ER-arrested SERT mutant and recover its trafficking to the plasma membrane, with some in-vivo drosophila experiments perhaps also supporting this conclusion. Finally, kinetic simulations using a transport model with rate constants from previous experiments support the basic conclusions of the first sections of the paper.

      Strengths:<br /> The transport experiments and simulations here are thorough, carefully performed, and reasonably interpreted. The authors' arguments for noncompetitive inhibition seem well-thought-out and reasonable, as is the conclusion that ESCI#6 binds to the inward-facing state of the transporter. The simulations are also thorough and support the conclusions. In the discussion, the comparison of enzyme noncompetitive inhibition to the process studied here was thoughtful and interesting. Also, the care and analysis of the uptake data are a strength of the paper, with well-presented evidence of reproducibility and statistics. The electrophysiology data is more limited but does communicate the essential conclusion.

      Weaknesses:<br /> The most important concern about the work is the interpretation of the in-vivo drosophila data. Though the SERT fluorescence with WT protein is strong, I cannot see any fluorescence in either drug-treated image from the PG mutant. In this context, shouldn't there be additional intracellular staining for ER-resident SERT? If the cell bodies of these cells are elsewhere this should be clearly pointed out.

      We have modified Fig. 6 to include, in all instances, images of the posterior brain, where the neurons (FB6K) reside, from which the serotonergic projections originate. These images visualize expression of membrane-anchored GFP (mCD8GFP; in panel B), immunoreactivity of serotonin (panel B’), wild type SERT (panels C’,D’,E’) and mutant SERT-PG601,602AA (panels F’,G’,H’) in the soma. The description of these panels has been added to the pertinent sentences starting on p. 20, line 6 from bottom to the end of end of the first paragraph p. 21, which read:

      “These projections (Fig. 6A-A’’) and the FB6K-type neurons, from which they originate in the posterior brain (Fig. 6B-B’’) can be visualized by expressing membrane-anchored GFP (i.e. GFP fused to the C-terminus of murine CD8; [36]) under the control of TRH-T2A-Gal4. Similarly, when placed under the control of TRH-T2A-Gal4, YFP-tagged wild-type human SERT was expressed in the FB6K-type neurons (Fig. 6C’) and delivered to the fan-shaped body (Fig. 6C). In contrast, in flies harboring human SERT-PG601,602AA, the transporter was visualized in the soma of FB6K-type neurons (Fig. 6F’), but the fan-shaped body was devoid of any specific fluorescence (Fig. 6F). However, if three-day old male flies expressing human SERT- PG601,602AA were fed with food pellets containing 100 μM ECSI#6 or 100 μM noribogaine for 48 h, fluorescence accumulated to a level, which allowed for delineating the fan-shaped body (Fig. 6G and H, respectively). This show that ECSI#6 and noribogaine exerted a pharmacochaperoning action in vivo, which partially restored the delivery of the mutant transporter to the presynaptic territory. As expected, in flies harboring wild-type human SERT, feeding of ECSI#6 and noribogaine did not have any appreciable effect on the level of fluorescence in the fan-shaped body (Fig. 6D and E, respectively). “

      Similarly, the single Western blot demonstrating enhanced glycosylation in the presence of Noribogaine or ECSI#6 could be strengthened. I can see the increased band at a high molecular weight that the authors attribute to the fully glycosylated form, but this smear, and the band below, look quite different from those in the blot shown in the El-Kasaby et al reference, raising concerns that the band could be aggregated or dimerized protein rather than a glycosylated form. This concern could easily be addressed by control experiments with appropriate glycosidases, as shown in the reference.

      We understand that the appearance of the mature glycosylated species is being criticized, at least in part, because it differs from sharper bands, which can be found in our previously published papers. We stress that the resolution very much depends on the electrophoretic conditions. We addressed the reviewers’ criticism by carrying out the recommended deglycosylation experiments: a representative experiment is shown in (the new) panel F of Fig. 5, with lysates prepared from HEK293 cells expressing wild type SERT, from untransfected HEK293 cells and from HEK293 cells, which had been preincubated with 30 μM cocaine, 100 μM ECSI#6 and 30 μM noribogaine. The experiment confirms the band assignment with the upper band(s) M representing the mature glycostylated species (which are resistant to deglycosylation by endoglycosidase H) and the lower band C corresponding to the core- gylcoylated species (which are susceptible to cleavage that (as expected) the mature band show a representative degylcosylation by endoglycosidase H). We also think that the immunoblot in panel F ought to satisfy the aesthetic criticism: the bands are sharper/less smeared.

      The description of panel F can be found on p. 18, starting in line 7 from bottom to end of page, and reads: “We confirmed the band assignment by enzymatic deglycosylation (Fig. 5F): the upper bands (labeled M), which appeared in cells incubated in the presence of ECSI#6 and of norbogaine, were resistant to deglycosylation by endoglycosidase H (which cannot cleave mature glycans). In contrast, the core-glycosylated species (labeled C), was susceptible to cleavage by endoglycosidase H resulting in the appearance of the deglycosylated band D.”

      The overall interest in the work is reduced given the quite low affinity of ECSI#6 for the transporter.

      We agree that it would be preferable to have a compound, which works in the submicromolar/nanomolar range. However, it is worth pointing out that the EC50 is low enough for allowing in vivo rescue of the folding-deficient SERT-PG: feeding flies restores its trafficking to the cell surface and to the presynaptic specialization. Obviously, there is room for improvement, but ECSI#6 provides a starting point.

      Reviewer #3 (Public Review):

      This is interesting research that uncovers a novel inhibition mechanism for serotonin (SERT) transporters, which is akin to traditional un-competitive inhibitors in enzyme kinetics. These inhibitors are known to preferentially bind to the enzyme-substrate complex, thus stabilizing it, resulting in a decrease of the IC50 with increasing substrate concentrations. In contrast to this classic enzyme inhibition mechanism, the authors show for SERT, through detailed kinetic analysis as well as kinetic modeling, that the inhibitor, ECSI#6, binds preferentially to the inward-facing state of the transporter, which is stabilized by K+. Therefore, inhibition becomes "use-dependent", i.e. increasing substrate concentrations push the transporter to the inward-facing configuration, which then leads to the increased apparent affinity of ECSI#6 binding. Interestingly, this mechanism of action predicts that the inhibitor should be able to rescue SERT misfolding variants. The authors tested this possibility and found that surface expression and function of a misfolding mutant SERT is increased, an important experimental finding. Another strength of the manuscript is the quantitative analysis of the kinetic data, including kinetic modeling, the results of which can reconcile the experimental data very well. Overall, this is important and, in my view, novel work, which may lead to new future approaches in SERT pharmacology.

      With that said, some weaknesses of the manuscript should be mentioned. 1) The authors suggest that serotonin and ECSI#6 cannot bind simultaneously to the transporter, however, no direct evidence for this conclusion is provided.

      We assessed this point by plotting the data in Fig. 2A,B,C as Dixon plots in (the new) panels D,E,F of Fig. 2. We refer the reader to Segel’s textbook on enzyme kinetics (new ref. 18) on using Dixon plots in the presence of two inhibitors. The pertinent description is on p. 9, lines 12-22 and reads as follows: “We transformed the data summarized in Figs. 2A-C by plotting the reciprocal of bound radioligand as a function of inhibitor concentration to yield Dixon plots (Fig. 2D-F): the x-intercept corresponds to -IC50 of the inhibitor [18]. Thus, Dixon plots allow for differentiating mutually exclusive from mutually non-exclusive binding, if one inhibitor (i.e., cocaine, noribogaine or ECSI#6) is examined at a fixed concentration of the second inhibitor (i.e., serotonin) [18]: if binding of the two inhibitors is mutually non-exclusive, a family of lines of progressively increasing slope, which intersect at -IC50, is to be seen. In contrast, if the two inhibitors bind to the same site, the slope of the inhibition curves is not affected and the x- intercept (i.e, -IC50 of the variable inhibitor) is shifted to more negative values. It is evident from Fig. 2D-E that the presence of 1 and 10 μM serotonin progressively shifted the (expected) x-intercept for cocaine (Fig. 2D), noribogaine (Fig. 2E) and ECSI#6 (Fig. 2D). Thus, binding to SERT of serotonin and of these three ligands was mutually exclusive.” Based on the Dixon plots, we feel that our conclusion is justified, i.e., binding of serotonin and ECSI#6 (and of the other ligands) is mutually exclusive.

      2) How does ECSI#6 access the inward-facing binding site? Does it permeate the membrane and bind from the inward-facing conformation, or is it just a very slowly transported low-affinity substrate that stabilizes the inward-facing state with much higher affinity? Including ECSI#6 in the recording electrode may provide further information on this point.

      We did the suggested experiments: the data are summarized in panel I of Fig. 4 and described in the first paragraph on p. 15, which also explains why this experiments is possibly inconclusive due to the high diffusivity of ECSI#6:

      “Fig. 4I shows representative traces of 5-HT induced currents recorded from SERT expressing cells in the absence (in blue) and presence of 100 μM ECSI#6 (in red) in the electrode solution: when thus applied from the intracellular side, ECSI#6 did not cause an appreciable current block. The right-hand panel summarizes the current amplitude obtained from cells measured in the absence (blue open circles) and presence of intracellular ECSI#6 (open circles in red). These data seem to indicate that ECSI#6 binds to SERT from the extracellular side. Yet this conclusion can be challenged based on the following consideration: in earlier experiments, ibogaine, the parent compound of noribogaine, was found to block HERG channels when applied from the bath solution but failed to do so when added to the electrode solution [27]. However, at a lower intracellular pH (i.e., pH 5.5), ibogaine gained the ability to inhibit HERG from the intracellular side (i.e., via application through the electrode). Conversely, ibogaine was less effective when applied to an acidified bath solution. These observations led to the conclusion that ibogaine blocked HERG from the cytosolic side: because the molecule in its neutral form was so diffusive, a low intracellular pH was required to force its protonation and thus preclude diffusion from the interior of the cell. ECSI#6 is presumed to also be very diffusible given its estimated logP value and polar surface area of 2.48 and 66 Å2, respectively. However, ECSI#6 harbors an amide nitrogen (see Fig. 1A) and thus remains neutral in the experimentally accessible pH range. Hence, it is not possible to verify to which side of SERT it binds.”

      Additionally, it is not clear why displacement experiments were not carried out with cocaine. Since cocaine is a competitive inhibitor but does not induce transport (i.e. doesn't induce the formation of the inward-facing conformation), it should act in a competitive mechanism with ECSI#6.

      We did not quite understand this comment, because displacement experiments were done with cocaine (Fig. 2A, new Fig 2G/previous Fig. 2D). However, if the reviewer questions why we do not use cocaine rather than 5-HT, in the three-way competition experiment, it is precisely, because we wanted to compare the action/binding mode of ECSI#6 to that of cocaine.

      3) Why are dose-response relationships not shown for electrophysiological experiments? These would be a good double-check for the radiotracer flux data.

      These experiments were done and are shown in (the new) panels G and H of Fig. 4; the pertinent description is in the second paragraph of p. 14 and reads:

      “The protocol depicted in Fig. 4B can also be used to gauge the apparent affinity of ECSI#6 for SERT in the presence of 5-HT. Plotted in Fig. 4G is the block of the serotonin-induced current as a function of the co-applied ECSI#6 concentration. The current was evoked by a saturating concentration of 5-HT (30μM) and inhibited by 3, 10, 30 and 100 μM co-applied ECSI#6, respectively (the inset in Fig. 4G shows representative current traces). A fit of an inhibition curve to the data points yielded an IC50 value of approx. 5 μM. This value was lower but still in reasonable agreement, with the IC50 obtained in the radioligand uptake assay for the condition where the 5-HT concentration had been saturating (cf. dashed line in Fig.1C; 10 μM 5-HT). In the uptake assay the IC50 value of ECSI#6 dropped to about 0.5 mM, in the presence of a low 5-HT concentration (i.e., 0.1 μM). In contrast to uptake experiments, electrophysiological recordings also allow for assessing the apparent affinity of ECSI#6 for SERT in the absence of the substrate. This can be achieved by employing the protocol depicted in Fig. 4H (see representative current traces on the left-hand side): we first applied 30 μM 5- HT to a cell expressing SERT for 0.5 s to elicit a peak current (i.e., a control pulse). We then reapplied 30 μM 5-HT after a superfusing the cell with 100 μM ECSI#6 for 1 s (second upper trace in panel H). We chose this time period because it had been sufficient to allow for full current block in the other protocol (see Fig. 4G): the amplitude of the peak current following pre-application of 100 μM ECSI#6 was essentially identical to the prior control pulse. When we pre-applied 100 μM ECSI#6 for a longer period (i.e., 3 s) the amplitude of the two peak currents also remained the same (cf. lower traces in panel H). The right-hand panel shows the summary of several experiments. Plotted in the graph is the ratio of the second and first pulse, which was always close to one. We previously used this protocol to assess the binding kinetics of cocaine, methylphenidate and desipramine on SERT and DAT. Pre-application of these inhibitors consistently led to a concentration dependent reduction in the peak current amplitude of the second pulse in comparison to the first [23]. The lack of inhibition, thus, indicates that the affinity of ECSI#6 in the absence of 5-HT is low. To obtain estimates for the affinity of ECSI# for SERT in the absence of 5-HT we would need to apply this compound at much higher concentrations. This, however, is not possible, because ECSI#6 is poorly soluble in aqueous solutions (i.e., max. 0.03 mg/ml).”

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1

      This paper identifies a role for the hereditary spastic paraplegia protein spatacsin in lysosome morphology, positioning and dynamics, and undertakes detailed mechanistic studies to try to identify the mechanism for this effect. In doing so the paper elucidates further mechanistic information about the properties of two other hereditary spastic paraplegia proteins, spastizin and AP5Z1. The work is done in mammalian cells and uses a combination of over-expression, depletion and biochemical studies. The main findings are:

      1. The authors present evidence that spatacsin is an ER-localised protein.
      2. Murine embryonic fibroblasts lacking spatacsin have a reduced number of tubular lysosomes and the remaining lysosomes are less motile. In general, a relationship between tubular lysosome morphology and lysosome motility, often in association with the endoplasmic reticulum (ER), is demonstrated. These tubular lysosomes are catalytically active and acidic.
      3. In terms of mechanism of this effect, by combining a yeast-two hybrid and siRNA phenotypic screen, the authors identify a number of spatacsin-interacting proteins that also regulate lysosomal tubulation. The most important of these for the purposes of this paper is UBR4, an E3 ubiquitin ligase.
      4. The authors show that spatacsin and UBR4 promote degradation of AP5Z1, and that this property required the ability of spatacsin to interact with UBR4. Somewhat surprisingly, as AP5Z1 is a coat protein, this degradation appeared to occur within the lumen of the lysosome - the authors speculate how this could be in the discussion.
      5. The authors then demonstrate that AP5Z1 and spastizin, both hereditary spastic paraplegia proteins, compete for binding with spatacsin.
      6. The relationship between spatacsin, spastizin, AP5Z1 and motor proteins in then examined. There is a known interaction between spastizin and KIF13A and expression of a dominant negative KIF13A protein reduced lysosomal tubulation. The authors then demonstrate an interaction between AP5Z1 and the p150Glued dynein/dynactin complex member, then showed that expression of a dominant negative p150Glued protein reduced lysosomal tubulation.
      7. Finally, that authors demonstrate the relevance of these findings to neurons, the target cells of hereditary spastic paraplegia, by showing that lysosomal tubulation and axonal transport are reduced in mouse neurons lacking spastacsin, and that depletion of UBR4 or AP5Z1 affected these as expected from the experiments above.

      Major comments:

      Overall I believe that the key conclusions of this paper are generally convincing and that the work is of high quality. However, I do have some reservations:

      1. The localisation of spatacsin on the ER. It is always difficult to be convinced about colocalization of a diffuse punctate marker and the ER. From the STED experiments in figure 1, while it definitely seems that there is some spatacsin on the ER, there also appears to be some spatacsin puncta that are not. I'd like to know if these puncta represent lysosome-associated spatacsin. This is important for interpretation of the subsequent experiments (see point 3 below). I also think quantification of these co-localisation will increase confidence in the results. In addition, a caveat of the immunofluorescence studies is that they use over-expressed spatacsin. I appreciate that there are no good antibodies to endogenous spatacsin, but I don't think this limitation is sufficiently acknowledged. As the claim of ER-localisation is critical for the proposed mechanistic model, and in the absence of experiments with endogenously tagged spatacsin, this makes the biochemical fractionation studies of figure 1C very important. To make these more convincing I would prefer to see additional control markers to verify the separation of lysosomal and ER compartments - e.g. lamp1, lamp2, an ER tubular marker such as a REEP5 or a reticulon.

      Authors response : We agree with the reviewer that the localization of spatacsin is critical, and we appreciate the knowledge of the reviewer concerning the lack of good antibodies to endogenous spatacsin. We better acknowledged this limitation in our revised manucript (p. 5 and p. 15). We performed extra experiments to convincingly show that spatacsin is indeed localized at the ER. First, we performed 3-color STED experiments to visualize in the same cell spatacsin, the ER and lysosomes. The preliminary data seem to indicate that some spatacsin is associated with lysosomes at ER-lysosomes contact site. We plan to add quantifications of colocalization between spatacsin and ER staining at STED resolution to better support the fact that spatacsin is a protein of the ER.

      Moreover, as requested, we have performed a western blot with Lamp2 and REEP5 antibodies on the ER- and lysosome-enriched fractions (New Figure 1B). This western blot shows that a significant proportion of Lamp2 is present in the ER-enriched fraction, which may be explained by the strong association of ER with late endosomes and lysosomes. Yet the lysosome-enriched fractions that contained no ER markers do not present spatacsin staining, suggesting that spatacsin is either in the ER or in lysosomes associated with the ER that are not positive for cathepsin D. We reformulated the text of Figure 1 according to the new included data (p. 5-6).

      The authors generally do a good job of quantifying their results. However, this is lacking for the biochemical experiments (immunoblotting and IP) in figures 4 and 5, and I would prefer to see these quantified (the quantification should include data from repeat experiments so that we can judge the reproducibility of the results).

      Authors response : We agree that our presentation did not indicate that the western blots were repeated several times. We have added quantifications for the western blots present in Figures 4 and 5.

      On page 10, referring to the proximity ligation results, the authors comment: "This suggests that the spatacsin-spastizin interaction occurs at contact sites between the ER and lysosomes to allow spastizin recruitment to lysosomes". I'm not sure this statement is fully supported, as mentioned at point 1 above it is possible that some steady state spatacsin is at lysosomes. To fully support this, we'd need to see the PLA signal also convincingly co-localise with an ER marker.

      Authors response : We will perform extra PLA experiment to indeed show that the spots where spatacsin and spastizin colocalize with an ER marker. This data will be added in Figure 5.

      In figure 6C and D the effect of spastizin on lysosomal tubulation and dynamics is investigated. Wartmannin treatment is used to do this, as it is known to remove spastizin from lysosomes. However, this is a very indirect manipulation that could have many other consequences and it would be better to demonstrate this directly by showing the effect of depletion of spastizin on lysosomal morphology/dynamics. I also think the role of AP5Z1 in tubulation/dynamics would be better supported with additional experiments to deplete the protein - at present only over-expression is examined.

      Authors response: *We added new data to answer this comment. Downregulation of spastizin using siRNA led to lower number of tubular lysosomes and decreased the proportion of dynamic lysosomes, showing that spastizin is required to regulate lysosome motility (Figure 6B-6C Supplementary Figure 7B). We have also added new data regarding downregulation of AP5Z1 (Figure 6A-6C-Supplementary 7A). Both overexpression and downregulation of AP5Z1 using siRNA decreased the number of tubular lysosomes and decreased the proportion of dynamic lysosomes (Figure 6A-6C-Supplementary Figure 6C-D). *

      This observation suggests that the levels of AP5Z1 must be tightly regulated to control lysosome motility. We added discussion about this point as well (p.12-13).

      While the experiments showing that over-expression of dominant negative forms of KIF13A and p150Glued affect lysosomal tubulation/dynamics provide good circumstantial evidence that spatacsin influences these lysosomal properties via its interactions with spastizin and AP5Z1 (which bind to these motor proteins), the authors have not shown that the interaction of the motor proteins with spastizin and AP5Z1 is required for this ability to regulate lysosome tubulation/dynamics. This means that the model presented in figure 7 is not fully supported by the data. If the authors have been able to map the binding regions for these interactions then perhaps this could be investigated with rescue experiments, although I appreciate that this is potentially a major piece of work and perhaps outside the scope of this paper. An alternative would be that the authors acknowledged this part of the model as somewhat speculative.

      Authors response : We agree with the reviewer that our data do not show that KIF13A and p150Glued interact directly with spastizin and AP5Z1 to regulate lysosome dynamics. It has previously been shown that the adaptor complex AP2 interacts with p150glued via the ear domain of AP2 b subunit (Kononenko et al, 2017). It is therefore likely that the interaction of adaptor complex 5 with p150-Glued also occurs via AP5B1 subunit, and thus interaction of AP5Z1 with p150 glued would be indirect. *We discussed this point carefully (p.16). *

      *Regarding the interaction of Spastizin with KIF13A, it was identified by yeast-two hybrid screen and validated by GST-pulldown (Sagona et al, 2010). This showed that KIF13A interacts with the C-terminal domain of Spastizin, and we discussed this point. To confirm that KIF13A interaction with spastizin is required to promote its role in tubular lysosome formation and dynamics, we can perform an experiment where we downregulate endogenous mouse spastizin using siRNA and express either full length human spastizin to rescue the effect of the siRNA, or overexpress a human spastizin lacking its C-terminal domain required for the interaction with KIF13A (where we would expect no rescue). This would strengthen our conclusion on the role of KIF13A in link with spastizin to regulate the formation and dynamics of tubular lysosomes. We could add these data in Figure 6 (or Supplementary Figure 7). *

      • Are the experiments adequately replicated and statistical analysis adequate?

      In general I am not convinced that the statistical tests are applied rigorously in this paper. Most experiments are done three times, but the "n" used for statistical testing is typically chosen as, e.g. the number of cells, number of lysosomes, rather than number of biological repeat experiments. This means that inter-experimental variability is not rigorously taken into account. A more rigorous practice would be to use the mean measures for each of three biological repeats and apply the statistical tests to the three means, so n=3 if three repeats were done. Superplots would be a nice way to graphically display these data.

      Authors response : We agree with the comments of the reviewer regarding data presentation. We have therefore changed the presentation of all graphs of the manuscript using superplots that allow us to show all the points that were analyzed as well as the mean value for each biological replicate, and performed statistical analyses by comparing the biological replicates as proposed in Lord et al, JCB 2020 (10.1083/jcb.202001064).

      Minor comments:

      1. In supplementary figure 3D I cannot honestly say that I see the smaller band.

      Authors response : We agree that this western blot is not clear. We will provide a new western blot.

      When first called out, I expected supplementary tables 1 and 2 to show the list of interactors with wild-type spatacsin and spatacsind32-34 respectively, but this is not what they show.

      Author response : We have added two supplementary data tables (Now Supplementary Tables 1 and 2) to give the list of interactors of wild-type C-terminal domain of spatacsin and spatacsinD32-34, respectively.

      Supplementary Tables 3 and 4 now refer to the analysis of the downregulation experiments by respectively the neural network method and the tubular lysosome detection method.

      The experiments in Figure 4A are a little problematic in the way that they are called out. The first call refers to just a small subset of the data in the figure, and the figure is then called out at various points later in the paper. This is quite confusing. Is there any way this could be simplified?

      Authors response :We agree with the reviewer that Figure 4A was called at various points of the manuscript. This was to avoid duplicating data into two separate figures. However, we have modified the presentation of Figure 4 and Figure 5. We have included new Figure 4C to show that downregulation of UBR4 prevents the degradation of AP5Z1 upon overexpression of Spatacsin-GFP, but also in basal conditions in wild-type fibroblasts. The co-IP that was originally presented in Figure 4A has now been moved into Supplementary Figure 6A.

      The section on page 10: "Spatacsin also interacts with spastizin, and is required to recruit spastizin to lysosomes (Hirst et al., 2021). ........ We hypothesized that spatacsin interaction with spastizin was required for spastizin localization to lysosomes." Is odd, as the authors seem to be hypothesising an observation that they have just said has already been demonstrated.

      Authors response : We agree that these sentences were odd. We have rephrased the paragraph (p. 11).

      Can the authors explain why there is so little interaction between wild-type KIF13A and spastizin?

      Authors response : The interaction domain of spastizin with KIF13A is close to the motor domain according to the two-hybrid data published by Sagona et al (2010). The dominant negative construct of KIF13A that is devoid of the motor domain (KIF13A-ST) may thus facilitate access of spastizin to binding domain. We have commented on this point in the text (p.13).

      In figure 6G p150Glued signal is also present in the control IP lane, which casts doubt on the specificity of the interaction. Could the authors generate a cleaner result?

      Authors response : We have repeated the experiment 3 times, always with some p150Glued signal present in the control IP. Of note, as stated in the method section, we have increased the concentration of NaCl in the washing of this co-IP to decrease non-specific binding of p150glued to control beads, but we could not get cleaner results so far. We will try to get cleaner western blot to illustrate Figure 6G.

      I would be interested to see how AP5Z1 expression differs between neurons with and without spatacsin- we would expect similar results to those shown in the MEFS.

      *Authors response : We have not checked the levels of AP5Z1 in neurons with and without spatacsin yet. However, the complete knockout of spatacsin strongly modifies the levels of its partners. We previously showed that spastizin levels are decreased by >90% in Spg11 knockout brain (Branchu et al, 2017). Furthermore, the levels of AP5Z1 have been shown to be decreased by ~50% in fibroblasts of SPG11 and SPG15 patients (Hirst et al, 2015). *

      *Our work shows that spatacsin promotes the degradation of AP5Z1 by lysosomes. It is possible that other degradation mechanism(s) may exist and could explain the lower levels of AP5Z1 in knockout cells. We discussed this point (p.15). *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study Pierga et al. report that SPG11 (spatacsin) is an ER-resident protein involved in the regulation of ER-lysosome contact sites (in particular tubular lysosomes) and subsequent faster motility of tubular lysosomes, as well in the degradation of AP5Z1 (SPG48), which forms a heterotrimeric complex with SPG15 (spastizin) and SPG11. This complex has been localized by several groups on the cytoplasmic side of LAMP-1-positive lysosomes. In addition, mutations in SPG11, SPG15, and SPG48 patients share various clinical features and were supported by biochemical/cell biological data from Spg11 and Spg15 KO mouse models and cultured cells both from patients and mice, respectively, demonstrating e.g. accumulation of autolysosome storage material, defects in the autophagic lysosome reformation process, and the loss of cortical motoneurons and Purkinje cells.

      Major concerns:

      i) Fig. 1, 2, 3: major disadvantage of this study is the analysis of overexpressed proteins (SPG11-V5, GFP-Sec61, and Lamp1-mCherry) which might contribute to the observed strong expression of SPG11-V5 in the ER/ER-enriched fraction. The results should be compared with the endogenous expressed proteins.

      Authors response :* As stated by reviewer 1, there are no good antibodies to endogenous spatacsin, and therefore we have to rely on expression of tagged spatacsin to study its localization by immunohistochemistry. For the colocalization with the ER, we stained the latter by GFP-Sec61 that is a widely used marker for this compartment. To confirm our results, we plan to try to perform new STED imaging with REEP5 antibody to stain the ER, and Lamp1 antibody to label lysosomes, avoiding overexpression of proteins to label the subcellular compartments. Furthermore, as it is not possible to localize endogenous spatacsin by immunostaining, we addressed its localization by biochemical fractionation and western blots comparing wild-type and Spg11 knockout samples. *

      For Figure 2, the data presented were indeed obtained using transfection of Lamp1-mCherry. However, we confirmed our observation of Figure 2A using alternative staining of lysosomes (Lysotracker or loading of lysosomes with Texas-Red Dextran). We therefore think that our data presented in figure 2 are valid, and that the effect we observed on tubular lysosomes was not affected by expression of Lamp1-mCherry.

      In Figure 3, the lysosome were labelled with Texas-Red Dextran, and thus all the data presented in figure 3 do not rely on overexpression.

      In Fig. 1C the lack of the mature Cathepsin D form which is proteolytically generated only in lysosomes from the higher molecular mass precursor is misleading and should be related to presence of lysosomal membrane proteins.

      Authors response: As requested, we have performed a western blot to show the lysosomal membrane protein Lamp2 on the ER- and lysosome-enriched fractions (Figure 1B). This western blot shows that a significant proportion of Lamp2 is actually present in the ER-enriched fraction, which may be explained by the strong association of ER with late endosomes and lysosomes previously described (Friedman et al, 2013). Yet the lysosome-enriched fractions that contained no ER markers do not present spatacsin staining, suggesting that spatacsin is either in the ER or in lysosomes associated with the ER. We reformulated the text of Figure 1 according to the new included data (p 5-6). The 3-colours STED experiment that we plan to perform to answer reviewer 1 comments will help discriminate between these possibilities.

      Fig. 1D: the TEM image shows only a single lysosome and proposed ER contact zones in wt-MEFs without comparison with Spg11 KO MEFs (only in the quantification). Without double immunogold labeling of SPG11 (and their lack on SPG11 KO cell lysosomes) and known ER contact-site proteins this image and the conclusion are insufficient.

      Authors response : We have added an image of a lysosome taken from a knockout fibroblast (Figure 1E). As stated above there are no good antibodies to spatacsin for immunostaining, so it will not be possible to perform double immunogold labelling. This prevents us from claiming that spatacsin is a protein enriched at contact site. We therefore modulated our result section and discussion accordingly (p.5-6 and p.16).

      ii) The rationale for the selection of the deleted Spg11 region D32-34 is not clear. What are the symptoms of this Spg11 knock-in mouse? A more detailed description of the phenotype is required Is the phenotype including the accumulation of LC3-positive material similar to the phenotype of the SPG11 KO mouse which has been published by Varga et al.(2015) and Branchu et al. (2017) ? If not, is the new mechanisms reported here not so important?

      Author response : We have added new data (Supplementary Figure 3E-F) showing motor and cognitive impairment in mice expressing truncating spatacsin, although the motor dysfunction is slightly less marked than in Spg11 knockout animals. We also checked for accumulation of autophagy markers. We did not use LC3, but p62 that labels substrates to be degraded by autophagy. We observed accumulation of p62 in Spg11 knockout and in Spg11D32-34/D32-34 mouse neurons (Supplementary Figure 3G). These data support the functional importance of the domain encoded by exons 32 to 34 of Spg11. We commented on this in the text (p.9).

      iii) p8/Fig. 3F/Suppl.Fig.3F- the most important part of the manuscript: what are the parameters of lysosomal staining in images that were used to identify genes important for lysosome tubulation by the neural network?

      Authors response : For screening in Figure 3, lysosomes were stained by loading fibroblasts with Texas-Red Dextran overnight, followed by a wash of at least 4 hours. The neural network was first trained to discriminate between control and Spg11-/- fibroblasts, using any parameters of the lysosomal staining, not necessarily lysosome tubulation. This is a completely unsupervised and unbiased method, but one of its drawbacks is that we do not know which parameters were used by the network to discriminate between control and Spg11-/- fibroblasts. Therefore, we validated the classification performed by the neural network on a data set independent from the training set before using it for the screening. We rephrased the paragraph to make it clearer (p.9).

      I cannot understand how the authors predict the probability of the cell to be considered as an Spg11 KO fibroblast (why not as an Spg11 D32-34 knock-in fibroblast?) as the basis for the selection of interaction candidates.

      Author response : The neural network was trained on sets of images obtained from wild-type and Spg11 KO fibroblasts, which were expected to represent extreme lysosomal phenotypes linked to spatacsin function. We could therefore predict the probability of cells to be considered as Spg11 KO, not as Spg11 *D32-34 fibroblasts. We clarified this in the text (p9). *

      A simple statement that the neural network approach identified those genes is too weak and requires more convincing experimental data. It has to be shown at least for the 8 positive genes in both approaches how the siRNA treatments of these genes phenocopied the lysosomal changes and of course the effect of the downregulation on the protein level of their products both in wild-type control and Spg11 D32-34 knock-in MEF. The Suppl. Fig.3F is completely unclear. How were the Y2H interaction partner validated? Did the authors use the identified 8 interaction candidates as full length bait to demonstrate the interaction with the Spg11 exons 32-34 ?

      Author response : The purpose of the siRNA screen was to quickly identify putative candidates important for the regulation of lysosome dynamics. We identified 8 candidates possibly implicated in lysosomes dynamics based on the two analysis methods. We have added in Supplementary Figure 4 C-D the effect of both siRNA on lysosomal function by the two methods of analysis compared to the effect of siSPG11. However, here we aimed to identify candidates and we do not claim that every one of these eight proteins were indeed implicated in the regulation of lysosome dynamics. We corrected the text, accordingly, stating that the products of the 8 identified genes are good candidates to regulate lysosomal function (p.10). We validated the role of one of the identified candidates, UBR4, and we showed that the UBR4 siRNA indeed downregulates the protein level (Figure 4C). We only validated the interaction of spatacsin Cter with UBR4 by co-immunoprecipitation (Figure 4B).

      *For the 7 remaining candidates, full characterization would indeed be required to validate their role and elucidate their mechanisms of action, but this is out of the scope of this manuscript. *

      p8/Fig.3F: the genes identified in both approaches have to be listed in the Fig. 3F-Table.

      Authors response : We have added in new Figure 3F the list of the 8 candidate genes that could contribute to regulate lysosome function.

      The GO process- ubiquitin-dependent protein catabolic process is neither positive for the neural network nor for the directed analysis but positive for both analyses? Please explain. Similarly, the GO process proteolysis involved in cellular protein catabolic process -is not positive for the neural network analysis but again positive for both analyses.

      Authors response : We agree with the reviewer that Table 3F in its older version could be a bit confusing. GO analysis is based on “enrichment” of biological processes within a list of proteins. As we did not have the same number of proteins in the 3 analyses provided in original Table 3F, we got variability in the identified biological processes. To simplify, we have therefore chosen to present only the GO analysis for the 8 candidates that were most likely implicated in lysosomal dynamics according to our two analyses of the siRNA screen which is the most relevant for our study (new Figure 3G).

      For Fig. 3G the mutant ubiquitin-K0 staining in wild-type MEF cells has to be shown as well as for the Spg11 ki/KO MEFs (+ quantification of the respective data)

      Authors response : As stated by Reviewer 4, the expression of lysine-null ubiquitin may impact many different cellular pathways. We therefore removed this part of the data in order to simplify the manuscript (p.10)

      iv) The interpretation of the Y2H-interactome analysis by the authors is hard to follow. They searched with the exon 32-34 cDNA for binding partner, selected 3 degradative GO processes and showed by overexpression of a mutant Ub-K0 plasmid in wild-type MEFs a decreased number of tubular lysosomes, as well as their dynamics (without showing the control data in Spg11 KO or ki-MEFs). Thus, poly-ub of proteins should be in some way responsible for a lysosomal phenotype of Spg11ki MEFs.

      Now they went to AP5Z1, the second binding partner of SPG11, which is reduced in its abundance upon overexpression of Spg11-GFP. I would expect to do the respective control experiment to show that in the absence of SPG11 or in the knock-in cells the amount of AP5Z1 has to increase. However, in the studies by the Huebner group by deletion of Spg11 or the other binding partner Spg15, no increase of AP5Z1 protein levels has been observed. The authors have to comment on this discrepancy.

      *Authors response : We agree that this is an important point to discuss, and we failed to do it in our first version. *

      *The complete knockout of spatacsin strongly modifies the levels of its partners. We previously showed that spastizin levels are decreased by >90% in Spg11 knockout (Branchu 2017). Furthermore, the levels of AP5Z1 have been shown to be decreased by ~50% in fibroblasts of SPG11 and SPG15 patients (Hirst et al, 2015). *

      Our work shows that spatacsin promotes the degradation of AP5Z1 by lysosomes. It is possible that other degradation mechanism may exist, and could explain the lower levels of AP5Z1 in knockout cells. Furthermore, it was proposed that AP5Z1 stability may depend on the presence of spatacsin and spastizin (Hirst et al., 2013)*. Therefore spatacsin may contribute to tightly regulate AP5Z1 levels by contributing both to its stability, and to its degradation. We have carefully discussed this point (p.16). Furthermore, the experiments requested by reviewer 2 in point (vi) that we are planning to perform will help clarify the mechanisms of AP5Z1 degradation both in presence and absence of spatacsin. *

      Then the authors found that the selected interaction partner of the exon 32-34 sequence, UBR4, does not bind to the Spg11-GFP construct lacking the domain encoded by exons 32-34 but to the C-terminal domain of Spg11-GFP. Unfortunately, all these IP-experiments were shown as cut and paste figures, preventing the direct comparison between the input and the IP protein amounts (since the information is missing what percentage of the input and the IP has been loaded per lane, the evaluation and significance of these Co-IPs are unclear).

      Authors response : We have added in the Figure legend the fact that the input represents 5% of lysate added to the immunoprecipitation assays

      v) p9: AP5 (Z1) is a cytoplasmic protein and can be localized on the cytoplasmic surface of lysosomes. How should the GFP-mcherry-AP5Z1 protein enter the lumen of lysosomes justifying the quenching of the GFP signal? A positive control has to be included in the experiment shown in Fig. 4E demonstrating the effect of MG132 under identical conditions of a protein substrate for proteasomal degradation.

      Authors response :* We agree this is an important control. We plan to add a control showing accumulation of ubiquitin in lysates upon MG132 treatment to show it was indeed effective. *

      vi) Fig. 5A: In contrast to GFP-mcherry-AP5Z1, spastizin-GFP is localized at the cytoplasmic surface of lysosomes (co-staining with LAMP1-mcherry) in wild-type MEFs. In regard to the incomplete data commented under "minor points Fig.4/Suppl.Fig.4", I suggest to perform a simple control experiment with overexpressed GFP-spastizin and mCherry-AP5Z1 in wild-type MEFs (at the best also in Spg11 KO MEF) with and without bafA treatment, which will clearly demonstrate whether single components of the trimeric Spg11, spastizin-AP5Z1 complex are degraded independently of each other in lysosomes.

      *Authors response : As stated above, we will perform this control experiment, and will add the data in Figure 5 in future revision. This will help clarify the mechanism of degradation of AP5Z1 and spastizin both in presence and absence of spatacsin. Discussion of this point will also help to clarify the point iv raised by reviewer #2. *

      vii) why did the authors neither mention nor discuss the described role of SPG11 in autophago-lysosome reformation (ALR)?

      *Authors response : We did not discuss ALR in our first version as we did not investigate autophagic conditions. However, due to the well-described role of spatacsin in ALR, we agree that we should discuss ALR in our manuscript, and we added a paragraph (p.15). *

      Minor points

      • Figure 1 A, B, D, and G: ER-lysosome contact sites. The quantification of the co-localization of spatacsin-V5 with the ER marker protein GFP-Sec61b has to be given.

      Authors response :* We plan to add quantification data performed on STED images showing localization of Spatacsin-GFP together with ER and lysosomal markers. This data will be added in Figure 1. *

      Moreover, the authors analyzed overexpressed tagged-proteins only. The results should be compared with the endogenous proteins.

      Authors response :* As stated above, there are no good antibodies to endogenous spatacsin for immunostaining. We will add new STED images with antibodies against endogenous Reep5 and Lamp1 to label the ER and lysosomes together with overexpressed spatacsin. Regarding endogenous spatacsin, we could only investigate its localization by subcellular fractionation and western blots comparing wild-type and Spg11 knockout samples. We added biochemical data suggesting that spatacsin is enriched either in the ER or in lysosome membrane associated with the ER. These data have been added in Figure 1 and in text (p.5) and we added a paragraph in discussion regarding spatacsin subcellular localization (p.15). *

      p8/Figure 3: what does the 'analysis of trained neural networks' mean?

      Authors response : We did not analyzed the trained neural network, but we used this trained neural network to perform image analysis. We clarified the text (p.10).

      Figure 4: what happens with the other AP5 subunits?

      Authors response : This is a very interesting question. We will test whether overexpression of spatacsin-GFP induces a degradation of some other AP5 subunit, provided we get specific antibody. We will add the data in Figure 4A.

      Fig.4F/Suppl.Fig4: live images of GFP-mcherry-AP5Z1 + lysotracker staining have to be shown both for wild-type MEFs with and without bafilomycin A treatment(as in Fig.4F), and in Spg11 KO and Ki MEFs +/- bafA.

      Authors response : We will add these data in Figure 4 (WT Mefs +/- Baf A) and in Supplementary Figure 5 (Spg11KO and SPG11D32-34 Mefs +/- Baf).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript highlights an interesting localization of spatacsin in the endoplasmic reticulum (ER)-lysosomes contact sites. In addition, it implicates spatacsin in regulating tubular dynamic lysosomes. Mechanistically, the authors propose that spatacsin interacts with UBR4 to promote the autophagic degradation of its binding partner AP5Z1 at the lysosomes. In turn, this would also regulate the amount of spastizin at the lysosomes, which is known to interact with anterograde motors. The authors further show that AP5Z1 interacts with p150Glued. Thus, the balance between AP5Z1 and spastizin at the lysosomes would determine lysosomal trafficking directionality.

      Major Comments

      1. Several crucial results of the manuscript are based on quantifications performed on immunofluorescence stainings. Data points in graphs show individual cells or individual lysosomes and the authors apply statistical tests on replicates that cannot be considered biologically independent, since they come from the same experiment or even the same cell. It is recommended to show superplots where both the individual data and the average of each independent experiment is indicated as recommended by Lord et al. (J Cell Biol 2020 219 (6): e202001064.). Statistics should be performed only on independent biological replicates.

      Authors response : We agree with the comments of the reviewer regarding data presentation. We have therefore changed the presentation of all graphs of the manuscript using superplots that allow us to show all the points that were analyzed as well as the mean value for each biological replicate, and performed statistical analyses by comparing the biological replicates as proposed in Lord et al, JCB 2020 (10.1083/jcb.202001064).

      The authors have used yeast two-hybrid to search for spatacsin interactors. Although in the manuscript they refer to supplementary tables that should show these interactors, the available Tables are confusing and refer to the following downregulation experiments.

      Author response : We have added two supplementary data tables (Now Supplementary Tables 1 and 2) to give the list of interactors of wild-type C-terminal domain of spatacsin and spatacsinD32-34, respectively.

      Supplementary Tables 3 and 4 now refer to the analysis of the downregulation experiments by respectively the neural network method and the tubular lysosome detection method.

      An experiment to demonstrate that endogenous UBR4 and spatacsin interact by co-immunoprecipitation would be crucial.

      Authors response : We agree with the reviewer that it would be important to test whether endogenous spatacsin and UBR4 are interacting by co-immunoprecipitation. So far we have not managed to immunoprecipitate either endogenous spatacsin or endogenous UBR4 with the antibodies we tested, which prevents us to test the interactions of endogenous proteins by co-immunoprecipitation. We are not sure we can provide this result.

      Several important experiments to unravel the mechanistic role of spatacsin (Figure 4 and 5) are performed upon overexpression. This is a major limitation of the study and the authors should address it as much as possible. Western blots and immunoprecipitations are shown that appear to have been performed only once and have no quantification. As an example, in Fig 4A the difference in levels of AP5Z1 upon spatacsin overexpression or UBR4 downregulation are very minor. I would be very careful in drawing big conclusions, without additional repetitions and additional experiments in an endogenous setting.

      *Authors response : We agree that a lot of our experiments used overexpression. We have now added to the manuscript new data obtained in MEFs where we downregulated spastizin or AP5Z1 (Figure 6). They confirm the role of spastizin in the regulation of lysosome dynamics. Furthermore, our new data show that levels of AP5Z1 must be tightly regulated as both overexpression and downregulation of AP5Z1 affects lysosome dynamics (p.12). We also discussed these data carefully (p.16 ). *

      Furthermore, we agree that our presentation did not indicate that the western blots were repeated several times. We have now added quantifications for the western blots presented in Figures 4 and 5. Furthermore, we have also added the data showing that downregulation of UBR4 led to higher levels of AP5Z1 in control fibroblasts (Figure 4C).

      The authors suggest a model by which UBR4 recruited by spatacsin is involved in autophagic degradation of AP5Z1. The data shown do not support this conclusion. First, in Figure 4A downregulation of UBR4 does not increase levels of AP5Z1 above the control in lane 1, but only when spatacsin is overexpressed. The effect of downregulation of UBR4 in wilt-type cells on AP5Z1 should be investigated. Secondly, there is no experiment directly proving that the stability of AP5Z1 depends on UBR4.

      Authors response : We have added new western blots (and quantification) in Figure 4C showing that downregulation of UBR4 increased levels of AP5Z1 in control conditions. The fact that downregulation of UBR4 increased levels of AP5Z1 in control conditions suggests that UBR4 contributes to regulating the levels of AP5Z1. However, we do not show whether UBR4 directly promotes the degradation of UBR4, which has been added in the discussion (p15). To test whether UBR4 affects the stability of AP5Z1, we will monitor whether downregulation of UBR4 by siRNA increases the half-life of AP5Z1. These data will be added on Figure 4.

      The authors suggest that the interaction of spatacsin with spastizin or AP5Z1 are in competition. This is an interesting hypothesis, however to conclusively demonstrate this, pull-down experiments in KO cells and not upon extreme overexpression should be performed.

      Authors response : We agree that testing the interaction of spatacsin with its partners in SPG15 KO or AP5Z1 KO fibroblasts would be a very good control of our hypothesis. However, we previously showed that the levels of AP5Z1 are lower in SPG15 KO than in control fibroblasts (Hirst et al, 2015), which introduces a bias in the analysis. We therefore plan to concentrate on AP5Z1 fibroblasts and investigate whether interaction of spatacsin with spastizin is modified in these cells. An alternative would be to monitor the effect of siRNA downregulating AP5Z1 on the interaction between spatacsin and spastizin. We will add these data in Figure 5.

      Minor comments

      1. In figure 1G and 1H the overlapping area between lysosomes and ER is quantified. Considering that the ER occupies a large portion of the field a 90{degree sign} flipped control for both WT and KO would be important to sort out random colocalization. In this direction, it would be also essential to show that the total amount of lysosomes is not different in WT and KO, especially because in figure 1A the lysosomes in WT and KO seem to be different not just in shape but also in number and size. A different number or size of lysosomes affects this analysis.

      Authors response :* We added quantifications in Supplementary Figure 1F showing that 90° flipped controls are indeed not capturing the same proportion of contacts between the ER and lysosomes. We also added quantifications in Supplementary Figure 1D-E showing that the average size of lysosomes and the number of lysosomes per unit area are similar in control and Spg11 KO fibroblasts and mentioned it in the text (p.6). If the lysosomal staining appears different in Spg11 KO fibroblasts it is because lysosomes are clustered around the nucleus, an observation that we reported previously (Boutry et al, 2019). *

      In the second chapter of the Results, the authors state: "we observed by live imaging a higher number of lysosomes with tubular shape in Spg11+/+ compared to Spg11-/- cells", however the number of elongated lysosomes is quantified per area. Why the number of elongated lysosomes is not quantified over the total amount of lysosomes?

      Authors response : The point raised by the reviewer is a fair point. The purpose of our analysis was to compare the number of lysosomes with tubular shape in control and Spg11 KO cells. As the number of lysosomes per unit area is invariant between control and Spg11 KO cells as shown in new data included in Supplementary Figure 1D, normalization to total number of lysosomes or to cell surface reflects the same difference in phenotype.

      The In the fourth chapter of the Results, the authors state:" In wild-type MEFs, mCherry was colocalized with lysosomes. In contrast, GFP that is sensitive to pH was poorly colocalized with lysosomes, suggesting that AP5Z1 was mainly inside the acidic subcellular compartment (Figure 4F)." If the aim of the authors is to shown that AP5Z1 is mainly into the lysosome, the amount AP5Z1-mcherry inside and outside the lysosome need to be compared, with a proper statistical analysis. There is also a lot of GFP signal in the cytosol. Why is that?

      *Authors response : We agree with the reviewer, we will add quantification of the proportion of AP5Z1-mCherry inside lysosomes on Supplementary Figure 5. *

      Regarding the GFP-AP5Z1 signal in the cytosol, AP5Z1 has no transmembrane domain and may thus exist as a cytosolic protein. Since GFP is quenched in the acidic environment of lysosomes, the GFP fluorescence of the mCherry-GFP-AP5Z1 protein is outside lysosomes, and it appears partly cytosolic. Of note, there is also some cytosolic mCherry signal that is less visible due to the high level of mCherry fluorescence in lysosomes. We will clarify this point with the quantification of the proportion of mCherry signal compared to GFP inside the lysosomes and add it in Figure 4.

      construct used in the paper is a C-terminal tagged version of spatacsin. The authors should consider to test an N-terminal tagged construct at least for the localization experiments.

      Authors response : We added an immunostaining image of Spatacsin with an N-terminal tag (Supplementary Figure 1B) and mentioned it in the text (p.6). As spatacsin with a C-terminal tag, it presents a diffuse distribution that poorly co-localizes with lysosomes.

      Figure 5C: a negative control and the quantification are missing.

      Authors response : A non-transfected cell is present on Figure 5C, visible thanks to the Lamp1 immunostaining, and that we considered as a negative control. In this non-transfected cell, we detected no PLA signal. We added an asterisk to point the non-transfected cell on Figure 5C. Quantification will also be added in the revised version after we have performed the PLA experiment required by Reviewer 1.

      Reviewer #3 (Significance (Required)):

      Since spatacsin, AP5Z1 and spastizin are all implicated in hereditary spastic paraplegia, the data are of potential interest not only for basic cell biology, but also to understand the pathogenesis of the disease. In addition, the manuscript proposes a novel model regulating trafficking of dynamic lysosomes.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Pierga et al. reveal subtle differences in lysosome morphology, ER-contact, and trafficking in the absence of Spatascin. These data are replicated with a truncated Spatascin, presumably a loss of function. Two-hybrid screening of the deleted sequence from this truncation for interactors and then asked whether these hits could phenocopy the lysosome morphology changes. This led to an assertion for a role for ubiquitination in these effects. Rather than these hits the group then investigates previously known Spatascin interactors and reports similar complex but subtle abnormalities via overexpression or knockdown of these. While data show overlapping phenotypes by modulation Spatascin, AP5z1, and Spastizin, the manuscript is confusing, leaps from experiment to experiment, and does not provide novel rigorous mechanisitic insight. It conflates all the discrete lysosomes aspects into a collective to link them. The title is over-stated and not appropriate for the experiments.

      The localization of endogenous Spatascin is lacking - over-expression is prone to artifact and the punctate data on the V5 suggests much more work is needed to understand where in the cell it is. It would seem much more work is needed here.

      Authors response : As stated by reviewer 1, there are no good antibody to endogenous spatacsin, and therefore we have to rely on expression of tagged spatacsin to study its localization by immunofluorescence. When performing the images, we avoided the cells with the highest ovexpression of tagged spatacsin. Yet, we agree that this is still overexpression. That’s why we included subcellular fractionation data where we can detect endogenous spatacsin (Figure 1A-1B). These data confirmed that spatacsin is enriched in the ER or in lysosome fraction tightly associated with the ER.

      Furthermore, the EM data (1E) would suggest the far majority of lysosomes are in contact with ER - these seems uncharacteristic.

      Authors response : The EM data in figure 1E indeed shows that the majority of lysosomes are in contact with the ER, as previously shown by other groups (Friedman et al, 2013, Höglinger et al, 2019).

      The phenotypes analyzed are very subtle, and while statistically significant the biological impact is unclear - in many cases individual lysosomes (or lysosome-ER contacts) are considered as an 'n'. While these results are probed across multiple independent experiments the batch effects and how uniform per cell the events are is unclear.

      Authors response : We agree with the comments of the reviewer regarding data presentation. ‘n’ represented individual cells, but did not actually take into account the variability across experiments. We have therefore changed the presentation of all graphs of the manuscript using superplots that allow us to show all the points that were analyzed as well as the mean value for each biological replicate, and performed statistical analyses by comparing the biological replicates as proposed in Lord et al, JCB 2020 (10.1083/jcb.202001064).

      In fig 2H critical data are missing - the effect of Spatascin KO on the transition between these morphologies should be considered as in G. Otherwise the relevance is unclear.

      Authors response : We have added this quantification on Figure 2I. It shows that transition of morphology of lysosomes from round to tubular in Spg11 KO cells is still associated with a change of speed, although the average speed attained is halved compared to conditions where spatacsin is present. This shows that loss of spatacsin does not abolish morphological transition of lysosomes but limit their speed in the tubular shape. We commented on this new data in the text (p.8).

      The impact of over-expressing a lysine-null Ub ( Fig 3) is far too crude and non-specific to have meaning here. It is assumed that the only proteins affected are those of interest. This is consistent with much of the paper where "true-true-and unrelated" is more likely than the presumption of causality.

      Authors response : It is true that the expression of lysine-null ubiquitin is really crude and may impact many different cellular pathways. Furthermore, the results obtained with the lysine-null ubiquitin do not contribute to the rest of the paper. We therefore removed the original Fig3G, H, I and Fig 4B and updated the text accordingly (p.10).

      The blots in Fig4 are a relatively poor quality and not quantified over repetition.

      *Authors response :Spatacsin and spastizin are large proteins, and there is not much choice for antibodies able to detect these proteins. Yet we have validated their specificity by western blot using knockout cells (spatacsin) (Supplementary Figure 4 A-B) or siRNA (spastizin) (Supplementary Figure 7B). We agree that our presentation did not indicate that the western blots were repeated several times. We have added quantifications for the western blots present in Figures 4 and 5. We also changed some illustrative western blots to improve quality. *

      Controls are missing and Fig5 suffers from a reliance on over-expression - there is a massive over-expression of AP5Z1 which may be affected the stoichiometry of these overall interactions, but with an n=1 its hard to know and its not clear what these data add. Again, while statistically significant (5E and F) due to the nature of data analysis (every lysosome=n of 1) it is not clear how biologically significant UBR4 siRNA or AP5Z1 over-expression is - as the accumulation of AP5Z1 in these two conditions is orders of magnitude apart - again likely unrelated.

      Authors response : We added quantification for this western blot (Supplementary Figure 6A).

      *As stated above we have changed the representation of the graphs. Each point represents one cell, and we included the mean value for each biological replicate. *

      Preventing degradation of AP5Z1 by UBR4 siRNA or overexpression of AP5Z1 do not indeed have the same effect on total AP5Z1 but do have a similar effect on the interaction of spatacsin with its partners evaluated by co-immunoprecipitation, as illustrated by the quantifications that we have added. We clarified this in the text (p.12). As requested by reviewer 3, we will also investigate the effect of AP5Z1 knockout or downregulation on the interaction between spatacsin and spastizin assessed by co-immunoprecipitation. These data will be added in Figure 5 and will strengthen our conclusions.

      Fig 6 begins to conflate the fact that different lysosome morphologies appear to have different trafficking properties even in WT cells and that many of these targets affect morphology - therefore to conclude a direct effect on trafficking seems inappropriate.

      Authors response : In original Figure 6, we showed that Kif13A-ST and p150CC1 changed the proportion of tubular lysosomes (previous Figure 6 and H), and the data showing that these constructs changed the trafficking of lysosomes were presented in Supplementary Figure 5 B-C. We have now moved the data showing the effect of Kif13A-ST and p150CC1 in the main Figure (Figure 6F and 6I) to facilitate the interpretation of the data. Therefore, expression of Kif13A-ST and p150CC1 do not only affect the morphology of lysosomes, but also impaired their trafficking. We thus do not extrapolate lysosome dynamics from their morphology, we actually quantify lysosome dynamics.

      Fig 7 extends this into polar cells (neurons) but still it is not clear whether form (morphology) dictates function (likelihood of trafficking or directionality.

      Authors response : We did not only analyzed neurons because they are polarized cells, but because neurons are the main cells affected by neurodegeneration observed in absence of spatacsin (Branchu et al, 2017). We added new data on Figure 7 showing that tubular lysosomes in axons are actually more dynamic than round lysosomes, as observed in fibroblasts. We added these data in Figure 7 and text (p.13).

      Investigation of lysosome trafficking in axons also allowed us to investigate the directionality of movement, which is difficult in MEFs. We clarified this point in the text (p.13).

      In sum, there is a lot of data that collectively points to a partial localization of Spatascin at Er-lysosome contacts and an influence on morphology and trafficking of lysosomes in the cell, but at the end of the day very new mechanism is brought to light.

      Authors response : The mechanisms regulating trafficking of lysosomes are far from being fully resolved. Our manuscript shows that spatacsin contributes to this regulation by modulating the degradation of AP5Z1. This in turn regulate the lysosomal association of AP5Z1 and spastizin that interact with motor proteins to control lysosomal dynamics.

      Reviewer #4 (Significance (Required)):

      This manuscript is directed to the basic cell biology community - involving ER, lysosome, and microtubule dependent trafficking. There are some new analytical tools employed and many co-factors and binding partners of Spatascin considered but frankly too many to adequately and rigorously control for. Because of this the manuscript is very unfocused, hard to follow and makes too many assumptions about shared dynamics ? necessarily arising from shared morphology - lysosomes are highly dynamic and can be affected by virtually any change in intracellular trafficking or protein/membrane transport. This is not appropriately considered.

      Authors response : We have clarified our manuscript to show that dynamics is not necessarily arising from a tubular morphology. It turns out that lysosomes with a tubular morphology indeed are more dynamic that lysosomes with a round morphology. Importantly, in all our experiments dealing with lysosomal dynamics, we have actually included a quantification of lysosome dynamics using time lapse imaging as detailed in methods (p.21).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This paper identifies a role for the hereditary spastic paraplegia protein spatacsin in lysosome morphology, positioning and dynamics, and undertakes detailed mechanistic studies to try to identify the mechanism for this effect. In doing so the paper elucidates further mechanistic information about the properties of two other hereditary spastic paraplegia proteins, spastizin and AP5Z1. The work is done in mammalian cells and uses a combination of over-expression, depletion and biochemical studies. The main findings are:

      1. The authors present evidence that spatacsin is an ER-localised protein.
      2. Murine embryonic fibroblasts lacking spatacsin have a reduced number of tubular lysosomes and the remaining lysosomes are less motile. In general, a relationship between tubular lysosome morphology and lysosome motility, often in association with the endoplasmic reticulum (ER), is demonstrated. These tubular lysosomes are catalytically active and acidic.
      3. In terms of mechanism of this effect, by combining a yeast-two hybrid and siRNA phenotypic screen, the authors identify a number of spatacsin-interacting proteins that also regulate lysosomal tubulation. The most important of these for the purposes of this paper is UBR4, an E3 ubiquitin ligase.
      4. The authors show that spatacsin and UBR4 promote degradation of AP5Z1, and that this property required the ability of spatacsin to interact with UBR4. Somewhat surprisingly, as AP5Z1 is a coat protein, this degradation appeared to occur within the lumen of the lysosome - the authors speculate how this could be in the discussion.
      5. The authors then demonstrate that AP5Z1 and spastizin, both hereditary spastic paraplegia proteins, compete for binding with spatacsin.
      6. The relationship between spatacsin, spastizin, AP5Z1 and motor proteins in then examined. There is a known interaction between spastizin and KIF13A and expression of a dominant negative KIF13A protein reduced lysosomal tubulation. The authors then demonstrate an interaction between AP5Z1 and the p150Glued dynein/dynactin complex member, then showed that expression of a dominant negative p150Glued protein reduced lysosomal tubulation.
      7. Finally, that authors demonstrate the relevance of these findings to neurons, the target cells of hereditary spastic paraplegia, by showing that lysosomal tubulation and axonal transport are reduced in mouse neurons lacking spastacsin, and that depletion of UBR4 or AP5Z1 affected these as expected from the experiments above.

      Major comments:

      Overall I believe that the key conclusions of this paper are generally convincing and that the work is of high quality. However, I do have some reservations:

      1. The localisation of spatacsin on the ER. It is always difficult to be convinced about colocalization of a diffuse punctate marker and the ER. From the STED experiments in figure 1, while it definitely seems that there is some spatacsin on the ER, there also appears to be some spatacsin puncta that are not. I'd like to know if these puncta represent lysosome-associated spatacsin. This is important for interpretation of the subsequent experiments (see point 3 below). I also think quantification of these co-localisation will increase confidence in the results. In addition, a caveat of the immunofluorescence studies is that they use over-expressed spatacsin. I appreciate that there are no good antibodies to endogenous spatacsin, but I don't think this limitation is sufficiently acknowledged. As the claim of ER-localisation is critical for the proposed mechanistic model, and in the absence of experiments with endogenously tagged spatacsin, this makes the biochemical fractionation studies of figure 1C very important. To make these more convincing I would prefer to see additional control markers to verify the separation of lysosomal and ER compartments - e.g. lamp1, lamp2, an ER tubular marker such as a REEP5 or a reticulon.
      2. The authors generally do a good job of quantifying their results. However, this is lacking for the biochemical experiments (immunoblotting and IP) in figures 4 and 5, and I would prefer to see these quantified (the quantification should include data from repeat experiments so that we can judge the reproducibility of the results).
      3. On page 10, referring to the proximity ligation results, the authors comment: "This suggests that the spatacsin-spastizin interaction occurs at contact sites between the ER and lysosomes to allow spastizin recruitment to lysosomes". I'm not sure this statement is fully supported, as mentioned at point 1 above it is possible that some steady state spatacsin is at lysosomes. To fully support this, we'd need to see the PLA signal also convincingly co-localise with an ER marker.
      4. In figure 6C and D the effect of spastizin on lysosomal tubulation and dynamics is investigated. Wartmannin treatment is used to do this, as it is known to remove spastizin from lysosomes. However, this is a very indirect manipulation that could have many other consequences and it would be better to demonstrate this directly by showing the effect of depletion of spastizin on lysosomal morphology/dynamics. I also think the role of AP5Z1 in tubulation/dynamics would be better supported with additional experiments to deplete the protein - at present only over-expression is examined.
      5. While the experiments showing that over-expression of dominant negative forms of KIF13A and p150Glued affect lysosomal tubulation/dynamics provide good circumstantial evidence that spatacsin influences these lysosomal properties via its interactions with spastizin and AP5Z1 (which bind to these motor proteins), the authors have not shown that the interaction of the motor proteins with spastizin and AP5Z1 is required for this ability to regulate lysosome tubulation/dynamics. This means that the model presented in figure 7 is not fully supported by the data. If the authors have been able to map the binding regions for these interactions then perhaps this could be investigated with rescue experiments, although I appreciate that this is potentially a major piece of work and perhaps outside the scope of this paper. An alternative would be that the authors acknowledged this part of the model as somewhat speculative.

      6. Are the data and the methods presented in such a way that they can be reproduced?

      Yes - Are the experiments adequately replicated and statistical analysis adequate?

      In general I am not convinced that the statistical tests are applied rigorously in this paper. Most experiments are done three times, but the "n" used for statistical testing is typically chosen as, e.g. the number of cells, number of lysosomes, rather than number of biological repeat experiments. This means that inter-experimental variability is not rigorously taken into account. A more rigorous practice would be to use the mean measures for each of three biological repeats and apply the statistical tests to the three means, so n=3 if three repeats were done. Superplots would be a nice way to graphically display these data.

      Minor comments:

      1. In supplementary figure 3D I cannot honestly say that I see the smaller band.
      2. When first called out, I expected supplementary tables 1 and 2 to show the list of interactors with wild-type spatacsin and spatacsind32-34 respectively, but this is not what they show.
      3. The experiments in Figure 4A are a little problematic in the way that they are called out. The first call refers to just a small subset of the data in the figure, and the figure is then called out at various points later in the paper. This is quite confusing. Is there any way this could be simplified?
      4. The section on page 10: "Spatacsin also interacts with spastizin, and is required to recruit spastizin to lysosomes (Hirst et al., 2021). ........ We hypothesized that spatacsin interaction with spastizin was required for spastizin localization to lysosomes." Is odd, as the authors seem to be hypothesising an observation that they have just said has already been demonstrated.
      5. Can the authors explain why there is so little interaction between wild-type KIF13A and spastizin?
      6. In figure 6G p150Glued signal is also present in the control IP lane, which casts doubt on the specificity of the interaction. Could the authors generate a cleaner result?
      7. I would be interested to see how AP5Z1 expression differs between neurons with and without spatacsin- we would expect similar results to those shown in the MEFS.

      8. Are prior studies referenced appropriately?

      Yes. - Are the text and figures clear and accurate?

      Yes - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Overall I thought the presentation was good. However, this is a complex paper and anything that the authors can do to simplify the textual descriptions of the experiments would be helpful. There are quite a few long multiphrase/multiclause sentences that could perhaps be broken up or simplified, e.g. I had to read the following three or four times to understand it: "Downregulation of UBR4 that prevented degradation of AP5Z1mediated by spatacsin (Figure 4A) led to higher interaction of spatacsin with AP5Z1 and decreased the interaction of spatacsin with spastizin (Figure 4A)."

      Referees cross-commenting

      Thanks for the opportunity to comment on the other reviews. It does seem that there is a consistent theme that reviewers are concerned about the over-reliance on over-expression experiments and the need for additional experiments using endogenous antibodies or protein depletion methodologies to strengthen the data. In addition, I and at least one other reviewer feel that it is not adequate to use number of cells as the "n" for statistical testing, and that true biological repeats are needed.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      I think this paper represents a significant conceptual advance in our understanding of the mechanisms by which lysosomal dynamics are controlled in non-polarised cells and neurons. In addition, it elucidates mechanisms that may underlie multiple forms of hereditary spastic paraplegia, a hereditary form of motor neuron disease.<br /> - Place the work in the context of the existing literature (provide references, where appropriate).

      This is a significant conceptual advance on the current literature on spatacsin and on the molecular mechanisms controlling lysosomal morphology/dynamics. The paper elucidates important mechanistic details of the relationship between three key proteins involved in hereditary spastic paraplegia, while also shedding light on the basic biology of lysosomal morphology and dynamics. - State what audience might be interested in and influenced by the reported findings.

      Basic cell biologists interested in the ER, in lysosomes, in ER-organelle contacts. Scientists interested in the causation of hereditary spastic paraplegias. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Membrane traffic, lysosome function, ER-endosome contacts, hereditary spastic paraplegia.

    1. One reason may be the emotional investment we all have in language. Language is more than a neutral medium for transmitting a message. It has washed over us like a river continually since birth. We use it constantly. It shapes who we are. Think back to your earliest memories. Can you ever remember a time when you were without language? Identity and language twine about each other so tightly that they are impossible to separate. Children of immigrant families, for example, often associate the language of their home with warmth and strong personal connections, with the deepest, private sense of who they are, in contrast to the formal public language of school and the outside world.

      What I have gathered from this is that language makes us the people we are today and in this context that is correct because language is the one thing that we as people or social beings can’t be without it it connects us to not just people but places and beings around us and it transcends across the globe and it’s very interesting to think about language like this.

    1. Author Response

      Reviewer #1 (Public Review):

      Ciliary length control is a basic question in cell biology and is fascinating. Regulation of IFT via calcium is a simple model that can explain length control. In this model, ciliary elongation associates with an increase in intraciliary calcium level that leads to calcium increase at the ciliary base. Calcium increase acts to reduce IFT injection and thus ciliary assembly rate. The longer the cilia, the more increase of calcium level and the more reduction of IFT injection and thus the ciliary assembly rate. When the cilia approach the genetic defined length, the gradual reducing assembly rate eventually balances the constitutive disassembly activity. Cilia then stop elongation and a final length is achieved. This work tested this model by manipulating the calcium level in cilia by using an ion channel mutant and treatment of the cells with EGTA. In addition, IFT injection was measured before and after calcium ciliary influx. Based on the outcome of these and other experiments, it was concluded that there is no correlation between changes in calcium level and IFT injection, thus challenging the previous model. This work is well written and the experiments appear to be properly executed. It nicely showed an increase of intraciliary calcium during cilia elongation, and beautifully showed that ciliary calcium influx depends on extracellular calcium. However, I felt the current data are inadequate to support the author's conclusion.

      We thank the reviewer for the positive assessment of the interest in our work, and we have performed additional experiments to address the reviewers concerns as discussed below.

      The authors showed that ciliary calcium increases along with ciliary elongation, which correlates with reduction of IFT injection. Thus, this result would support that calcium increase reduces IFT injection. To test whether reducing calcium influx would alter the IFT injection, the authors used an ion channel mutant cav2. Indeed, ciliary calcium level in the mutant cilia appears to be lower compared to the control in average. After measuring ciliary calcium level and IFT injection during ciliary elongation with mathematical analysis, it was concluded that reducing ciliary calcium level did not lead to increased IFT injection, which is distinct from the control cells. Thus, the authors concluded that calcium does not act as a negative regulator of IFT injection. However, if one examines the calcium flux in Figure 3B and IFT injection in Figure 4B of cilia less than 6 micron, one may draw a different conclusion. For the mutant cilia, the calcium influx is higher than that in control cilia and IFT injection is reduced compared to the control. Thus, this analysis is the opposite of the authors' conclusion, and is supporting the previous model. There is a rapid change in ciliary assembly rate at the early stages of ciliary assembly (see Figure 1C), thus, the changes in calcium influx and IFT injection in the earlier assembly stage would be more appropriate to assess the relationship between intraciliary calcium level and IFT injection.

      We thank the reviewer for raising this issue, which led us to examine the data more carefully. In looking at the numbers of cells with flagella in each length range, we became concerned that the apparently low calcium influx in shorter flagella in control cells compared to ppr2 or EGTA treatment might actually due to bias from technical issues: it is relatively difficult to image shorter flagella in our TIRF imaging setup, because shorter flagella have less flagellar surface area to attach the coverslip. The more motile the flagella are, the more likely are the cells to detach when their flagella are short, because the bending force of the flagella is strong enough to pull them away from their small area of adhesion. This effect is much stronger in control cells than in either the ppr2 mutants or EGTA treated cells, whose flagella are less motile. This led to a reduced number of cells examined with flagella shorter than 6 um (17 versus 34 for control and ppr2 cells, respectively). To overcome the difficulties and biased result, we observed more flagella in control cells. The new data has now been integrated with our previous data and shown in Figure 3. The new result shows that calcium influx in control cells is in fact higher than in the ppr2 mutant cells. So, our result is remains consistent with our conclusion, and we believe that it is not useful to analyze the shorter flagella separately.

      The authors used EGTA treatment to support their conclusion. However, EGTA treatment may induce a global calcium change of the cell, the outcome may not reflect actual regulation of IFT injection by ciliary calcium influx. For example, as reported elsewhere, the change of cAMP level in the cell body and cilia has a different impact on ciliary length and hedgehog regulation. The slower assembly of cilia in EGTA treated cells may be caused by many other factors instead of sole regulation by IFT.

      It is certainly possible that EGTA is affecting some process inside the cell that then indirectly affects IFT. Our experiments cannot rule this out. The fact that similar effects are seen with the ppr2 mutant argues against this idea, but again cannot rule it out. We have added the following caveat to the discussion:

      "Other calcium dependent processes in the cytoplasm might also potentially address IFT, and our results cannot rule out this possibility. However, we note that the ppr2 mutant also fails to show the effect on IFT or regeneration predicted by the ion current model."

      The authors only examined the impact of reducing ciliary calcium influx. To further support the authors' conclusion, it is recommended that the authors should examine IFT injection in a condition where ciliary calcium level is increased. Using calcium ionophore may not be a good choice as it may change the global calcium level. One approach to consider is using mutants of a calcium pump present in cilia.

      We thank the reviewers for this suggestion. The calcium current model would predict that if a calcium pump mutant failed to export calcium, the increased calcium building up inside the flagellum should lead to decreased IFT entry and a shorter flagellar length. We found at least two calcium pumps in the published Chlamydomonas flagella proteome (Pazour et al., 2005) and ordered several mutant strains from Chlamydomonas Library Project (CLiP) which are annotated as affecting these pumps. We measured the flagellar length of these potential calcium pump mutant strains, but none showed a statistically significant difference in length relative to control cells. We have now included this data as Figure S4. Because no length change was observed, we did not perform the extremely time consuming process of constructing strains that contain these mutations along with DRC4-GCaMP and KAP-GFP.

      As an alternative strategy to get at this reviewer's suggestion, we measured DRC4-GCaMP and KAP-GFP intensity in 1 mM CaCl2 treated flagella and found that CaCl2 treatment increases both the flagellar calcium level (Figure 3, see below) and IFT injection (Figure 4). This increase in IFT injection is the opposite of what the calcium current model predicts.

      Based on these results, we think the calcium pump experiment is not necessary because of the following reasons. 1. These calcium pump mutants might not increase the flagellar calcium level. 2. Even if the flagellar calcium was increased in these mutants, it does not affect the flagellar length and thus our conclusions would still hold. 3. These mutant strains might still have functional calcium pumps since the existing data on calcium pumps in flagella is likely to be incomplete. 4. The CaCl2 experiment clearly increased the flagellar calcium level inside flagella, directly addressing the point that the reviewer is getting at.

      The conclusion on line 272-273 may need more evidence. The authors showed that addition of 1 mM CaCl2 does not change ciliary assembly, and used this as one of the evidences to argue against the ion-current model. The addition of calcium extracellularly may not alter intracellular/intraciliary calcium level given that cells have robust systems to control calcium homeostasis. To support the authors' conclusion, one should measure the changes of calcium level in the cell/cilia or revise their conclusion.

      We have now performed these measurements and have included the data in Figure 3D.

      The authors showed nicely the changes in IFT properties before, during and after ciliary calcium influx and found that the intensity and frequency of IFT do not have a correlation with calcium influx though calcium influx restarts paused IFT trains for retrograde transport as previously reported (Collingride 2013). The authors again concluded that this is supporting their conclusions in that there is no correlation between IFT injection and calcium influx. However, I am not sure whether the short pulses of calcium influx at one time point would change the calcium level in the whole cilia in a significant way that would alter IFT injection at the ciliary base.

      We agree that individual pulses might not have an effect on the average level of IFT injection. We were specifically trying to see if, having previously ruled out the predicted correlation at the level of average rates, there might still be a trace of the correlation for individual events.

      Reviewer #2 (Public Review):

      The authors use a genetically encoded calcium indicator to measure Ca in flagella to establish that Ca influx correlates with flagellar length. (Despite this correlation, there is so much noise that it is dubious that Ca level can regulate the flagella's length.) Then, they show that reduced Ca decreases the rate of IFT trains entering flagella, which ruins the ion-current model of regulating flagella's length. (Ca can still be one of the factors that sets the target length.) Ca does not seem to change the disassembly rate either. There are also no correlations between Ca influx spikes and IFT injection events. Curiously, these spikes broke pauses of retrograde IFT trains, but that still did not affect IFTs entering dynamics.

      Some other possibilities like Ca regulating unloading rates are discussed and convincingly rejected.

      The study ends with an interesting Discussion, which talks about other possible models, and concludes that the only model not easily rejected so far is the mechanism relying on diffusion time for kinesins from flagella to the cell body being greater in longer flagella.

      The paper is well written, very thorough, contains significant results.

      We thank the reviewer for this strong positive assessment.

      Reviewer #3 (Public Review):

      This work by Ishikawa et. al is focused on testing the hypothesis first proposed by Rosenbaum that Ca2+ levels in the primary cilia act as an internal regulator of cilia length by negatively regulating intraflagellar transport (IFT) injection and/or microtubule assembly. The authors first built a mathematical model for Ca2+ based regulation of cilia length through the activity of a Ca2+ dependent kinase. They then tested this model in the growing cilia of Chlamydomonas cells expressing an axonemal localized GCaMP. Ca2+ levels were manipulated genetically with a calcium channel deficient mutant line and with the addition of EGTA. While increases in Ca2+ levels do correlate with cilia length as expected by the model they found that IFT injection was positively correlated with IFT injection and increased axonemal stability which contradicts its potential as a mechanism for the cell to internally regulate cilia length.

      Overall the conclusions of the paper are supported by their data. They greatly benefit from first establishing their model in a clear form and then experimentally interrogating the model from multiple angles in order to test its viability. The importance of cilia length to our understanding of human health has only become greater in recent history and the authors are making a significant contribution to our understanding of ciliary length regulation.

      We thank the reviewer for this positive assessment, including of the relevance of the model. We have attempted to address all suggestions.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for thorough reading and for providing useful suggestions to improve our manuscript. We find two major issues indicated by the reviewers.

      1. Lack of pathophysiological relevance to attract a broader readership – to address, we have stained brain slices of PD patient’s with p129-Syn and Lamin B1 antibodies. Microscopy images show extensive lamina damages in the patient brain slices which contain p129-Syn positive inclusions. These images are now included in the current revision of the manuscript as 6C-D. We think that these results in the pathologically relevant systems will now establish a connection between lamina defects with neurodegeneration in PD and will be attractive for a broader audience.

      Experimental issues as indicated by major and minor points – majority of the points have been addressed in the current revision attached herewith. Given opportunity to submit a full revision, we shall incorporate more experiments to address all the points in the final revised manuscript.

      Point by point response to reviewer’s concerns:

      Reviewer 1

      R1: The work by Mansuri and collaborators reports that LB-like filamentous inclusions of α-Synuclein are able to associate with and perturb the nuclear lamina due to an unbalanced mechanical tension between cytoskeleton and nucleoskeleton. Consequently, lamina-injuries are proposed as a major driver of proteostasis sensitivity in cells with LB-like Syn-IBs.

      It is a complex work, in which a range of different cellular, biochemical and molecular techniques have been used. Readers of the paper (including the undersigned) will be wondering if a similar behaviour occurs in pathological systems, such as iPSC derived dopaminergic neurons arising from patients carrying the synuclein pathological mutations reported in this work.

      Response: We thank the reviewer for bringing out the lack of pathophysiological relevance in our manuscript. To address, we imaged post-mortem thalamus sections of a Parkinson’s Disease (PD) patient (BioChain Institute Inc., USA Cat# T2236079Par) and a control (BioChain Institute Inc., USA Cat#T2234079). Our experiments clearly show extensive lamina deformities in the patient brain (Fig. 6C-D) and connects with neurodegeneration in a pathological system.

      Major points

      R1: Authors should explain why there is a so high amount of p129-Syn in unseeded neurons (Fig. 1Ai, Fig. S1Bi): "p129-Syn was distributed throughout the neuron cell body and projections including light staining in the nucleus", as its accumulation is typical of PD-like α-syn aggregates. Similarly, unseeded neurons labeled with p129-Syn in Fig. 1Ai, Fig. 1Bi, and Fig. S1Bi and Fig. S1Ci are very different each other. Why? As neurons are unseeded, the pathological signature of PD-like α-syn aggregates should be very low or absent in all cases.

      Response: We agree with the reviewer that very low amount of p129-Syn should be present in unseeded neurons. We standardized microscopy parameters using fields that contained neurons with both large LB-like perinuclear IBs and smaller peripheral Syn-filaments. We used Leica SP8 confocal microscope. Argon laser power was kept constant at 30% of full potential while Smart Gain was titrated to visualize the smaller filaments. For example, the smaller filaments were not clearly visible in Annexure Figure 1Ai when Smart gain was 690V. Smaller filaments were prominent when the Smart Gain was increased to 848V (Annexure Figure 1Aii, included with the revision plan attached herewith). We also observed light intra-nuclear staining of p129-Syn at 848V Smart Gain when we zoomed the arrow indicated nucleus in Fig. 1Aii shown below as Annexure Figure 1Aiii. Accordingly, we used Smart Gain: 650-850V in all the images presented in the manuscript. Brightness and contrast are now adjusted for all the images prepared for the revised manuscript for the optimum view of the immunostaining. All the raw image files will be submitted to https://www.ebi.ac.uk/biostudies in due course.

      In order to rule out imaging artefacts at the higher Smart Gain (650V – 850V), we performed a control experiment without adding primary antibody against p129-Syn during immunostaining. Secondary antibodies were added and the Smart Gain was ~950-1000V during imaging. The light staining of p129-Syn as visible in Fig. 1Ai and 1Bi in the revised manuscript were not visible in this experiment (Annexure Figure 1B).

      A table indicating the Smart Gain for all the images is included in the revised manuscript as__ Methods Table S5 - Laser Intensity.__

      Reviewer 1 has also pointed out the difference in staining of p129-Syn in Fig. 1Ai and Fig. 1Bi. For Fig.1Ai, Rabbit monoclonal (p129-Syn (MJF-R13 (8-8), epitope: phosphoserine 129, cat# ab168381), and for Fig. 1Bi Mouse monoclonal (P-syn/81A, epitope: phosphoserine 129, cat# ab184674) were used. This information is now included in the figure legends. The difference in the staining pattern is due to the use of the different primary and secondary antibodies.

      Lastly, we want to emphasize that the staining pattern seen in unseeded neurons () are not the typical PD-like Syn-aggregates but the soluble p129-Syn that is yet to be incorporated into the amyloid-filaments. p129-Syn ((antibody MJF-R13 (8-8)) staining pattern in 1Ai is continuous in the projections and light dotted in the periphery and inside nucleus. These dots also accumulate on the Microtubule Organizing Centre (MTOC) indicating the presence of aggresome-like inclusion bodies in the neurons. The staining pattern in 1Bi (antibody P-syn/81A) is dotted throughout. In both the cases, the continuous or dotted staining were not observed after seeding. The continuous staining at the projections seen in 1Ai is broken into smaller filaments in 1Aii (indicated by arrowheads). The broken filaments are much more increased in number and length in Fig 1Bii and the staining-intensity prominently increased. Accumulation of multiple larger filaments into perinuclear LBs is typical PD-like (Fig. 1Bii, yellow arrowhead).

      The continuous staining and the broken staining patterns at the projections are also visible in the zoomed out MIP images presented in S1Bi and ii, respectively. The increase in fluorescence intensity of p129-Syn staining is prominent between S1Ci and ii indicating accumulation of p129-Syn in the form of large amyloid filaments in seeded neurons.

      We now discuss the staining patterns in the revised manuscript. Please see pages 4-7.

      R1: Authors should try to perform a more accurate quantification of the various colocalizations reported along the manuscript, i.e. by reporting the Pearson correlation coefficient or the Mander's overlap coefficient.

      Response:As suggested by the reviewers, Pearson’s co-localization coefficient values have been added separately for all figured showing co-localization in Supplementary note: Colocalization figures and table.

      Minor points

      R1: In Fig. S1B the red fluorescent signal arising from γ-tubulin staining is not visible in the merged picture.

      Response: Fig. S1B are the zoomed out MIP images of Fig. 1A. γ-Tubulin stains centrosome as tiny dots at the perinucleus in one of the z-sections of the MIP. To visualize these tiny dots in the MIP images, we have 1) optimized the brightness contrast of the MIP images and 2) provided a separate channel for γ-tubulin (arrowheads). These corrections are included in the revised version.

      R1: Page 6: results of Fig. S1D-E should be explained properly (CALNEXIND and CMX-Ros staining).

      Response:As suggested, we revised this part in Page 7.

      R1: Fig. 2A: the indication of SNCA in western blotting is not proper, as in this experiment you evaluated the protein level, so it is better to report "α-syn";

      Response:We agree with the reviewer. SNCA in western blots has been changed to α-Syn all the figures and figure legends.

      R1: Fig. S2B: there is great variability in the number of SNCA(A53T)- EGFP and SNCA(DM)-EGFP cells with IBs during the course of PFF-incubation, so that authors did not reveal any significant difference. I think it is not completely correct to emphasize this data at page 9, lanes 12-13;

      Response:We agree with the reviewer that the difference in number of SNCA(A53T)-EGFP and SNCA(DM)-EGFP cells with IBs was not statistically significant. Yet, we always observed aggressive biogenesis LB-like IBs in SNCA(DM)-EGFP cells. The statement in the manuscript is now corrected as per the reviewer’s suggestion (Page 9).

      __R1:__Did authors reveal any cytotoxicity upon Congo Red treatment at the indicated concentrations (Fig. S2G)?

      Response: Previously, Congo Red incubation was found to be non-toxic for neuronal cells even at 350 µM (PMID: 7991613). We have now performed MTT assay after Congo red treatment in our cells. The graph is now included as S2H. We did not observe any difference in cell viability even after treating the cells with the highest dose (100 µM) used in the experiment.

      R1: I have concerns about the percentages reported in Fig. S2G: the percentage of cells with filaments in the absence of Congo Red is apparently too low as compared to the previously reported percentages.

      Response:The reviewer is right. Number of Syn-filament containing cells varies between experiments because of ‘age’ of the recombinant amyloid seeds, different batches of seed preparation etc. We are repeating this experiment to increase the biological N. Results will be included and discussed in the final revised version

      R1: Fig. S2G: I also believe that authors should report representative images of cells treated with Congo Red, in which Syn-filament biogenesis is prevented;

      Response:As instructed by reviewer, the images are included in Fig. S2G.

      R1: Fig. 2Eiii: The stick arrowhead seems to indicate a separate blob that is not so red: authors should consider to show separated channels and not only the merged picture (as in Fig. S3).

      Response:We agree with the reviewer that the blob is not so red. We could not accommodate the separate channels in the main figure because of space constraint. Therefore, we presented the separate channels in Fig. S3A. Now we are including the stick arrowhead also at Fig. S3A.

      R1:Page 10: authors should explain why they performed the LC3 staining;

      Response:Previous reports indicated association of LC3B with α-Synuclein inclusions in neurons (PMID: 21412173, 31375560). Therefore, we also stained our cells with LC3 antibody. The references are now incorporated in Page 10.

      R1: Why in Fig.2i, SNCA(DM) the ubiquitin signal is pink and not red?

      Response:The blue of the DAPI is slightly overlapping with the ubiquitin staining at the aggresomes as these bodies are perinuclear making it appear pink. Separate channels are provided in Fig. S3E.

      R1: Fig. 3, western blotting: as I previously reported, I think it would be better to write "total α-syn" instead of SNCA. Fig. 3D: is should be useful to explain properly the content of the soluble and insoluble fractions.

      Response:We agree with the reviewer. SNCA in western blots has been changed to α-Syn all the figures and figure legends.

      R1: Explain in the legend of Fig. 4 what is h2b tdTOMATO

      Response:We thank the reviewer for pointing out the lack of information. This is now included with a reference in the revised manuscript.

      Significance

      R1: Overall this is interesting to read, a lot of data are presented, demonstrating a new potential phenomena that would be important to a specialized audience in the field of synuclein misfolding, aggregation and cellular toxicity.

      Response: We have now included immunofluorescence images of post-mortem thalamus sections of a Parkinson’s Disease (PD) patient (BioChain Institute Inc., USA Cat# T2236079Par) and a control (BioChain Institute Inc., USA Cat#T2234079). Our experiments clearly show lamina deformities in patient brain (Fig. 6D). We think that these experiments will highlight the pathophysiological relevance of the manuscript to make it appropriate for a wider audience.

      Reviewer 2

      __R2:__The present paper titled "Nuclear-injuries by aberrant dynein-forces defeat proteostatic purposes of Lewy Body-like Inclusions" provides an in details and compelling study about the formation of aggregates of SNCA in presence of PFFs, which other proteins play a role in the formation of this inclusions, and which pathways are the major players. They study provides many well-done experiments to highlight the composition and the process formation of these aggregates. unfortunately I think the study is lacking in connecting these events with neurodegeneration. how do all the pathways study impact viability and functionality of neurons and other disease relevant cells like astrocytes and microglia? it is thus a work which mainly focuses on the pathways leading to the formation of inclusions leaving untouched the question of how this might impact the disease. This does not take away the value of the findings but it should be taken in consideration when deciding which journal to submit.

      Response:We thank the reviewer for the encouraging words and also for bringing out the lack of pathophysiological relevance in our manuscript. To address, we have performed immunofluorescence experiments with post-mortem thalamus sections of a Parkinson’s Disease (PD) patient (BioChain Institute Inc., USA Cat# T2236079Par) and a control (BioChain Institute Inc., USA Cat#T2234079). Our results show extensive lamina deformities in patient brain (Fig. 6C-D) connecting neurodegeneration in PD with lamina injuries.

      Further, although we found that LB-containing primary neurons and Hek293T cells do not show any loss in cell viability as estimated by LDH and MTT assays respectively (Fig 4A-B), they show sensitivity to additional stresses. LB-like IB containing Hek293T cells were unable to trigger stress response pathways and were vulnerable to heat stress. These results were already included in the earlier version of the manuscript (Fig. 4H-I). We now estimated sensitivity of neurons in presence of additional stress. We have subjected LB-containing neurons and control neurons to heat stress and estimated induction of Hsp chaperones by western blot and quantitative mass spectrometry. Preliminary results (included herewith) indicate that Hsp-upregulation is defective in neurons with LB-like IBs. These results are now included as Figure 4J-M in the attached revised manuscript. Repeat experiments with quantitative mass spectrometry will be included in the final revision.

      R2: I have a few suggestion for each figure which will not take much time, energies or expenses but that would overall make the paper easier to read and digest.

      R2::Fig 1: quantification of aggregates dimension, number and colocalization score with p62 (Pearson)

      Response:Co-localization score with p62 is included in the current revision (Supplementary note: Colocalization figures and table). Quantification of aggregate dimension, number etc. in neurons have been already documented by Mahul-Mellier et al. (PMID: 32075919). We are following the same protocol and therefore did not repeat the counting for neurons. However, if the reviewer thinks that its mandatory, we shall do that and include with full revision.

      __R2:__Fig 2: aesthetic comment: the way to read the figure should be consistent throughout the figure. they should be assembled either all in vertical or all in horizontal.

      Response:We tried. We find the current organization is the best fit to accommodate all panels.

      R2: Fig 3: 3E better to put an image without nocodazole to visualize the difference

      Response:The control image is now added in Fig. 3E.

      R2: 3D probe WB also for SNCA

      Response:Sorry for the confusion. The western blots in 3D are probed for both total Synuclein and p129-Syn. As suggested by the first reviewer, we have also changed SNCA to α-Syn which indicates the total Synuclein protein level.

      R2: 3K this WB needs quantification to backup the statement made

      Response:We are repeating this experiment. Results will be included and discussed in the final revised version.

      R2: 3I check the - and + for PFF and doxy. I believe they are wrong

      Response:We have rearranged the figure. The scheme in Fig. 3I (now Fig. 3H) is correct but we have made it simpler to avoid confusion.

      R2: Fig 4: missing IF of peri nuclear IBs with HS

      Response:The images are now included as Fig. S4E and discussed in page 19.

      R2: Fig 5: quantification of H2BTdTom exit from the nucleus

      Response:We have performed this experiment as a supporting evidence of the nuclear damage in presence of LB-like IBs. We have quantified the damages in Fig. 5A and D. We have also performed quantitative mass spectrometry to show nuclear entry of associated organelle proteins (Fig. S5G). We think, quantifying the H2BTdTom exit will not be a significant value addition to the manuscript.

      R2: Fig 6: some neurons with large PFF seems very unhealthy. is it possible to quantify neuronal viability may not with MTT which is not suited for single cells analysis?

      Response:The reviewer correctly pointed out that neurons with large LB-like IBs seemed unhealthy which was confirmed by ƴH2AX staining indicative of extensive DNA damage in Fig. 6B.

      R2: maybe it would be nice to have a WB with soluble and insoluble SNCA and p129 with ciliobrevin D with and without PFF. Ciliobrevin D might also impact degradative systems as demonstrated by the EHNA compound (PMCID: PMC5584856).

      Response:We have performed the dynein experiments to figure out the role of cytoskeleton-nucleoskeleton tension in the lamina injuries in LB-like inclusion containing cells. However, we think that the reviewer has correctly pointed out that dynein may have a direct role in degrading Synuclein by either autophagy or proteasome. Given the results of the suggested experiments are not going to change the final conclusion of the manuscript, we propose to limit ourselves in discussing this possibility and citing the paper in the current revised version of the manuscript (page 29).

      Significance

      R2: As already stated above, the experiments are correctly performed and the evidence are well-presented and demonstrated. the realm that this paper falls into is not though neuroscience. The aim of this paper is to study the formation of inclusions regardless of their impact on disease-relevant cell type functions. the presented experiments are numerous and even though the message is pretty clear some figure might be too crowded to correctly convey the message (see fig 3). some of these findings even tough with much less details were already suggested by other papers (PMCID: PMC5584856) in which the importance of the dynein was studied in the context of the communication between autophagy and proteasome. I think adding this angle with few experiments might add a little bit more relevance but it is also true that this paper has already a lot of data.

      Response:Thank you very much for the encouraging comments

      R2: the type of audience for this paper I think is a very specialized audience which is interested in molecular mechanisms of inclusions formation and protein-protein interaction. as a final statement the paper is beautifully done and is relevant but it lacks the translational angle.

      Response:We again thank the reviewer for reminding the lack of pathophysiological relevance. We have now included microscopic images of brain slices of PD patients with extensive lamina defects (Fig. 6D) and think this will attract a broader audience.

      R2: my field of expertise is neuroscience. I have expertise in bimolecular techniques as well as cellular techniques to study neurodegenerative diseases

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, the authors investigated the role of AF10, a subunit of DOT1L histone-methyl transferase complex for writing H3K79me1-2-3 marks, in cellular reprogramming. Using siRNA-mediated knockdown and chemical inhibitors, the authors show that AF10, and DOT1L as a whole, are inhibitory to reprogramming of mouse embryonic fibroblast cells (MEF) to induced pluripotent cells (iPSC), suggesting that AF10 plays an important role in determination and changes in cell lineages. The authors also show that this effect of AF10 is not transcription mediated. Based on their ChIP experiments of H3K79me1,2,3 and RNA Pol II, the authors claim that the effect of AF10 is mediated by "changes in epigenome circuitry".

      Major comments

      1. The claim that AF10 and DOT1L inhibits reprogramming of MEF to iPSC is largely supported by authors' experiments. Mostly, the authors used expression levels of NANOG as a mark for pluripotency. While it is a well-documented mark, an orthogonal mark (such as colony morphology, embroid bodies, etc.) will increase the rigor and confidence. This is especially important in the context of testing something like DOT1L complex which plays important role in transcription.
      2. The data presented here largely supports the claim that AF10-mediated effect is not through transcription.
      3. The authors final model ¬- "negative feedback by RNA-PolII recruited DOT1L leading to ESC-like state" - is not supported by the data presented here.
        • For example, at line 295, the authors say that H3K79me1 pattern in ΔAF10 "resembles the H3K79me1 found in ESCs which are much more TSS-enriched for this modification compared to MEFs." However, the data in 5H show that the pattern in ESC matches more with AF10 fl than ΔAF10.
        • At line, 299, "given that AF10 deleted cells retain H3K79 methylation..". This statement highly contradicts data in 4B, 4C, 5G and 5H where it is shown that deletion of AF10 leads to substantial loss of H3K79me1,2.
        • While the authors showed there are changes in H3K79 methylation pattern upon AF10 deletion, its link to changes in iPSC reprogramming is not shown. The Pol II occupancy data, shown for WT MEFs and ESC, do not support any of part of this claim. Even further, there is no evidence for changes in Pol II occupancy levels upon AF10 deletion.
      4. How do authors reconcile that there is increased expression of AF10 in pluripotent cells (Fig. 1A and 1B) although it inhibits pluripotency?
      5. Line 341, "We do not find any evidence that H3K79me2 opposes spreading of H3K27me3 in reprogramming to iPSCs" seems to be an over-interpretation. The experiment just shows that inhibition of PRC does not change global H3K79me2 levels. A direct role of H3K79me2 on H3K27me3 is not tested here.
      6. Fig S1D shows that deletion of AF10 can have additional effect to inhibition of DOT1L. This is in contrast to most of the main figures, especially, fig 1E. Some comment about this discrepancy is warranted.

      Minor comments

      1. It might help the reader if authors put a schematic of reprogramming regimen for Fig. 1A.
      2. At line 146, the authors inference " ΔAF10 is estimated to contribute about 40% of the DOT1Li phenotype in reprogramming" is not clear. It may help the reader the reader if more information is provided for their analyses and interpretation.
      3. Line 324, a typo: it should be "AF10"
      4. Line 456, It might be better for readers if the authors report whether and how RT-qPCR was normalized to housekeeping genes etc.
      5. Line 582, It is not clear at what step human cells were spike in. Also the type of human cells should also be reported.
      6. At many places (e.g. Fig 1E, Fig S3D) authors seem to have used multiple t-tests. Please consider using something like ANOVA to avoid multiple t-test error.
      7. Fig 1E. It is commendable that authors show factor independent reprogramming. It will be helpful for readers if authors show number of days for OSKM-dependent and OSKM-independent growth in the schematic.
      8. Fig S1C is not clear as such. Please add more information in the figure or legends.

      Referees cross-commenting

      With regards to reviewer1's comments: I particularly agree with major points 1 and 2 that authors' current model regarding feedback regulation needs more evidence. The technical concerns regarding ChIP normalization, esp. point 5, are also well-warranted.

      With regards to rev3's comments: The major concern about another similar study is well-warranted. The authors may want to explicate compare and contrast their key inferences with the other study.

      Significance

      The present work provides good evidence that AF10-mediated H3K79me can contribute to cellular reprogramming independent of steady-state mRNA levels. However, I think that the manuscript falls short of providing the basis for it. The claim that it is through subtle changes in H3K79me patterns seems nebulous and unsupported by the data presented here. If the manuscript finds the mechanistic basis for AF10's role in cellular reprogramming, it will be of interest to readers in general epigenetics as well as clinical fields that use histone methyl transferase inhibitors for treating leukemia.

      I am not an expert in the field of cellular reprogramming; so, I may not be able to judge the merits or caveats of authors' reprogramming methods and analyses.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01682

      Corresponding author(s): Peter Keyel

      1. General Statements

      We thank the reviewers for their thorough and critical analysis of our manuscript. We have addressed most of the concerns and questions with our revised version. To address the remaining concerns, we plan to perform two lines of experiments— aerolysin sensitivity of dysferlin null C2C12 muscle cells and aerolysin sensitivity of ESCRT-impaired cells. When these experiments are complete, we believe the revised contribution will provides important novel insights into membrane repair that will appeal to a broad audience.

      Reviewer comments below are in italics.

      Description of the planned revisions

      Reviewer 1

      Major

      In order to show that patch repair is indeed protecting cells against aerolysin, the authors should disrupt patch repair of the cells under study and observe and increased toxicity.

      Reviewer 2

      Major

      *1. The effect of dysferlin overexpression does not indicate that patch repair is a protective mechanism or that dysferlin plays a significant role in aerolysin resistance. The authors should knock out dysferlin and assess cell resistance to lysis. *

      Reviewer 3

      Significance

      The work presents a foundation to further investigate into the mechanism of aerolysin function, following the discovery of the role of extracellular Ca2+ in its activity. As aforementioned, the role of dysferlin in resisting aerolysin also has potential, but the limitations of this work were discussed including the absence of performing a dysferlin knockout, although performing this experiment may help to strengthen the current finding.

      We agree with all 3 reviewers that a dysferlin knockout will complement our gain-of-function studies and this will strengthen the manuscript. We plan to challenge C2C12 myocytes that express control shRNA or dysferlin shRNA with toxin and determine their sensitivity.

      We chose this system instead of targeting a patch repair protein in HeLa cells for 3 reasons. First, it will provide the corresponding loss-of-function experiment to match the gain-of-function experiments we have already done. Second, other patch repair proteins work redundantly with other proteins, complicating their knockdown and/or their disruption may interfere with lipid/protein transport. Finally, dysferlin null C2C12 cells are commercially available, so other groups will have an easier time replicating our results.

      Reviewer 1

      Significance

      *and in the statement that a cellular process that has been artificially introduced in the experimental system is the cellular protection mechanism against aerolysin attack. In order to prove that this process is a bona fide protection mechanism, the authors should show that it is present without the need of overexpressing a protein that is not expressed at all either in the used cell line (HeLa), or in the natural cellular target of aerolysin (epithelial cells). The significance of the proposed protection mechanism is therefore questionable. *

      We plan to address this concern by using C2C12 muscle cells that have and do not have dysferlin. Muscle cells are natural cellular targets of Aeromonas during necrotizing soft-tissue infections.

      Reviewer 2

      Major

      *2. ESCRT complex was shown to play a role in plasma membrane repair following mechanical damage or perforin treatment of cells (Jimenez 2014, and Ritter, 2022). Whether ESCRT is important in aerolysin pore repair can be assessed by knocking out the Chmp4b gene or overexpressing dominant-negative mutant of VPS4a, E228Q. *

      We plan to use a previously characterized (Lin 2005 PMID: 15632132) inducible system (TRex cells) to express the dominant negative VPS4b E235Q in cells. We plan to pulse cells for 2 h with 1 ug/mL doxycycline one day prior to the assay. This pulse time and dose strikes a balance between cell death due to non-functional ESCRT, and compromising ESCRT function. Then we will challenge parental cells (TRex) or TRex cells expressing VPS4b E235Q with toxin and measure lysis. We also plan to compare plus/minus doxycycline as a further control. We will also use fluorescent toxins to compare binding across cell types.

      One caveat on the ESCRT work is that ESCRT has an essential role in MVB formation, and ESCRT effects might be due to perturbation of protein/lipid flux through this system in addition to their recruitment to the plasma membrane. Even with knockdowns and overexpression, it can be challenging to interpret some of the pleiotropic effects of altering the ESCRT complex. While we do not contest the role for ESCRT in plasma membrane repair, we suspect the role for ESCRT will be more complicated than previously appreciated. Digging deeper into these possibilities beyond our proposed experiment is beyond the scope of this manuscript.

      Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer 1

      *Major: The authors conclusions contradict established results, which they cite. Yet experimental conditions are not similar in two ways: toxin concentration-wise and toxin treatment duration-wise. *

      We agree with the reviewer that there were differences in experimental design between our study and the other cited studies. Due to the cited differences, our results, Gonzalez et al and Larpin et al are not necessarily contradictory on most points. Our conclusions differ from Gonzalez et al in that we do not think K+ efflux drives repair in the first hour, and differ from Larpin et al in that we observe Ca2+ flux after aerolysin challenge. Along with the toxin variables discussed below, we also discussed the potential cell type differences between the studies that may account for the discrepancy. We have now included these additional differences in our manuscript on line 435 for Larpin et al and lines 423-425 for Gonzalez.

      Our study set out to do something distinct from the prior studies. The prior studies did not compare the efficacy of distinct membrane repair mechanisms to the same toxin because that was not their study aim. Hence, our goal is not to prove the prior literature wrong, but contribute to a better understanding of the immediate membrane repair events triggered by aerolysin. We argue that the significance of our contribution is this comparative approach to membrane repair, which has not previously been done, and our finding that aerolysin engages distinct, but overlapping mechanisms compared to CDCs. We have updated our significance to better convey our advance, which is explained on lines 99-102, 128, 519-525.

      *While we appreciate the efforts of the authors to standardize the concentration of toxins used based on hemolytic units, we note that the concentrations used are very much higher than in the other studies cited. Indeed, based on table 1, materials and methods, and the various experiments, aerolysin has a LC50 of approximately 200 HU/ml, which corresponds to about 2 ug/ml. This is approximately 200x more concentrated than for example in Gonzalez et al 2011 and Larpin et al. 2021. It makes the validity of direct comparison with those studies questionable. *

      We agree with the reviewer that the toxin concentrations are different from prior studies. This is why we argue hemolytic activity needs to be reported along with toxin mass.

      One potential explanation for this difference is purification method. We do nickel NTA purification from whole bacterial lysates, instead of from the periplasm. It is possible that the most active aerolysin precipitates early or is otherwise lost in our purification process, which accounts for both the lower toxin specific activity and lack of toxin precipitation during trypsin activation that we observe. To control for impurities, we purified two preps of our aerolysin to >90% purity after nickel beads. However, we did not observe a significant change in specific activity or cytotoxic activity. We interpret this finding to suggest there was a trade-off between improved specific activity due to increased purity and loss of specific activity due to toxin inactivation during the extended purification process.

      We have included a new figure (Fig S10) showing our toxin purification and activity.

      *We noticed that the authors activate pro-aerolysin at high concentration (in the range of 1 to 5 mg/ml) and at room temperature. In our experience, under these concentration, activation leads to immediate oligomerization and massive precipitation. The final concentration of active toxin is thus unknown. *

      When we titrated the trypsin to determine the optimal concentration of trypsin to use, we did not observe oligomerization/precipitation (Fig S10B). If there was precipitation of aerolysin after trypsin treatment, we would expect a difference in cytotoxicity between pro-aerolysin and aerolysin treatment. We did not observe significant differences in cytotoxicity between pro-aerolysin and activated aerolysin (see Figs 1-2). Finally, we measured hemolytic activity on trypsin-activated toxin, so any precipitation would be expected to occur prior to assessing hemolytic activity. Thus, we argue our use of hemolytic activity measured after trypsin activation mitigates this risk.

      * The authors keep their cells in toxin-containing medium for the whole duration of the experiments, typically 45 minutes. This is in stark contrast with 45 seconds to 3 minutes transient exposure to toxin in Huffman et al 2004. *

      We agree this is one of the differences. We also note Huffman et al examined cells at 6 or 28 h later. While we ruled out the impact of MAP kinases on membrane repair occurring within 30 min of toxin challenge, we make no claims about their ability to promote cell survival at later time points. We have clarified these differences in the manuscript (line 461).*

      The authors do not report binding and oligomerization assays of the toxins. The only figure showing a western blot (fig. 7) is of low quality and shows unexpected observations. Aerolysin Y221G mutant is expected to bind and oligomerize. Yet, no band is present at about 250 kDa (expected oligomer) or at about 47 kDa (monomer). In addition, in aerolysin lanes (1 and 2) the oligomer is saturated, seems to be covering three lanes, indicating a possible spill-over. *

      We performed binding studies in Fig S3C and Fig S5. For Fig 7, in the original blot, the cell lysate is a wider band than the MV band, but there are only two bands, that remained in their respective lanes. We have now included another independent biological replicate of the aerolysin blot as Supplementary Fig S7D which shows clear demarcation between cell lysate and MV pellet. This blot was not included in the main figure because in the process of stripping and reprobing for all of the targets, we lost detection of our penultimate targets. We agree with the reviewer that oligomer bands for the Y221G were very faint, and we expected them to be stronger. In the new blot (Fig S7D), some oligomer can be detected. As a result, we are hesitant to risk over-interpreting these findings.*

      Finally, while the patch repair hypothesis is interesting, it is unclear why the authors decided to overexpress dysferlin in cell lines that normally do not express it. Sure, there is a repair phenotype but this phenotype is artificially introduced. Dysferlin is not expressed at all in HeLa cells. *

      One challenge with membrane repair is the difficulty perturbing the system due to redundancies. While loss-of-function experiments are important, gain-of-function experiments also add confidence to the system. The simplest way to perform a gain-of-function experiment is to add a well-known patch repair protein to a well-characterized cell line lacking it. Thus, exogenous expression of dysferlin enables us to test the hypothesis that increasing patch repair enhances repair against the toxins.

      We have included this rationale now in the manuscript, lines 366-369

      *Furthermore, dysferlin is not expressed in epithelial cells, which are the prime target of aerolysin. Why then focus on this protein? *

      We chose dysferlin because it is well-characterized as a patch repair protein, whose defect causes Limb-Girdle Muscular Dystrophy 2B and Miyoshi Myopathy. Additionally, setting up this assay enables future work to probe the role of individual dysferlin domains in patch repair.*

      Minor: The graphic legends should be boxed out to be clearly separated from the data. In Figure 4A, it is mixed up with the data. *

      This has been corrected.*

      Some western blots are saturated, e.g. B-actin in figure 4B. Full blots should be provided. *

      We have added full western blots as requested as Supplementary Figs S11-12.*

      In the methods, aerolysin sublytic dose for HeLa cells is specified at 62 HU/ml. In figure 5C and D, 31 HU/ml kills more than 50% of HeLa cells. This is not compatible. *

      Even when controlling by hemolytic activity, and toxin prep, we find some variability in toxin activity between assays. For the live cell experiments, 62 HU/mL remained sublytic despite the higher activity in the flow cytometry assays. We controlled for death in our live cell imaging experiments, by including TO-PRO. This confirmed the toxin was at a sublytic dose in those experiments.

      We included a new figure S10C to show the variation in LC50 per assay as a function of toxin specific activity. We have clarified that the sublytic dose was for live cell imaging experiments, lines 640-641.

      *Figure 2A and B have quite different LC50 for starting conditions ({plus minus} 200 HU/ml in A, 600-700 HU/ml in B). Why is it so different? Y-axis has a linear scale in A and a logarithmic scale in B. It would make comparison easier to have the same scale in both panels. *

      We agree there is variability between assays. We note that toxin doses change vary in other manuscripts that report toxin mass. For example, aerolysin varies by 10-fold (2 – 20 ng/mL) between figures in Gonzalez et al 2011. We interpret this variation as a common challenge for toxin studies. We mitigate this challenge by including controls for each assay so the relative change can be assessed. We provide additional transparency by including Fig S10 to show batch-to-batch variability of both our toxin preps and assays.

      We have changed the scale to linear in Fig 2.*

      The letters detonating statistically significant groups are sometimes unclear. For example in Figure 1A and B, PFO belongs to group a and b simultaneously. What does this mean? *

      Samples that share letters are not statistically distinct from each other. In the example cited, PFO is not statistically significant compared to all other bars with an a and is not statistically significant compared to all other bars with a b. While confusing at first, the alternative is a mess of stars and bars.

      This has been explained in lines 981-985.*

      In Figure 8, aerolysin hat a LC50 in cells overexpressing GFP-Dysferin of approximately 1700 HU/ml in A and of approximately 400 HU/ml in B. Why is it so different? *

      This is due to intra-assay variation. We include controls for each assay to ensure the trend remains consistent.*

      In Figure S1, it is unclear what the plots « all events » vs « single cells » mean. *

      We have clarified these plots.*

      In the discussion, the authors write « First, survival did not correlate with overexpression, which would be expected if dysferlin acted as Ca2+ sink ». What is meant? GFP-dysferlin overexpression does correlate with survival in Figure 1A. *

      We meant that the extent of Dysferlin expression did not correlate with survival. If Dysferlin acted as a calcium sink, cells expressing 100x dysferlin levels should be more resistant than cells expressing 1x dysferlin levels. If Dysferlin needs to serve a cellular function, the brightest cells may not be more resistant (or even be less resistant due to aggregates, etc). We checked to see if the brightest Dysf+ cells had better survival than the dimmest Dysf+ cells. They did not. However, all Dysf+ cells had better survival than Dysf- cells.

      We have updated the manuscript (lines 496-498) to reflect these changes.

      Significance

      *General assessment: The study strength lies in the several possible protection mechanisms that are tested. The weaknesses lie in the contradictions of the results reported here with established mechanisms, *

      We disagree with the reviewer that findings that contradict previously proposed mechanisms are a weakness for significance. Instead, we argue this is a strength of our study’s significance. Replication of prior studies’ conclusions using distinct experimental conditions is critical for the reproducibility and rigor of the underlying science, and may give new insights into toxin biology. While we acknowledge the differences in approach, these differences narrow the prior mechanisms that may have been assumed to be widely applicable. The finding that they cannot be replicated in our system suggests one or more of the differences between the studies may drive a critical aspect of aerolysin biology. For example, the Ca2+ difference with Larpin et al could be due to a cellular Ca2+ channel present in HeLa cells that is absent in THP.1/U937 cells.

      This distinction is expected to spur additional research in the aerolysin field.

      * Advance: The study contradicts previously established results but the experimental conditions used here are quite different to those used in the earlier studies, which makes the comparison quite difficult. As such it does not really fill a gap. *

      We have rephrased the significance to better convey both the gap our study fills in membrane repair and the advance that it has made. See lines 99-102, 128, 519-525.*

      Audience: The study will be of interest of specialized audience. *

      Given the emerging broad importance of membrane repair in response to endogenous pore-forming toxins, and the large gaps in the field of membrane repair, we respectfully disagree with the reviewer. We have revised our significance statements to better convey this broad appeal. See lines 99-102, 128, 519-525.

      Reviewer 2

      Major

      *3. I find the optimisation of lysin concentrations and data presentation quite confusing. I eventually understood, what was done, but I feel that the authors should be able to transform the data and plots so these are more accessible to a reader, eg a simple dose/time-response curves would be very helpful in that respect. For example, in Figure S1E, why does aerolysin appear to be less cytotoxic after 24 hrs than after 1 hr. In principle, I would expect to observe an additive effect, i.e. cell death at 1, 3, 6, 12, and 24 hrs should add to 100%; however, if 100% cells die at 500HU/ml, how can more cells die after 24hrs? Or am I missing something in the experimental design/data presentation? *

      We agree that presenting the results from cytotoxicity can be challenging. We use LC50 in the main text because it is easiest to understand. However, we provide all dose-response curves underlying those numbers in the supplemental data. We recently published our approach to assays and data analysis (Haram et al PMID: 36373947) to make it easier to understand.

      In Fig S1E, each time point is a distinct assay. In contrast to the approach suggested by the reviewer, where we read the plate at different timepoints, we used different replicates to generate the time points. As a result, the % will not add to 100. Instead, we observe that the majority of cell death occurs in the first hour. We have clarified our discussion of Fig S1E, lines 154-155.

      At 24 h, it is possible that cell growth interfered with the assay. The plate has a finite surface area. If control cells are confluent near the start of the assay, but toxin-treated cells are not due to cell death by aerolysin, the growth rates may not be equal. Since our focus is on proximal membrane repair events, and not on late signaling events, pursuing this further is beyond the scope of the current manuscript.

      *I also wonder whether using haemolytic units is appropriate (it may well be, if justified), given that the toxins used here have various membrane-binding properties. Wouldn't it make more sense to compare the cytotoxicity using nucleated cells? *

      We agree with the reviewer on the need for standardization, and do compare cytotoxicity using nucleated cells (HeLa). Our first level of standardization is the use of hemolytic units instead of toxin mass. This normalizes toxin activity to the ability to kill human red blood cells, which are widely accepted as having minimal membrane repair mechanisms. This gives us a baseline activity, and allows us to control for toxin impurities/differences between toxin preps/toxins. We prefer cytotoxicity over membrane binding for our baseline because it is a functional assay.

      After this first level of standardization, we compare the cytotoxicity in HeLa cells. This is one reason why the majority of our assays are performed in HeLa cells—we know how they behave at different toxin doses in our hands, the cells are easy to use, and we can standardize assays in the lab. We included HeLa cells as a control in Fig 5 to show the standardization requested by the reviewer. We split Fig 1 up differently to better convey the results.*

      1. The authors use "sublytic" concentrations of aerolysin (64HU) throughout most of the paper, but according to Figure S1C, 50% cells died at that concentration after 1hr, suggesting that when the cells were investigated over a shorter period of time, they were already dying - it's almost like the cells had life support turned off, but still being investigated as though they survived aerolysin treatment. This needs to be clarified or reassessed. *

      We agree with the reviewer that we did not track cell survival beyond 45 min in our live cell imaging assays. We labeled cells as ‘surviving >45 min’ to acknowledge the fact that these cells could have died at 46, 47, 60, or 600 min after the experiment ended. We focused on time points earlier than 45 min because proximal membrane repair mechanisms are expected to have occurred in that time, and had time to complete. We have updated the manuscript on lines 214-215.

      We next considered the reviewer’s excellent point that the cells alive at 30-40 min could be executing a cell death program. If this were the case, then based on our FACS data (Fig S1C), we would predict ~50% of total cells would be dead by 1 h. From Fig 3A, ~35% of the cells died in the first 45 min. From the remaining 65%, we would predict another 15% dying from this programmed cell death pathway, which would be 15/65 = ~25% of the surviving cells. We did not notice 1/4 of the surviving cells behaving distinctly. For example, the large error bars in 3H is due to a range of cell behaviors that we could not easily subgroup. For individual cells (shown in Figs 6 and 7), there is similarly no clear demarcation of 1/4 of the cells. While we see a gap with pro-aerolysin, that is ~1/3 of the cells (not the expected 1/4), and it is not repeated with aerolysin. While we can’t rule out a cell death program contributing to the top or bottom 1/4 of our results, removing the top or bottom 25% of data points would not alter our major conclusions from the live cell imaging. If a programmed cell death pathway that occurs in the 30-90 min range is identified for aerolysin, it would be interesting to see how that pathway changes repair kinetics. However, that would require identification of the death pathway.

      *

      1. What effect does the addition of 150mM KCl have on the plasma membrane, trafficking/repair - wouldn't the plasma membrane be depolarised? There were a number of papers by John Cidlowski in mid 2000s, where his team explored the effect of potassium supplementation on apoptosis - this may be worth exploring. *

      We thank the reviewer for suggesting these interesting papers. We have explored these papers, and our understanding of them is as follows. Franco et al 2008 PMID: 18940791 shows that ferroptosis is independent of high extracellular K+. This contrasts with Fas-dependent apoptosis, which is suppressed by high extracellular K+. This is consistent with the Cidlowski group’s other work (eg Ajiro et al 2008 PMID: 18294629) and Cohen’s group (eg Cain et al 2001 PMID: 11553634) showing that apoptotic DNA degradation performs better at low K+, and extracellular K+ interferes with apoptosis. Similarly, other papers have shown that NLRP3-activated pyroptosis can be blocked by addition of extracellular K+. Depletion of intracellular K+ inhibits endocytosis and other vesicle trafficking pathways.

      While these are good papers, they do not directly relate to our K+ findings, which is that blocking K+ efflux via elevated extracellular K+ levels has no impact on aerolysin-mediated killing. Therefore, to stay focused on the repair pathways, we opted not to include these papers to avoid distracting the reader from our key points. *

      1. Figure 3 and accompanied text: it would be more informative to show all the data rather than breaking it down to 45 min. In my view, *

      We have added histograms to show when individual cells died during the assay as supplemental Fig S3E. We used the three bins for the exact reason articulated by the reviewer—we wanted to consider cells that died fast vs slow differently. However, in order to interpret the data, a cutoff of 5 min was chosen as optimal. While we agree with the reviewer that the 5 min death could be dismissed, we presented the data to avoid questions about why we omitted those data.*

      1. I am curious whether EGTA diffuses into the cytosol through aerolysin pores. If so, then unlike BAPTA-am it would affect Ca inside and outside the cell. *

      We agree with the reviewer this is an interesting question. While EGTA might diffuse into the cytosol, its binding properties suggest it would be unsuitable to block cytoplasmic Ca2+ transients (see Nakamura 2019 PMID: 31632263). BAPTA binds to Ca2+ ~40x faster than EGTA, which enables it to capture Ca2+ prior to Ca2+-binding proteins. In contrast, EGTA is thought to be too slow to sequester intracellular Ca2+ before Ca2+-binding proteins. While EGTA might perturb Ca2+ close (

      *Are the authors confident that in the absence of extracellular calcium (EGTA treatment), aerolysin formed the pores at all? Have they looked, for example, at intracellular Na/K, or have any other evidence of membrane disruption? *

      Prior structural studies suggest that Ca2+ is not required for aerolysin pore formation. For example, Iacovache et al (2011) PMC3136475 induce oligomerization with low salt and pH 2+. Cryo-EM from the same group (Iacovache et al 2016 PMID: 27405240), showed pore formation under similar conditions.

      In Fig S3, aerolysin kills in the presence of EGTA at higher concentrations, suggesting that it can form pores when EGTA is present. Also, in Fig 2D, we used Tyrode’s buffer, which was made without Ca2+ or EGTA. We added the indicated amounts of Ca2+ in, and observed a reduction in lysis at low [Ca2+]. This argues against EGTA interfering with toxin oligomerization/pore formation because EGTA was not present, and the toxin still failed to kill.

      We have updated the manuscript (lines 203-205) to emphasize this point.*

      1. Figure 6 (and some other): I find the designation of statistical significance (a-f) quite confusing, as it is unclear which comparisons are statistically different. Looking at Figure S5, there was no difference between the effect of Annexin depletion on the toxicity of the three lysins. *

      Samples sharing the same letter are NOT statistically significant. This is done to avoid a mess of stars and bars with multiple comparisons. This has now been explained in lines 981-985.

      For Fig 6/ Fig S5 (now S6), there was a statistically significant difference in LC50 between control siRNA and Annexin knockdowns for SLO. We agree that visually the dose-response curve in Fig S6B looks similar. However, we note that the x-axis is a log2 scale, and the control line is distinct over the 250-1000 region. When we calculate the LC50, these differences give different LC50 values. Over multiple reps, these differences were consistent enough to be statistically different.

      Significance

      *The paper attempts to address an interesting question of aerolysin pore repair, and it is interesting from the perspective of a potential difference between various pore-forming proteins. *

      We agree with the reviewer and thank the reviewer for this assessment.*

      The study will be potentially interesting to a broad audience of biochemists/cell biologists and microbiologists working in the field of pore-forming proteins/virulence factors. *

      We agree with the reviewer and thank the reviewer for this assessment.

      Reviewer 3

      *Major comments In the first instance, the authors use a method of assaying the specific lytic activity of aerolysin in comparison to a number of different CDCs. Whilst it is acknowledged that these methods have been published in peer-review papers previously (e.g. Ray et al., Toxins, 2018), it would be great to have more information of how the specific activity is derived. Currently there is a convoluted method that makes a number of assumptions such as, but not limited to, 1) the number of dead cells measured in the FACS experiments is proportional to the activity of the different classes of PFPs however the authors do not show how they account for PFPs leading to loss of cells into debris which would involve a total cell count and *

      We thank the reviewer for raising these concerns. We tested these assumptions in our previous papers. We compared the FACS assays to other assays that measure total cells (i.e. MTT assay), and found that the FACS assay corresponds with the MTT findings. These findings were published in Keyel et al 2011 PMID: 21693578 and Ray et al 2018.

      Loss of countable events to debris is detected in our assay as saturation of cell death at a number under 100%. Since we perform dose-response curves, we can determine when the killing saturates. This is why loss of countable events does not change our ability to accurately calculate LC50.

      2) how the inflection or linear point is identified on individual experiments (e.g. Supp. Fig. 1B, 2A, 2B, 3A, 3B to name a few) and how reliable these points are (e.g showing the data points with model sigmoidal (?) curve and corresponding R values).

      This had been calculated manually in the prior version of the manuscript. To address the reviewer’s concern and to improve data quality, we reanalyzed all of our data by fitting our dose-response curves to logistic models, and determining the LC50 using that model. An in-depth explanation of our approach was just published in Haram et al PMID: 36373947, which we now cite (line 821). *

      Furthermore, the batch-to-batch variability of protein samples presented in table 1 may be an issue where inactive but folded protein can affect the formation of homo-oligomer pores so more effort to reduce the effects of batch variation would be integral to the foundation of this paper. Given that aerolysin has a very different action on cells then this new characterisation should be provided regardless of what has been previously published by the authors on the activity of CDCs on the cells.*

      We agree with the reviewer that batch-to-batch variability is a key concern for pore-forming toxins. To address the concern of batch-to-batch variability and toxin purity, we have added Supplemental Fig S10. In Fig S10C, D, we plot the LC50 against specific activity of each toxin prep when used against control cells. We found a statistical difference in LC50 between two of our toxin preps, but not between any of the others. Notably, there was no association between increasing specific activity and LC50.

      Furthermore, we tested the impact of impurities on our toxin prep. While we purify most toxins only using His-beads (obtaining ~40% purity) (Fig S10B), we purified two toxin preps to higher purity (>90%) (Fig S10A). We did not observe differences in LC50 between these toxin preps. The specific activity for these toxins did not increase. We interpret that finding to indicate the gain in specific activity for purity was offset by the loss of specific activity due to prolonged toxin purification.*

      • Can the authors provide the raw data for the total FACS observations (scatterplot for all events) and show that there is no significant loss of cells? Or at least there is accountability of the cells? *

      Our stop conditions were to collect at least 10,000 gated events instead of running for a set period of time/set volume to determine cell density. We provide example scatterplots in Fig S1A.

      * - Can the authors provide more information about how the linear regression on Supp. Fig. 1B and other experiments showing the model sigmoidal curve performed such that this work is more reproducible? *

      We agree with the reviewer that using logistic modeling would strengthen the work. To address this concern, we reanalyzed all of our data and switched to logistic modeling. This improved reproducibility for many figures. Changes that add or remove statistical significance to results include Fig 4A, loss of significance between Ca2+/DMSO and BAPTA/DMSO, Fig 6C, loss of significance for siRNA knockdown of A6 vs scrambled for ILY, and Fig 8A/B, gain of statistical significance for GFP-Dysf protecting SLO. We have updated our results accordingly.*

      The SEMs of some data points (specific lysis LC50 scatterplots, for e.g. Fig. 2C, 4A, 4C, 8A and fMAX plots, for e.g. Fig. 3B) may not be apparently representative of the skew (e.g. and individual values (including outliers). A clarification of the statistical analysis behind the results may benefit in a clearer understanding of how the SEMs were calculated and presented in the main figures. Also, further elaboration on the meaning of the lettering in the scatterplots (denoted as a, b, c etc.) across the main figures may help improve the interpretation of the data. *

      The SEMs were calculated by Graphpad and graphs also generated by Graphpad. To address the reviewer concern, we have switched all places where we plotted individual data points to median with no error bars. This will enable the reader to judge skew, outliers, etc without reliance on error bars.

      We have now further elaborated on the lettering in the scatterplots. Samples sharing the same letter are NOT statistically significant. This is done to avoid a mess of stars and bars with multiple comparisons. This has now been explained in lines 981-985.*

      Secondly, the authors present interesting results on the significance of Ca2+ on aerolysin's mechanism behind lytic activity and introduces dysfurlin-mediated patch repair as the primary cellular resistance mechanism against aerolysin mediated lysis. Results from Figure 2-4, indicate that extracellular Ca2+ plays a role in aerolysin's function and cell lysis (aerolysin triggers influx of extracellular Ca2+). However, the results presented in figure 8 suggest an impairment of dysferlin translocation from the cytosol to the plasma membrane upon removal of extracellular Ca2+. If this were the case, wouldn't dysferlin impairment sensitise cells to aerolysin? Thus, in these sets of experiments it seems that Ca2+ is a confounding factor.*

      We agree that Ca2+ is a confounding factor, which is one reason we aimed to define better membrane repair mechanisms in response to different pore-forming toxins. Our interpretation is that Ca2+ triggers a death pathway that overcomes repair, and that aerolysin toxicity is due to the activation of this pathway. In this case, the impairment of Ca2+-dependent pathways does not reduce survival because the extent of damage is reduced/not present. Figuring out this death pathway is beyond the scope of the present manuscript, but a one future direction in which we are interested. This would also account for differences observed in different cell lines.*

      • Can the authors further elaborate on how the function of dysferlin in protecting cells against aerolysin contrasts to how aerolysin kills cells? *

      We have added the requested discussion to our manuscript, lines 519-525.

      *Finally, it is also interesting to see that cells deploy different resistance mechanisms between different families of pores. In saying that, the usage of CDCs seems to be inconsistent between each set of results. For example, intermedilysin (ILY) was used in the siRNA knockdown experiments but not in others such as Ca2+ influx assays, while PFO was only used for the initial set of results. A comment on this would benefit in understanding the rationale for selecting certain CDCs for each set of experiments. *

      We thank the reviewer for raising this point. We used SLO as the primary CDC in all the experiments because it is the CDC we have best characterized and have extensively published on. We included PFO in initial experiments to give readers a better idea of how multiple CDCs compare to aerolysin in target cells. However, since we’ve previously published on PFO, including it for later experiments would have increased cost and time of experiments without providing new knowledge.

      We used ILY because it binds to the GPI-anchored protein human CD59, so its binding determinant is more similar to aerolysin, which binds GPI-anchored proteins. We included it where practical to determine the extent to which targeting may change repair responses. Since ILY does not bind to murine cells, it was omitted from experiments using murine cells.

      We have added the rationale to the manuscript on lines 138-140.*

      Minor comments Results (Nucleated cells are more sensitive to aerolysin and CDCs) - A statement of the EC50 values of aerolysin and CDCs from the haemolytic assays would be beneficial to compare activities between the two pores. *

      The hemolytic activity is defined as the EC50 for the toxin in human red blood cells. The specific activity enables comparison of toxin activity, which is reported in Table 1. We have now added Supplementary Fig S10 which further plots the aerolysin and SLO specific activities against LC50 so that the reader can better assess batch-to-batch variability. In this study, we did not use enough batches of the other toxins to make this analysis useful for them.

      * - Figure 1A: As stated in the introduction, pro-aerolysin exists as a precursor that is functionally inactive unless activated by trypsin, furin or potentially other proteases. It would benefit the reader if an explicit statement were made about this activity and how it may come about in HeLa and 3T3 cells. Why is pro-aerolysin not shown in the Casp 1/11-/- BMDM cells? *

      The cell surface furin activity that activates aerolysin is not well-characterized across different cell types. We have revised the manuscript (line 76) to indicate these activities are present on the cell membrane.

      We omitted pro-aerolysin from the Casp1/11-/- BMDM because we performed those experiments earlier in the study before we started including pro-aerolysin. Based on the other results, we judged that the time and resource costs of adding pro-aerolysin in this system outweighed the gain to the story.

      * - Figure 1C: It was stated that "Casp 1/11 -/- Mo were ~100 fold more sensitive to pro-aerolysin and aerolysin compared to PFO and SLO" but did not show the activity for pro-aerolysin in these cells. *

      We thank the reviewer for catching this typo, and have corrected this statement (line 172).

      * - Supp fig 1E: Shouldn't 24 hr incubation of aerolysin to HeLa cells result in 100% specific lysis? *

      We agree with the reviewer that these results were surprising. At 24 h, it is possible that cell growth interfered with the assay. The assay well has a finite surface area. If control cells are confluent near the start of the assay, but toxin-treated cells are not due to cell death by aerolysin, the growth rates between control and experimental wells may not be equal. Since our focus is the proximal membrane repair events, and not the late signaling events, pursuing this further is beyond the scope of the current manuscript.

      * (Delayed calcium flux kills aerolysin-challenged cells) - What is the intracellular concentration of K+ normally in cells? Similarly, what is the intracellular concentration of Ca2+? *

      Intracellular K+ is ~140 mM (see Ajiro et al 2008 PMID: 18294629), while cytosolic Ca2+ is ~100 nM at rest.

      * - Figure 2C: Based on the description in the methods and results, both buffers are supplemented with 2 mM Ca2+ but one buffer (RPMI) shows more killing with SLO and ILY. Does this mean that both buffers contain 2 mM CaCl2? If so, what are the other potential reasons why one buffer enabled greater potency in CDCs? *

      RPMI has 0.4 mM Ca2+ prior to Ca2+ supplementation. However, the 2.4 mM Ca2+ did not provide improved protection compared to RPMI alone (See Fig 2 in Ray et al 2018).

      We suspect the various amino acids added to RPMI promote membrane integrity and account for the difference from Tyrode’s buffer. Glycine has previously been implicated in promoting membrane repair, but at higher concentrations than it is present in RPMI (0.133 mM in RPMI vs the mM concentrations used to protect cells). If other amino acids also protect, and/or why they protect is beyond the scope of the present work.

      * - Figure 3H: The data for aerolysin (WT) would greatly benefit for comparison to the inactive mutant (and indicate the sustained Ca2+ increase). *

      We have added this comparison, and updated the figure legend, line 1015.

      * - Supplementary Video V1: The addition of Triton X-100 permeabilises cells; however, this wasn't evident in (A). - Video V2: Similar to previous comment on Supplementary Video V1 (for B). *

      In V1A, the video was cut short to fit the play time with other videos. From addition, the triton takes a few minutes to diffuse to the cells and permeabilize them. In V2B, the cells do become permeabilized as shown by loss of the Ca dye. The cells are out of focus, which is why the nucleus TO-PRO is not detected.*

      (Calcium influx does not activate MEK-dependent repair) - Figure 4A: Effective ionic concentration inside and outside cell is increased (if intracellular Ca2+ becomes chelated); therefore, Ca2+ may enter the cell by passive diffusion or transport by other intrinsic Ca2+ channels. *

      There is already a very steep concentration gradient for Ca2+. The cytosolic Ca2+ is ~0.1 uM, compared with growth medium at 400 uM or assay buffer at 2400 uM. Chelation of the intracellular Ca2+ is not expected to increase Ca2+ import from outside the cell.*

      (Caveolar endocytosis does not protect cells from aerolysin) - Figure 5C: What is the purpose of using HeLa cells as a control? *

      We included HeLa cells to demonstrate the toxin was active and to rule out batch-to-batch variability as one interpretation of the reduced killing of differentiated 3T3-L1 cells.

      * - "..with Alexa Fluor 647 conjugated pro-aerolysin K244C" - this should be introduced earlier as it was initially mentioned in Supp. Figure 3C. *

      We have now introduced this earlier at line 190, instead of 300

      * - Murine fibroblasts were used earlier (Figure 1). Following from this result (where the WT can be used as a positive control), can MEFs be used instead of adipocytes to see whether caveolar endocytosis plays any role in cellular resistance? *

      The 3T3-L1 cells are murine fibroblasts prior to differentiation. Since they can also be differentiated into adipocytes, we used them instead of MEFs. The other reasons we used them include the availability of Cavin knockout cells, and the extensive caveolae present in adipocytes. We included analysis of 3T3-L1 prior to differentiation them in Fig 5B.

      * - Further comment on the increased resistance of K5 knockout would benefit on the mechanism of aerolysin-mediated cytolysis. *

      We agree further characterization of this line would be interesting in the future. At the present, however, any further comment would be speculative on our part. Since the resistance was not replicated in the second CRISPR line, we suspect it is either an unexpected mutation(s) in the cell line that arose during routine cell culture, or off-target effect(s) from the CRISPR used to generate the line.

      * (Annexins minimally resist aerolysin) - Supplementary video V3 - it seems that annexin A6 is recruited to the membrane, to a greater extent (and also quicker) than SLO. This suggests that annexin recruitment is a cellular response against aerolysin challenge. *

      We agree with the reviewer that annexins are recruited to the membrane during repair. However, individual knockdown did not enhance death. This is one reason we believe functional studies (i.e. cytotoxicity) are necessary when studying the cell biology of repair events. Recruitment of the protein, and it promoting repair may be two different things.

      In V3, three of the SLO-challenged cells have translocated by the time focus is restored. In contrast, the first aerolysin cells translocate ~10 min. One complicating factor is that A6 cycles back off the membrane with the SLO challenge.

      * o SLO also shows A6 recruitment (arrows pointed). However, supplementary figure 6B does not clearly illustrate this. *

      Given the 45 min time scale, the rapid initial membrane enrichment is hard to see on the graph.

      * - As annexin A1 is sensitive to calcium, further comment on the significance of intracellular/extracellular calcium in annexin A1 recruitment and aerolysin challenge would explain observations in Figure 4A. *

      We have updated the manuscript, line 242 to include annexins and dysferlin as Ca2+-binding proteins in our discussion of intracellular calcium.*

      (Patch repair protects cells from aerolysin) - Supplementary video V4 - the intensity decreases for the inactive mutant; is this due to lysis? *

      We included TO-PRO in the experiment to rule out lysis. Since the cells remain in focus, we interpret the lack of TO-PRO to indicate no cellular lysis.

      *- The next paragraph sounds like a contradiction: "GFP-dysferlin localized to the plasma membrane and vesicles independently of extracellular Ca2+ (Fig 8C D, Video V5) o Followed by "To study the Ca2+ dependency of dysferlin, we removed extracellular Ca2+ with 2 mM EGTA and challenged with sublytic toxin doses...found less depletion of dysferlin from cytosol". *

      We thank the reviewer for pointing out our unclear language. In the second section, we intended to refer to dysferlin positive vesicles. We have rephrased the manuscript (lines 388-395) to clarify that we are focused on Ca2+-dependence of vesicle fusion, not steady-state.*

      (Methods) - Table 1: The values presented in the methods section are, overall, confusing and require clarification. *

      We have added Fig S10, and discussion of toxin activity and purity in the methods (lines 634-641) to provide further clarity on toxin activity.

      * o 10-fold difference in SLO and PFO WT - do the authors think this might change the interpretation between different figures? *

      We do not. The reason is that we changed the membrane affinity between SLO and PFO (Ray 2018), and this switches the properties of the respective toxins without changing their yields.

      * o Understood how the haemolytic activity was calculated (referred to work in 2012), but how was the haemolytic unit originally derived? *

      It was derived as a measure of activity for toxins by determining the EC50 in RBCs for a given toxin. Since species type of RBC and other factors can change the reported activity, we have normalized to using human red blood cells. This lets us assay human-specific toxins like ILY along with other toxins.

      * o How were these values (from table 1) derived to toxin concentrations used for killing nucleated cells? *

      Full discussion of our assay was recently published in Haram et al 2022 PMID: 36373947. For the cytotoxicity assays, we use the hemolytic activity. Suppose from Table 1, the toxin stock is 1.5 x10^5 HU/mL. Then to prepare a 2x working toxin stock, we dilute the toxin to 4 x10^3 HU/mL (this is a 1 in 37.5 dilution). To get the range of concentrations used in the dose response curve, we perform a 2-fold serial dilution. Finally we mix equal volumes of toxin and cells, giving us the final 1x toxin activity (2 x10^3 HU/mL for the highest concentration in this example).

      * o Therefore, an EC50 haemolytic curve showing the activities for all toxins would greatly facilitate in understanding the derivation of values for table 1.*

      The hemolytic unit already incorporates the EC50 hemolytic curve. 1 HU is the EC50 of the toxin in the human RBCs.

      * - Flow cytometry assay: What is meant by gating out the debris? And would debris also contribute to the count in dead cells? *

      We illustrate our gating strategy in Fig S1. The debris falls in the front left corner of the plot, and includes electronic noise, non-cellular debris and cellular fragments. Since one cell could give rise to multiple pieces of debris, we exclude the debris from analysis.

      * o What was added as the high PI control? *

      In Fig S1A, the high dose of toxin was used for maximal killing. In our cell populations, there is a low level (2-5%) of dead cells that serve as a control for PI staining. In the past, we’ve used 0.01% triton to validate permeabilization of the cells. We have also compared PI uptake with MTT assays (Keyel et al 2011, Ray et al 2018) to confirm that the PIhigh cells are dead.

      *Elaborating reviewer #2's comment 7 regarding the addition of EDTA : with respect to measuring the binding if fluorescently labelled aerolysin, how can the authors differentiate between full functional pores versus prepores/incomplete pores? *

      This requires electron microscopy, which is the beyond the scope of our current study. However, prior work and Fig 2D show that aerolysin forms pores without the need for Ca2+ (see next point).

      How else can the authors validate whether aerolysin remains functional in the presence of EDTA?

      Prior structural studies suggest that Ca2+ is not required for aerolysin pore formation. For example, Iacovache et al (2011) PMC3136475 induce oligomerization with low salt and pH 2+. Cryo-EM from the same group (Iacovache et al 2016 PMID: 27405240), showed pore formation under similar conditions.

      In Fig S3, aerolysin kills in the presence of EGTA at higher concentrations, suggesting that it can form pores when EGTA is present. Also, in Fig 2D, we used Tyrode’s buffer, which was made without Ca2+ or EGTA. We added the indicated amounts of Ca2+ in, and observed a reduction in lysis at low [Ca2+]. This argues against EGTA interfering with toxin oligomerization/pore formation because EGTA was not present, and the toxin still failed to kill.

      We have updated the manuscript (lines 203-205) to emphasize this point.

      Significance

      *While the work has investigated in-depth cellular resistance mechanisms, the significance and benefits of this study are unclear. For example, the authors have used different human cell lines to dissect how these cells are affected by different pores but have not stated the significance and potential benefit of studying these cell lines. Further elaboration in this aspect may increase the relevance of the study, to an audience who is interested in the field of infection and disease. *

      We have updated our significance to better convey our advance, which is explained on lines 99-102, 128, 519-525. We also added benefits of testing the cell lines chosen on lines 167-168, and 277-278. We plan to add muscle cells to address the dysferlin points, which has relevance to necrotizing soft-tissue infections.

      Description of analyses that authors prefer not to carry out

      Not applicable

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This body of work by Thapa & Keyel explores the differences in cellular resistance mechanisms between two different pore families (aerolysin versus CDCs). Herein, the authors were able to elucidate the toxin activities across a variety of different nucleated cells, using the haemolytic assay as a reference for normalising activity. Their findings revealed that, in general, aerolysins were relatively more potent than CDCs at damaging certain nucleated cell lines. Furthermore, the authors performed an exploration of different resistance mechanisms, including MEK-dependent repair, annexins, and patch repair by dysfurlin. The work provides some supporting evidence that patch repair is the main mechanism that cells deploy to prevent aerolysin-mediated cytotoxicity. Overall, the amount of work that was put in to craft the manuscript was impressive and the manuscript showed potential prospects in further investigating 1) mode of aerolysin killing in nucleated cells and 2) the role of patch repair and function of dysferlin in cellular resistance against aerolysin.

      Major comments

      In the first instance, the authors use a method of assaying the specific lytic activity of aerolysin in comparison to a number of different CDCs. Whilst it is acknowledged that these methods have been published in peer-review papers previously (e.g. Ray et al., Toxins, 2018), it would be great to have more information of how the specific activity is derived. Currently there is a convoluted method that makes a number of assumptions such as, but not limited to, 1) the number of dead cells measured in the FACS experiments is proportional to the activity of the different classes of PFPs however the authors do not show how they account for PFPs leading to loss of cells into debris which would involve a total cell count and 2) how the inflection or linear point is identified on individual experiments (e.g. Supp. Fig. 1B, 2A, 2B, 3A, 3B to name a few) and how reliable these points are (e.g showing the data points with model sigmoidal (?) curve and corresponding R values).

      Furthermore, the batch-to-batch variability of protein samples presented in table 1 may be an issue where inactive but folded protein can affect the formation of homo-oligomer pores so more effort to reduce the effects of batch variation would be integral to the foundation of this paper. Given that aerolysin has a very different action on cells then this new characterisation should be provided regardless of what has been previously published by the authors on the activity of CDCs on the cells.

      • Can the authors provide the raw data for the total FACS observations (scatterplot for all events) and show that there is no significant loss of cells? Or at least there is accountability of the cells?
      • Can the authors provide more information about how the linear regression on Supp. Fig. 1B and other experiments showing the model sigmoidal curve performed such that this work is more reproducible?

      The SEMs of some data points (specific lysis LC50 scatterplots, for e.g. Fig. 2C, 4A, 4C, 8A and fMAX plots, for e.g. Fig. 3B) may not be apparently representative of the skew (e.g. and individual values (including outliers). A clarification of the statistical analysis behind the results may benefit in a clearer understanding of how the SEMs were calculated and presented in the main figures. Also, further elaboration on the meaning of the lettering in the scatterplots (denoted as a, b, c etc.) across the main figures may help improve the interpretation of the data.

      Secondly, the authors present interesting results on the significance of Ca2+ on aerolysin's mechanism behind lytic activity and introduces dysfurlin-mediated patch repair as the primary cellular resistance mechanism against aerolysin mediated lysis. Results from Figure 2-4, indicate that extracellular Ca2+ plays a role in aerolysin's function and cell lysis (aerolysin triggers influx of extracellular Ca2+). However, the results presented in figure 8 suggest an impairment of dysferlin translocation from the cytosol to the plasma membrane upon removal of extracellular Ca2+. If this were the case, wouldn't dysferlin impairment sensitise cells to aerolysin? Thus, in these sets of experiments it seems that Ca2+ is a confounding factor.

      • Can the authors further elaborate on how the function of dysferlin in protecting cells against aerolysin contrasts to how aerolysin kills cells?

      Finally, it is also interesting to see that cells deploy different resistance mechanisms between different families of pores. In saying that, the usage of CDCs seems to be inconsistent between each set of results. For example, intermedilysin (ILY) was used in the siRNA knockdown experiments but not in others such as Ca2+ influx assays, while PFO was only used for the initial set of results. A comment on this would benefit in understanding the rationale for selecting certain CDCs for each set of experiments.

      Minor comments

      Results

      (Nucleated cells are more sensitive to aerolysin and CDCs)

      • A statement of the EC50 values of aerolysin and CDCs from the haemolytic assays would be beneficial to compare activities between the two pores.
      • Figure 1A: As stated in the introduction, pro-aerolysin exists as a precursor that is functionally inactive unless activated by trypsin, furin or potentially other proteases. It would benefit the reader if an explicit statement were made about this activity and how it may come about in HeLa and 3T3 cells. Why is pro-aerolysin not shown in the Casp 1/11-/- BMDM cells?
      • Figure 1C: It was stated that "Casp 1/11 -/- Mo were ~100 fold more sensitive to pro-aerolysin and aerolysin compared to PFO and SLO" but did not show the activity for pro-aerolysin in these cells.
      • Supp fig 1E: Shouldn't 24 hr incubation of aerolysin to HeLa cells result in 100% specific lysis?

      (Delayed calcium flux kills aerolysin-challenged cells)

      • What is the intracellular concentration of K+ normally in cells? Similarly, what is the intracellular concentration of Ca2+?
      • Figure 2C: Based on the description in the methods and results, both buffers are supplemented with 2 mM Ca2+ but one buffer (RPMI) shows more killing with SLO and ILY. Does this mean that both buffers contain 2 mM CaCl2? If so, what are the other potential reasons why one buffer enabled greater potency in CDCs?
      • Figure 3H: The data for aerolysin (WT) would greatly benefit for comparison to the inactive mutant (and indicate the sustained Ca2+ increase).
      • Supplementary Video V1: The addition of Triton X-100 permeabilises cells; however, this wasn't evident in (A).
      • Video V2: Similar to previous comment on Supplementary Video V1 (for B).

      (Calcium influx does not activate MEK-dependent repair)

      • Figure 4A: Effective ionic concentration inside and outside cell is increased (if intracellular Ca2+ becomes chelated); therefore, Ca2+ may enter the cell by passive diffusion or transport by other intrinsic Ca2+ channels.

      (Caveolar endocytosis does not protect cells from aerolysin) - Figure 5C: What is the purpose of using HeLa cells as a control? - "..with Alexa Fluor 647 conjugated pro-aerolysin K244C" - this should be introduced earlier as it was initially mentioned in Supp. Figure 3C. - Murine fibroblasts were used earlier (Figure 1). Following from this result (where the WT can be used as a positive control), can MEFs be used instead of adipocytes to see whether caveolar endocytosis plays any role in cellular resistance? - Further comment on the increased resistance of K5 knockout would benefit on the mechanism of aerolysin-mediated cytolysis.

      (Annexins minimally resist aerolysin)

      • Supplementary video V3 - it seems that annexin A6 is recruited to the membrane, to a greater extent (and also quicker) than SLO. This suggests that annexin recruitment is a cellular response against aerolysin challenge. o SLO also shows A6 recruitment (arrows pointed). However, supplementary figure 6B does not clearly illustrate this.
      • As annexin A1 is sensitive to calcium, further comment on the significance of intracellular/extracellular calcium in annexin A1 recruitment and aerolysin challenge would explain observations in Figure 4A.

      (Patch repair protects cells from aerolysin)

      • Supplementary video V4 - the intensity decreases for the inactive mutant; is this due to lysis?
      • The next paragraph sounds like a contradiction: "GFP-dysferlin localized to the plasma membrane and vesicles independently of extracellular Ca2+ (Fig 8C D, Video V5) o Followed by "To study the Ca2+ dependency of dysferlin, we removed extracellular Ca2+ with 2 mM EGTA and challenged with sublytic toxin doses...found less depletion of dysferlin from cytosol".

      (Methods)

      • Table 1: The values presented in the methods section are, overall, confusing and require clarification.
        • 10-fold difference in SLO and PFO WT - do the authors think this might change the interpretation between different figures?
        • Understood how the haemolytic activity was calculated (referred to work in 2012), but how was the haemolytic unit originally derived?
        • How were these values (from table 1) derived to toxin concentrations used for killing nucleated cells?
        • Therefore, an EC50 haemolytic curve showing the activities for all toxins would greatly facilitate in understanding the derivation of values for table 1.
      • Flow cytometry assay: What is meant by gating out the debris? And would debris also contribute to the count in dead cells?
        • What was added as the high PI control?

      Referees cross-commenting

      Elaborating reviewer #2's comment 7 regarding the addition of EDTA : with respect to measuring the binding if fluorescently labelled aerolysin, how can the authors differentiate between full functional pores versus prepores/incomplete pores? How else can the authors validate whether aerolysin remains functional in the presence of EDTA?

      Significance

      The work presents a foundation to further investigate into the mechanism of aerolysin function, following the discovery of the role of extracellular Ca2+ in its activity. As aforementioned, the role of dysferlin in resisting aerolysin also has potential, but the limitations of this work were discussed including the absence of performing a dysferlin knockout, although performing this experiment may help to strengthen the current finding.

      While the work has investigated in-depth cellular resistance mechanisms, the significance and benefits of this study are unclear. For example, the authors have used different human cell lines to dissect how these cells are affected by different pores but have not stated the significance and potential benefit of studying these cell lines. Further elaboration in this aspect may increase the relevance of the study, to an audience who is interested in the field of infection and disease.

      Section for special notes to the editor:

      My major area of expertise and contribution to this paper is in the analysis and interpretation of activity (lytic) assays.

    1. Author Response

      Reviewer #1 (Public Review):

      Neuronal tissues are very complex and are composed of a large number of neuronal types. With the advent of single-cell sequencing, many researchers have used this technology to generate atlases of neuronal structures that would describe in detail the transcriptome profiles of the different cell types. Along these lines, in this manuscript, the authors present single-cell transcriptomic data of the fruitless-expressing neurons in the Drosophila male and female central nervous systems. The authors initially compare cell cluster composition between male and female flies. They then use the expression of known markers (such as Hox genes and KC neuronal markers) to annotate several of their clusters. Then, they look in detail at the expression of different terminal neuronal genes in their transcriptomic data: first, they look into neurotransmitter-related genes and how they are expressed in the fruitless-expressing neurons; they describe in detail these populations that they then verify the expression patterns by looking into genetic intersections of Fru with different neurotransmitter-related genes. Then, they look at Fru-neurons that express circadian clock genes, different neuropeptides and neuropeptide receptors, and different subunits of acetylcholine receptors. Finally, they look into genes that are differentially expressed between male and female neurons that belong to the same clusters. They find a large number of genes; through GO term enrichment analysis, they conclude that many IgSF proteins are differentially expressed, so they look into their expression in Fru-neurons in more detail. Finally, they compare transcription factor expression between male and female neurons of the same cluster and they identify 69 TFs with cluster-specific sex-differential expression.

      In general, the authors achieved their goal of generating and presenting a large and very useful dataset that will definitely open a large number of research avenues and has already raised a number of interesting hypotheses. The data seem to be of good quality and the authors present a different aspect of their atlas.

      The main drawback is that many of the analyses are very superficial, resulting in the manuscript being handwavy and unsupported. The manuscript would benefit by reducing the number of "analyses" to the ones that are also in vivo validated and by discussing some of the drawbacks that are inherent to their experimental procedure.

      scRNA-seq studies generate atlases that are descriptive, by their nature. Therefore, we decided to keep interesting gene-expression analyses in the paper that are based on the scRNA-seq results, especially for the discoveries that point to exciting avenues for future pursuit. We reduced the text as suggested.

      1) The authors treat their male, female, and full datasets as three different samples. At the end of the day, these are, for the most part, equivalent neuronal types. The authors should decide to a) either only use the full dataset and present all analyses in this, or b) give a clear correspondence of male and female clusters onto the full ones.

      In this paper, all the analyses presented are on the full data set, with some links to the male or female data sets included. We now make clear that the full data set is the focus of the paper (lines 137-141). We provide the male and female data sets for our reader, with the individual Seurat objects uploaded to GEO, to make it easy for the reader to do follow-up analyses using the same criteria we used. We think this is helpful for our research community. We also compare the male and female clusters to the full data set using ClustifyR and report which clusters in the male or female data set analyses correspond to those in the full data set (Source data 2), as suggested by the reviewer, though ClustifyR has some limitations based on our evaluation of this tool for other annotations (see below).

      2) Most of their sections are heavily reliant on marker genes. In fact, in almost every section they mention how many of their genes of interest are marker genes. This depends heavily on specific cutoffs, making the conclusions fragile. Similarly, GO terms are used selectively and are, in many cases, vague (such as “signaling”, “neurogenesis”, “translation”).

      We evaluated marker genes, as those provide molecular identities to the clusters, given by definition they are significantly more highly expressed in a specific cluster, compared to all clusters. We used a Wilcox rank sum test with the following parameters in Seurat: (min.pct=0.25, logfc.threshold=0.25), which resulted in all called marker genes having p values < 0.05. We did not use a more stringent criteria given that most of the marker gene analyses are descriptive, and it is important to capture a broad range of genes. Our criteria are similar to Ma et al. 2021 (PMID: 33438579) and Corrales at al. 2022 (PMID: 36289550). In the text, we refer to the top 5 marker genes in several analyses, though these marker genes have a much more significant enrichment. We agree with the reviewers’ criticisms regarding the cluster-specific GO-term analyses in the text and those have been removed from the manuscript.

      3) A few of the results are not confirmed in vivo. The authors should add a Discussion section where they discuss the inherent issues of their analyses. Are there clusters of low quality? Are there many doublets?

      We have added discussion around these topics to the conclusions section of the manuscript and the results, when appropriate.

      On the same note, their clusters are obviously non-homogeneous (i.e. they house more than one cell types. This could obviously affect the authors' cluster-specific sex-differential expression, as differences could also be attributed to the differential composition of the male and female subclusters.

      We discuss this potential limitation in the discussion of sex differences in gene expression (Lines 959-961).

      4) Immunostainings are often unannotated and, in some cases especially in the Supplement, they are blurry. The authors should annotate their images and provide better images whenever possible.

      We appreciate this being pointed out and have provided higher resolution figures. The issue was we exceeded the manuscript submission file size on initial submission.

      5) I believe that the manuscript would benefit significantly by being heavily reduced in size and being focused on in vivo rigorously confirmed observations.

      We have addressed this comment by removing some of the analyses.

      Reviewer #3 (Public Review):

      This paper uses single-cell transcriptome sequencing to identify and characterize some of the neuronal populations responsible for sex-specific behaviour and physiology. This question is of interest to many biologists, and the approach taken by the authors is productive and will lead to new insights into the molecular programs that underpin sexually dimorphic development in the CNS. The dataset produced by the authors is of high quality, the analyses are detailed and well described, and the authors have made substantial progress toward the identification and characterization of some of the neuron populations. At the same time, many other cell types whose existence is suggested by this dataset remain to be identified and matched to specific neuron populations or circuits. We expect the value of this dataset to increase as other groups begin to follow up on the data and analyses reported in this paper. In general, the value of this paper to the field of Drosophila neurobiology will be high even if it is published in close to its present form. On the other hand, the current manuscript does not succeed in presenting the key take-home messages to a broader audience. A modest effort in this direction, especially re-writing the Conclusions section, will greatly enhance the accessibility and broader impact of this paper.

      While the biological conclusions reached by the authors are generally robust and of high interest, we believe that some conclusions are not sufficiently supported by the analyses that have been performed so far and need to be reexamined and confirmed. A major question concerns the authors' ability to distinguish a shared cell type with sex-biased gene expression from a pair of closely related, sex-limited cell types. There appear to be many cases that fall into this grey area, and the current analysis does not provide an objective criterion for distinguishing between sex-specific and sexually dimorphic clusters. Below we suggest some technical approaches that could be used to examine this issue. A second problem, which we do not believe to be fatal but that needs to be discussed, concerns potential differences in developmental timing and cell cycle phase between males and females, and how these differences might impact the inferences of sexual dimorphism in cell numbers and gene expression. Finally, we identify several areas, including the expression of transcription factors in different neuronal populations, that we believe could be described in more biologically insightful ways.

      For our review, we focus on three levels of evaluation:

      1). Is the dataset of high quality, useful to a large number of people, well annotated, and clearly described?

      The data appear to be high quality. The authors use reasonable neuronal markers to infer that 99% of their cells are neuronal in origin, suggesting extremely low levels of contamination from non-neuronal cells. Moreover, the gene/UMIs detected per cell are high relative to what has been reported in previous Drosophila scRNA-seq neuron papers (e.g. Allen et al., 2020). The cluster annotations are incomplete - which is not surprising, given the complexity of the cell population the authors are working with. 46 of the 113 clusters in the full dataset are named based on published expression data, gene ontology enrichments of cluster marker genes, and overlap with other CNS single cell datasets. This leaves rather a lot outstanding. It is probably unrealistic to aim for a 100% complete annotation of this dataset. But if we're thinking about how this dataset might be used by other researchers, in most cases the validation that a given cluster corresponds to a real, distinct neuron subpopulation will be left to the user.

      A major comment we have about the quality of the dataset relates to how doublets are identified and dealt with. The presence of doublets, an unavoidable byproduct of droplet-based scRNAseq protocols (like the 10x protocol used by the authors), could affect the clustering or at least bias the detection of marker genes. In large clusters, one might expect the influence of doublets on marker gene detection to be diluted, but in smaller clusters it could cause more significant problems. In extreme cases, a high proportion of doublets can produce artifactual clusters. The potential for problems is particularly high in cases where the authors identify cells with hybrid properties, such as clusters 86 and 92, which the authors describe as being serotonergic, glutamatergic, and peptidergic. Currently, the authors filter out cells with high UMI/gene counts, but it's unclear how many are removed based on these criteria, and cells can naturally vary in these values so it is not clear to us whether this approach will reliably remove doublets. That said, we acknowledge that by limiting their 'FindMarkers' analysis to genes detected in >25% of cells in a cluster the authors are likely excluding genes derived from doublets that contaminate clusters in low (but not high) numbers. We think it would be useful for the authors to report the number of cells that are filtered out because they met their doublet criteria and compare this value to the number of expected doublets for the number of cells they recovered (10x provides these figures). We would also recommend that the authors trial a doublet detection algorithm (e.g. DoubletFinder) on the unfiltered datasets (that is, unfiltered at the top end of the UMI/gene distribution). Does this identify the same cells as doublets as those the authors were filtering out?

      We appreciate this suggestion and have now added results from the doublet detection algorithm, DoubletFinder to our manuscript. Please see above response in editorial comments. We provide a table in Figure 1 – supplement 1 that indicates the number of cells removed by our filtering criteria: We acknowledge that there may be additional doublets in our data set that were not removed in our filtering criteria in the discussion (Lines 1098-1102) and have also provided a new table in Source data 2 indicating the number of potential doublets identified by DoubletFinder that are present in each cluster.

      2). What is the value of this study to its immediate field, Drosophila neurobiology? Are the annotation and analysis of specific cell clusters as precise and insightful as they could be? Has all the most important and novel information been extracted from this dataset?

      This is the part that we are least qualified to assess, since we, unlike the authors, are not neurobiologists. We hope some of the other referees will have sufficient expertise to evaluate the paper at this level.

      One thing we noticed (more on that in Part 3) is that the authors rely on JackStraw plots and clustree plots to identify the optimal combination of PCs and resolution to guide their clustering. This represents a relatively objective way of settling on clustering parameters. However, in a number of the UMAPs it looks like there are sub clusters that go undiscussed. E.g. in Fig. 2E clusters 1 and 3 are associated with smaller, distinct clusters and the same is true of clusters 2 and 6 in Fig 4b. Given that the authors are attempting to assemble a comprehensive atlas of fru+ neurons, it seems important for them to assess (at least transcriptomically) whether these are likely to represent distinct subpopulations.

      We appreciate these comments and address this above in the editorial comments section.

      3). How interesting, and how accessible is this paper to people outside of the authors' immediate field? What does it contribute to the "big picture" science?

      Here, we think the authors missed an important opportunity by under-utilizing the Conclusions section. The manuscript has a combined "Results and Discussion" section, where the authors talk about their identification and analysis of specific cell clusters / cell types. Frankly, to a non-specialist this often reads like a laundry list, and the key conclusions are swamped by a flood of details. This is not to criticize that section - given the complexity and potential value of this dataset, we think it is entirely appropriate to describe all these details in the Results and Discussion. However, the Conclusions section does not, in its present form, pull it all back together. We recommend using that section to summarize the 5-8 most important high-level conclusions that the authors see emerging from their work. What are the most important take-home messages they want to convey to a developmental biologist who does not work on brains, or to a neurobiologist who does not work on Drosophila? The authors can enhance the value of this paper by making it more interesting and more accessible to a broader audience.

      We appreciate this suggestion and made changes to the conclusions section to address this comment.

    1. Author Response

      Reviewer #3 (Public Review):

      Yamada et al utilizes the full strength of Drosophila neural circuit approaches to investigate second-order conditioning. The new insights into the mechanisms of how a learned cue can act as reinforcement are relevant beyond the fly field and have the potential to spark broad interest. The main conclusions of the authors are justified and the experiments, to my understanding, are well done.

      Some minor aspects must be addressed. To avoid misunderstandings a clear distinction should be made between those experiments using real world sugar and those using artificial activation of dopamine neurons as reward. For example, the proposed teacher - student model is mostly based on the work established with artificial activation.

      We split Figure 1 and made two separate figures. The new Figure 1 displays experiments with only real sugar or optogenetic activation of sugar receptor neurons (new data), whereas the new Figure 2 shows mostly experiments with direct DAN activations. This new figure arrangement makes a clear distinction between experiments with sugar and DAN activation, and allows readers to compare them more easily. We also modified the second paragraph of the discussion to clarify this point.

      To emphasize the generality of the model, it might help to provide some further evidence using real world sugar approaches, especially since the only known sugar-reward driven plasticity is reported in the student (g5b`2a) but not the teacher compartments. In this line, it would be useful to extend the functional interference used during the sugar experiments beyond the a1 compartment.

      In response to the reviewer’s comment, we added new data in Figure 2D to show that blocking PAM-DANs in γ4, γ5 and β′2a compartments impairs second-order conditioning following odor-sugar first-order conditioning. In contrast to blocking α1 DANs, blocking those non-α1 PAM-DANs did not impair one-day first-order memory (Figure 2D), which further strengthens our model of differential requirement of compartments for first-order and second-order memory formation.

      We think transient blocks of individual DAN cell types during second-order conditioning after odor-sugar conditioning will be informative to map second-order memories to specific compartments in naturalistic settings. For the reasons detailed above, however, we will need to develop a new way of transient purturbation for that.

      We would also point out that, to our knowledge, sugar-reward-driven plasticity has not been fully demonstrated in MBON-γ5β′2a. Owald et al., 2015 Neuron (10.1016/j.neuron.2015.03.025) showed a reduced CS+ odor response after odor-sugar conditioning in MBON-b′2mp (their Fig 3). However, they could not investigate the plasticity of MBON-γ5β′2a because the magnitude of odor response was too low (their Figure S3).

      Further, general statements about the compartments, for example for g5 and a1, might need adjustment since the tools used, the respective driver lines, often don't label all dopamine neurons in one specific compartment. In fact, functional heterogeneity among dopamine neurons innervating the g5 compartment have been recently established (sugar-reward, extinction) and might apply here.

      To clarify the point that we are manipulating a subset of DANs in each compartment, we added “cell count” information in Figure 2A. Also, we made Figure 4-figure supplement 2 to show which subtypes of DANs are connected with SMP108.

      Lastly, I would like to recommend that the authors discuss alternative feedback pathways that might serve similar or parallel functions.

      Despite these minor points, the study is impressive.

      Figure 4C shows several candidate interneurons that may have similar functions as SMP108. For instance, CRE011 may acquire enhanced response to reward-predicting odor as an outcome of reduced inhibition from MBON-γ5β′2a, and send excitatory inputs to DANs.

      In Figure 4-figure supplement 3, we made additional scatter plots to visualize other outlier cell types in terms of their connectivity with MBONs and DANs.

    1. The development of larp in the United States followed a trajectory common to larp across theworld. The currently dominant forms of larp developed out of TRPGs, such as D&D. How-ever, its evolution intersects with other practices as well: theater, parlor games, simulations likethe Model United Nations, and, especially, historical reenactment. The Society for CreativeAnachronism (SCA), a medieval reenactment organization started in 1966, was a significantinfluence, but United States Civil War reenactment was also important. It appears that themodern form of larp emerged in many places, nearly simultaneously across the country

      I wonder what the main differences between North American Larping and the type that occurs abroad. I rarely think about activities like this on a global scale, and I mainly usually focus on what we do domestically. That is a goal of mine, to learn more about how activities and games may differ depending on where they're located/being played.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all the Reviewers for their highly constructive reviews. Below, I have pasted the Reviewer’s comments in black and my replies in red, for easy reading.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In their study, Zhu and colleagues study how the centrosome proteins Spd-2 and Cnn in Drosophila recruit gamma-tubulin complexes to centrosomes, which is an important step in mitotic spindle formation. The authors make use of mutant flies and RNAi and find that the two factors Spd-2 and Cnn together are responsible for mitotic centrosomal accumulation of gamma-tubulin. By inactivating Spd-2 or Cnn separately, the authors show that Cnn appears to recruit the large share of mitotic gamma-tubulin pool by its CM1-domain. Interestingly, this involves only gamma-TuSCs (subcomplexes of gamma-TuRC) and not gamma-TuRCs. A smaller pool is recruited by Spd-2, and this pool depends on gamma-tubulin complex proteins that are only present in pre-assembled, complete gamma-TuRCs. This suggests that Drosophila makes microtubule nucleation templates in two ways. First, as in yeast, by direct recruitment of gamma-TuSCs to mitotic centrosomes, where additionally oligomerization needs to happen. And second, by recruitment and activation of preassembled gamma-TuRCs. Inactivation of both Cnn- and Spd-2 pathways abolishes mitosis-specific gamma-tubulin recruitment, resulting in low, but not complete loss of gamma-tubulin at centrosomes. The authors show that these low-gamma-tubulin centrosomes are still able to organize microtubules, but these microtubules have different dynamicity. Inspired by existing literature in flies and other model organisms, the authors identify Msps/Xmap215 as an important nucleation factor in this scenario.

      Major points:

      1) The authors use fly embryos with mutant Grip71, Grip75 and Grip163 alleles, which are central to the study. Most conclusions are based on the assumption that some mutants contain only gamma-TuSC, whereas wildtype cells contain a mix of gamma-TuSC and gamma-TuRC. It would be important to show sucrose gradient analyses of extracts to confirm the expected presence/absence of gamma-TuSC/gamma-TuRC.

      We agree that it would be nice to perform sucrose gradient analysis of γ-tubulin mutants in different mutant backgrounds, but unfortunately this is not as easy as the Reviewer may think. To clarify, we have used larval brain cells (not embryos) for the analysis of γ-tubulin recruitment to centrosomes. We cannot use embryos because most mutant combinations are lethal beyond larval stages, meaning that mutant adult females are not available for embryo collection (embryos use maternally loaded proteins and mRNA and so it is the genotype of the mother that is important). Performing sucrose gradients with larval brain extracts would be extremely challenging, if not impossible, because a relatively large amount of starting material is required for sucrose gradient centrifugation, and manually dissecting and preparing hundreds if not thousands of larval brains is unrealistic, especially as mutant larvae are rare.

      Given that we are not able to carry out these experiments, we have modified the text to include the caveat that some higher-order complexes may partially form in certain mutants. For example, in relation to the ability of Grip71 to recruit γ-TuSCs in cnn,grip75,grip163 mutants, the text now reads: “Thus, Spd-2 appears to recruit a very small amount of γ-TuSCs (which may, or may not, be present as larger assemblies due to an association with Grip128-γ-tubulin) via Grip71 (i.e. the recruitment that occurs in cnn,grip75GCP4,grip163 GCP6 cells), but its recruitment of γ-tubulin complexes relies predominantly on the GCP4/5/4/6 core.”

      Nevertheless, the most important conclusion is that Cnn can recruit γ-TuSCs independent of pre-formed cytosolic γ-TuRCs and this is based on the finding from one particular mutant – the spd-2,grip71,grip75,grip128,grip163 mutant – where γ-tubulin levels at mitotic centrosomes are only very slightly reduced compared to single spd-2 mutants (Figure 1B). This conclusion is based on three assumptions that we argue are all very reasonable:

      Assumption 1: flies depleted of 2, if not all 3, GCP4/5/4/6 core components (grip75,grip128,grip163) do not have a functioning GCP4/5/4/6 core. The Grip75GCP4 allele is a null mutant and is combined with a deficiency chromosome that depletes the whole Grip75GCP4 gene, and the Grip163GCP6 allele is a very strong depletion allele and is also combined with a deficiency chromosome that depletes the whole Grip163GCP6 gene. Even if the efficiency of the RNAi against Grip128GCP5 were poor, it would be hard to form a GCP4/5/4/6 core without Grip75GCP4 and in the near absence of Grip163GCP6 (which together provide 3 of the 4 molecules of the complex, including the outermost ones).

      Assumption 2: cells depleted of the GCP4/5/4/6 core cannot assemble cytosolic γ-TuRCs. This is reasonable given that even individual depletion of Grip75GCP4, Grip128GCP5 or Grip163GCP6 already strongly reduces the presence of cytosolic γ-TuRCs (Vogt et al., 2006; Vérollet et al., 2006). In spd-2,grip71,grip75,grip128,grip163 mutant brain cells, the only γ-TuRC protein not targeted, except for the γ-TuSC components, is Actin (Mozart 1 is expressed only in testes (Tovey et al., 2018) and Mzt2 does not exist in flies). In Xenopus and humans, Actin appears to facilitate γ-TuRC assembly via interactions with a GCP6-N-term-Mzt1 module, and so it would be unlikely to allow γ-TuSC assembly into higher-order complexes without GCP6 (i.e Grip163GCP6) and Mzt1.

      Assumption 3: Were Cnn not able to recruit γ-TuSCs independently of pre-formed γ-TuRCs, we would expect a much stronger reduction in γ-tubulin recruitment to centrosomes in spd-2,grip71,grip75,grip128,grip163 mutant cells. It is reasonable to assume, even without sucrose gradients, that the assembly of γ-TuRCs is strongly impeded in spd-2,grip71,grip75,grip128,grip163 mutant cells. Nevertheless, γ-tubulin is still recruited to centrosomes at ~66% compared to ~77% in spd-2 single mutant cells. While statistically significant (as stated in the updated manuscript), this reduction would surely be much greater were Cnn not able to recruit γ-TuSCs.

      In the absence of experimental data, we have therefore made these arguments in the main text by making some text modifications and adding a new paragraph, as follows:

      *“….the centrosomes in spd-2,grip71,grip75GCP4,grip128GCP5-RNAi,grip163GCP6 mutant cells had ~66% of the γ-tubulin levels found at wild-type centrosomes, only slightly lower than ~77% in spd-2 mutants alone (Figure 1A,B). Thus, the recruitment of γ-tubulin to mitotic centrosomes that occurs in the absence of Spd-2, i.e. that depends upon Cnn, does not appear to require Grip71 or the GCP4/5/4/6 core. *

      While we cannot rule out that residual amounts of GCP4/5/4/6 core components in spd-2,grip71,grip75GCP4,grip128GCP5-RNAi,grip163GCP6 mutant cells may support a certain level of γ-TuSC oligomerisation in the cytosol, we favour the conclusion that Cnn can recruit γ-TuSCs directly to centrosomes in the absence of the GCP4/5/4/6 core for several reasons: First, the alleles used for grip71 and grip75GCP4 are null mutants, and the allele for grip163GCP6 is a severe depletion allele (see Methods), and even individual mutations in, or RNAi-directed depletion of, Grip75GCP4, Grip128GCP5 or Grip163GCP6 are sufficient to strongly reduce the presence cytosolic γ-TuRCs (Vogt et al., 2006; Vérollet et al., 2006). Second, spd-2,grip71,grip75GCP4,grip128GCP5-RNAi,grip163GCP6 mutant cells are depleted for all structural γ-TuRC components except for γ-TuSCs and Actin (note that Mozart1 (Mzt1) is not expressed in larval brain cells (Tovey et al., 2018) and that Mzt2 does not exist in flies). In human and Xenopus γ-TuRCs, Actin supports γ-TuRC assembly via interactions with a GCP6-N-term-Mzt1 module (Liu et al., 2019; Wieczorek et al., 2019, 2020; Zimmermann et al., 2020; Consolati et al., 2020), and so Actin alone is unlikely to facilitate assembly of γ-TuSCs into higher order structures. Third, our data agree with the observation that near complete depletion of Grip71, Grip75GCP4, Grip128 GCP5, and Grip163GCP6 from S2 cells does not prevent γ-tubulin recruitment to centrosomes (Vérollet et al., 2006). Fourth, given the strength of mutant alleles used, one would have expected a much larger decrease in centrosomal γ-tubulin levels in spd-2,grip71,grip75GCP4,grip128GCP5-RNAi,grip163GCP6 mutant cells were Cnn not able to directly recruit γ-TuSCs to centrosomes. Thus, our finding that Cnn can still robustly recruit γ-tubulin to centrosomes in spd-2,grip71,grip75GCP4,grip128GCP5-RNAi,grip163GCP6 mutant cells strongly suggests that Cnn can recruit γ-TuSCs to centrosomes without a requirement for them to first assemble into higher-order complexes.”

      2) Given the advantage of the CnnΔCM1 separation of function mutant, I do not understand why it is not used throughout the study. Instead, full Cnn loss is used, which results in strongly reduced Spd-2 levels (Figure 2C,D). Are the observed differences between wild-type and mutants in Figure 2-5 dependent on defective PCM or do they also occur in a CnnΔCM1 background?

      This is a good point, and we agree that it would have been “cleaner” to use the CnnΔCM1 mutant in these experiments. The reason that the CnnΔCM1 mutant was not used is that this mutant allele was made only after we had already generated the multi-allele stocks and performed most of the other experiments in Figures 2-5. It would have taken a long time to go back and generate fly stocks containing the CnnΔCM1 allele instead of the cnn null mutant allele. As we have shown that the CnnΔCM1 mutant cannot recruit any γ-tubulin, we don’t believe that using this mutant would change the results regarding recruitment of γ-tubulin by the spd-2 pathway i.e. when we have examined γ-tubulin recruitment in the cnn mutant background (Figure 2). Nevertheless, in terms of the efficiency to which microtubules can be nucleated in the absence of γ-tubulin complexes, which was examined in a cnn,grip71,grip163 mutant background, it is likely that using a cnnΔCM1,grip71,grip163 mutant background would better maintain Spd-2 in the PCM and thus better allow Msps and Mei-38 to stimulate microtubule nucleation. We may therefore find that microtubules can be nucleated even more efficiently in the absence of γ-TuRCs. Note that we do state this caveat in the paper. That said, performing the experiments would not be essential to conclude that microtubules can be nucleated independent of γ-TuRCs, which is the main point of this part of the paper.

      Should the Reviewer and Editor deem it necessary, we will generate CnnΔCM1,grip71,grip163 lines to test whether or not γ-tubulin can be recruited to mitotic centrosomes under these conditions, and, if no γ-tubulin is recruited, we will generate CnnΔCM1,grip71 ,grip163,Jup-mCherry lines to test the ability of these centrosomes to nucleate microtubules (using the CherryTemp). Please note, however, that this would be several months of difficult fly genetics and data collection and we would therefore appreciate it if you consider the cost/benefit ratio when making your decision on whether you expect this data or not.

      3) Statistical tests should support the conclusions in the text. If the authors claim differences between different genetic backgrounds (e.g. that spd2-mutants only have ~77% of gamma-tubulin at mitotic centrosomes compared to wild-type), statistical tests must compare mutant mitosis vs. wild-type mitosis.

      We agree. We have now carried out the appropriate statistical tests and included them in the new version of the paper. For more detail, see the response to Reviewer 2 point 2.

      4) While Cnn, grip71, grip163 mutants do not accumulate gamma-tubulin at centrosomes in mitosis, they still have low levels of centrosomal gamma-tubulin. It is therefore misleading to refer to "gamma-tubulin negative centrosomes".

      This is a fair point. While we suspect this small fraction of γ-tubulin is non-functional in regard to microtubule nucleation i.e. it is the interphase pool of γ-tubulin and interphase centrosomes do not organise microtubules, we agree that referring to them as "gamma-tubulin negative centrosomes" is misleading. We have now changed the text to refer to them simply as “cnn,grip71,grip163 mutant centrosomes” or “cnn,grip71,grip163 centrosomes”.

      Minor points:

      1) The abstract states that gamma-TuRC is a catalyst of microtubule nucleation. By definition, a catalyst takes part in a reaction but is not part of the final product. Although our knowledge of the nucleation mechanism is still incomplete, mechanistic studies suggest a non-catalytical mechanism since gamma-TuRC was found to stay attached to the microtubule end after nucleation (Consolati et al. 2020, Wieczorek et al. 2020).

      We have now removed any reference to the γ-TuRC being a catalyst.

      2) CnnΔCM1 flies: genotyping data should be provided besides describing gRNAs.

      We are not entirely sure what the Reviewer means here. We had already stated in the main text and methods that the deletion region spanned from R98 to D167. For further clarity, we now included the word “inclusive” in both the main text and the methods: main text: “We therefore used CRISPR combined with homology-directed repair to delete the CM1 domain (amino acids 98-167, inclusive) from the endogenous cnn gene…”; Methods:“R98 to D167, inclusive. Please do let us know if further information is required.

      3) Is it important to combine spd-2 with all four mutants, grip75 grip128 grip163 and grip71? What about spd-2 grip71 cells and spd-2 grip75 grip128 grip163 cells? Should that not have the same effect?

      This comment relates to Major point 1, as our main conclusion (that Cnn can recruit γ-TuSCs) is only possible when combining spd2 with all four mutants i.e. targetting all γ-TuRC specific proteins is the most likely way to deplete as many pre-formed γ-TuRCs as possible. Depleting only Spd-2 and Grip71 would leave fully assembled γ-TuRCs in the cytosol, as assembly does not require Grip71. Depleting Spd-2, Grip75, Grip128, and Grip163 would prevent cytosolic γ-TuRC assembly, but there is a possibility that Grip71 may still act as a link between γ-TuSCs and Cnn. It was therefore necessary to deplete Spd-2, Grip75, Grip128, and Grip163, and Grip71.

      4) CM1-containing factors are the only known factors able to directly bind and activate gamma-TuRC. How do the authors envision activation of gamma-TuRC in the absence of Cnn?

      This is a good question but remains unanswered. Phosphorylation of γ-TuRCs is the most obvious possibility. For example, Aurora A phosphorylates NEDD1 (homologue of Grip71) to promote microtubule nucleation (Pinyol et al., 2013). NME7 kinase has been shown to increase the activity of purified γ-TuRCs (Liu et al., 2014). Other γ-TuRC components are also phosphorylated, but the consequences on γ-TuRC activity are not known. Another possibility is that TOG proteins indirectly promote the closing of the γ-TuRCs while adding tubulin dimers onto γ-tubulin (Thawani et al., 2020).

      5) Do the authors think that each identified pathway to microtubule nucleation (i.e. Spd-2/gamma-TuRC, Cnn/gamma-TuSC, Msps/mei38) as revealed by mutant genetic backgrounds contributes to a similar extent to overall nucleation capacity also in an unperturbed genetic background?

      Another good question, but it is very difficult to answer. Our view is that when γ-TuRCs are present and active they will likely dominate microtubule nucleation, out-competing the ability of TOG domain proteins to stimulate microtubule nucleation independently of γ-TuRCs. Nevertheless, TOG proteins will likely help promote microtubule nucleation from γ-TuRCs when both are present, as has been previously shown in vitro (Thawani et al., 2018; King et al., 2020; Consolati et al., 2020) and in fission yeast (Flor-Parra et al., 2018). We also believe that both Spd-2 and Cnn γ-TuRC recruitment pathways will contribute simultaneously. Another question is whether Cnn recruits γ-TuRCs instead of γ-TuSCs when γ-TuRCs are present in the cytosol. We assume this will depend on Cnn’s affinity of γ-TuRCs versus γ-TuSCs and on the relative levels of γ-TuRCs and γ-TuSCs in the cytosol.

      6) How does CM1 mediate binding to gamma-TuRC? Using recombinant Cnn fragments, the authors find that a Cnn triple mutant (R101Q, E102A and F115A) no longer binds gamma-tubulin, suggesting these residues together mediate binding to gamma-tubulin complexes. However, it is not tested to what extent R101, E102 and F115 individually contribute to gamma-tubulin binding. Does the binding mode in Drosophila resemble more the one in humans or in budding yeast? Also, was this done with extracts from Grip71, Grip75, Grip128RNAi, Grip163 embryos or normal embryos?

      In future, we will test the relative contributions of R101, E102 and F115, but for this study we wanted only to show that the CM1 domain was necessary for Cnn binding (hence why we directly mutated all three residues). We apologise for not stating that the IPs were carried out using wild-type embryos extracts – we have now included this information in the main text and methods.

      7) Figure 2C: Should the green channel not correspond to Spd-2?

      Thank you for pointing out this mistake – now corrected.

      8) I suggest to reconsider the color-coding of graphs. While the colored background of the dot plots in Figure 1 and 2 are a matter of taste, the coloring of graphs in Figure 4F-H is confusing. Here, genetic backgrounds of fly lines are colored in the same way as the microscopy channels in Figure 4A-E, but they do not belong together.

      We have now modified the colour-coding of images/graphs in Figure 4A-E as suggested.

      9) A tacc mutant allele is used in experiments, but is not further described. Please provide the necessary background information.

      We thank the reviewer for pointing this out. We had also forgotten to include the msps alleles used. The information for msps and tacc are now included in the methods.

      10) The authors assess spindle quality in Cnn, grip71, grip163 cells and show that spindle quality worsens with ectopic msps. For comparison it would be good to compare spindle quality side by side with a wild-type situation.

      This data is now included in Figure S4A,B.

      11) Introduction: "[...], however, as they depend upon each other for their proper localisation within the PCM and act redundantly." - Sentence is incomplete.

      I think this was just to do with how we had phased the sentence (the position of “however” was confusing). We have now rephrased the sentence: “It is complicated, however, to interpret the individual role of these proteins in the recruitment of γ-tubulin complexes, as they depend upon each other for their proper localisation within the PCM and act redundantly”.

      12) Introduction: "Cnn contains the highly conserved CM1 domain (Sawin et al., 2004), which binds directly to γ-tubulin complexes in yeast and humans (Brilot et al., 2021; Wieczorek et al., 2019)". - Choi et al 2010 should also be cited here.

      This citation has been added.

      13) Results: "Typically, interphase centrosomes have only ~5-20% of the γ-tubulin levels found at mitotic centrosomes, [...]". - Citation is needed

      We now cite our Conduit et al., 2014 paper.

      14) The authors should discuss that Msps was found to act non-redundantly with gamma-tubulin in interphase nucleation (Rogers, MBC, 2008), contrary to the conclusions in the current manuscript.

      Thank you for pointing this out. We have now modified the relevant part of the discussion to read:

      “TOG domain and TPX2 proteins have been shown to work together with γ-TuRCs (or microtubule seed templates) to promote microtubule nucleation (Thawani et al., 2018; Flor-Parra et al., 2018; Gunzelmann et al., 2018b; Consolati et al., 2020; King et al., 2020; Wieczorek et al., 2015). Consistent with this, co-depletion of γ-tubulin and the Drosophila TOG domain protein Msps did not delay non-centrosomal microtubule regrowth after cooling compared to single depletions in interphase S2 cells (Rogers et al., 2008). Nevertheless, several studies, mainly in vitro, have shown that TOG and TPX2 proteins can also function independently of γ-TuRCs to promote microtubule nucleation (Roostalu et al., 2015; Woodruff et al., 2017; Schatz et al., 2003; Slep and Vale, 2007; Ghosh et al., 2013; Thawani et al., 2018; King et al., 2020; Zheng et al., 2020; Tsuchiya and Goshima, 2021; Imasaki et al., 2022). Our data suggest that, unlike from non-centrosomal sites in interphase S2 cells, Msps can promote γ-TuRC-independent microtubule nucleation from centrosomes in mitotic larval brain cells. This difference may reflect the ability of centrosomes to concentrate Msps at a single location.”

      **Referees cross-commenting**

      This is a good paper in my opinion, they need to add some controls though, to determine the expected presence/absence of gTuSC/gTuRC in the different mutants. An important advance is the finding that gTuSC can function as nucleator in parallel to gTuRC, depending on the recruitment mechanism. Different recruitment mechanisms, nucleation templates, and regulatory strategies co-exist and provide complex regulation and robustness to nucleation/spindle assembly. We thank the Reviewer for their thorough and constructive review. We hope they will agree to allow publication without us having to perform the sucrose gradient experiments that, as discussed above, will be very difficult, if not impossible, to carry out.

      Reviewer #1 (Significance (Required)):

      This is a very well-executed study and the data is presented clearly. However, some findings would benefit from additional experiments to substantiate the main interpretations. If these points are addressed, the study would provide an important conceptual advance in the field, namely that animal cells may rely on two different gamma-tubulin complexes for nucleation at mitotic centrosomes, gamma-TuSC and gamma-TuRC, which differ not only in their composition of GCP proteins but also the mode of recruitment to the centrosome. The findings will be of interest to all cell biologists.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary This paper sets out to further our understanding of how two proteins, Cnn and Spd-2, independently recruit g-Tubulin ring complexes(g-TuRC) to mitotic centrosomes in Drosophila cells. It uses some robust classical genetics to generate mutants to reduce/remove GCP4/5/6, Dgrip71 and Cnn and Spd-2 from cells, monitoring the consequences using live imaging.

      It begins by showing that Cnn can recruit g-Tubulin independently of the core g-TuRC components or Dgrip71, and that a mutant Cnn lacking the CM1 domain cannot, strongly suggesting that, similarly to other organisms, the CM1 domain is essential for this function.

      It then demonstrates that Spd-2, in contrast, cannot localise g-Tubulin in the absence of the g-TuRC components or Dgrip71.

      In the second half of the paper, then use this tool as a proxy for centrosomes that completely lack mitotic g-Tubulin recruitment, in order to explore spindle assembly in the absence of centrosomal g-Tubulin. The show that microtubules and spindle are still nucleated but do so with different dynamics. This section is particularly convincing, given the use of the live de/repolymerisation assays using the CherryTemp device.

      Finally, the authors visualise spindle formation in the absence of centrosomal g-Tubulin, alongside a number of other MT associated proteins, including Msps.

      Major Comments 1. The claims and conclusions relating to the first half of the paper are supported by the data, but they need to be caveated by a clear explanation of the alleles used. Some are well-characterised mutant lines but have they been previously shown to completely remove the associated protein products? For the RNAi lines, do the authors have evidence (via Western blots) that these remove the protein products? It is not necessary that they show Western blots for all the lines, and it does not invalidate the major conclusions that the fly line carrying mutations in cnn, grip71, grip163 completely fails to localise g-Tubulin to mitotic centrosomes. However, they need to help the reader understand much more clearly whether these lines are complete nulls and, consequently this may impact the strength of their interpretation of the relationship between Grip163 versus Grip75, discussed both at the end of the relevant section and in the Discussion.

      We appreciate the reviewer’s concern and have now included a detailed description in the Methods section of the alleles we use and their known effect on protein levels (pasted below for convenience). We have also included western blots for cnn and spd2 mutants to show the absence of detectable protein in larval brains. Unfortunately, we could not provide western blots for the other mutants, as we don’t have working antibodies for these proteins (although for Grip71 we did make an antibody and did western blots that showed the absence of protein in grip71 mutants, but this antibody has now been commercialised and so the western blot is published on the CRB website: https://crbdiscovery.com/polyclonal-antibodies/anti-grip71-antibody/). Nevertheless, protein levels for the grip75, grip163, msps and tacc mutants have been shown previously (now cited in the new text). We have also modified the main text to allow the reader to better understand whether proteins are completely absent or strongly reduced. In response to the specific comment about interpreting the relationship between Grip163 and Grip75, as we mention in the new methods section, the Grip75 allele is a null mutant while the Grip163 mutant is a severe depletion; thus, the fact that the Grip163 mutant has a stronger effect on γ-tubulin recruitment is not due to a stronger depletion.

      New text in methods: “For spd-2 mutants, we used the dspd-2Z35711 mutant allele, which carries an early stop codon resulting in a predicted 56aa protein. Homozygous dspd-2Z35711 mutant flies lack detectable Spd-2 protein on western blots and so the allele is therefore considered to be a null mutant (Giansanti et al., 2008). This allele no longer produces homozygous flies (which is common for mutant alleles kept as balanced stocks for many years), which combined dspd-2Z35711 with a deficiency that includes the entire spd-2 gene (dspd-2Df(3L)st-j7). On western blots, there was no detectable Spd-2 protein in extracts from dspd-2Z35711 / dspd-2Df(3L)st-j7 hemizygous mutant brains (Figure S4B). For cnn mutants, we combined the cnnf04547 and cnnHK21 mutant alleles. The cnnf04547 allele carries a piggyBac insertion in the middle of the cnn gene and is predicted to disrupt long Cnn isoforms, including the centrosomal isoform (Cnn-C or Cnn-PA) (Lucas and Raff, 2007). This mutation is considered to be a null mutant for the long Cnn isoforms (Lucas and Raff, 2007; Conduit et al., 2014). The cnnHK21 allele carries an early stop codon after Cnn-C’s Q78 (Vaizel-Ohayon and Schejter, 1999) and affects both long and short Cnn isoforms – it is considered to be a null mutant (Eisman et al., 2009; Chen et al., 2017a). On western blots, there was no detectable Cnn-C protein in cnnf04547 / cnnHK21 hemizygous mutant brains (Figure S4A). For Grip71, we used the grip71120 mutant allele, which is a result of an imprecise p-element excision event that led to the removal of the entire grip71 coding sequence except for the last 12bp; it is considered to be a null mutant (Reschen et al., 2012). We combined this with an allele carrying a deficiency that includes the entire grip71 gene (grip71Df(2L)Exel6041). On western blots, there is no detectable Grip71 protein in grip71120 / grip71df6041 hemizygous mutant brains (see blots on CRB website, which were performed by us). For Grip75GCP4, we used the grip75175 mutant allele, which carries an early stop codon after Q291. Homozygous grip75175 mutant flies lack detectable Grip75GCP4 protein on western blots and so the allele is therefore considered to be a null mutant (Schnorrer et al., 2002). We combined this with an allele carrying a deficiency that includes the entire grip75GCP4 gene (grip75Df(2L)Exel7048). In the absence of a working antibody, we have not confirmed the expected absence of Grip75GCP4 protein in grip75175 / grip75Df(2L)Exel7048 hemizygous mutant flies on western blots. For Grip128GCP5, we used the UAS-controlled grip128-RNAiV29074 RNAi line, which is part of the VDRC’s GD collection, and drove its expression using the Insc-Gal4 driver (BL8751), which is expressed in larval neuroblasts and their progeny. In the absence of a working antibody, we have not confirmed the absence or reduction of Grip128GCP5 protein on western blots. RNAi was used for grip128GCP5 as its position on the X chromosome made generating stocks with multiple alleles technically challenging. For Grip163GCP6, we used the grip163GE2708 mutant allele, which carries a p-element insertion between amino acids 822 and 823 (total protein length is 1351aa) and behaves as a null or strong hypomorph mutant (Vérollet et al., 2006). We combined this with an allele carrying a deficiency that includes the entire grip163GCP6 gene (grip163Df(3L)Exel6115). In the absence of a working antibody, we have not confirmed the absence or reduction of Grip163GCP6 protein in grip163GE2708 / grip163Df(3L)Exel6115 hemizygous mutant flies on western blots. For Msps, we used the mspsp and mspsMJ15 mutant alleles. The mspsp allele carries a p-element insertion within, or close to, the 5’ UTR of the msps gene and results in a strong reduction, but not elimination, of Msps protein on western blots (Cullen et al., 1999). The mspsMJ15 allele was generated by re-mobilisation of the p-element (the genetic consequence of which has not been defined) and also results in a strong reduction, but not elimination, of Msps protein on western blots (Cullen et al., 1999; Lee et al., 2001). For TACC, we used the taccstella allele which contain a p-element insertion of unknown localisation but that results in no detectable TACC protein on western blots (Barros et al., 2005). For Mei-38, we used the UAS-controlled mei-38-RNAiHMJ23752 RNAi line, which is part of the NIG’s TRiP Valium 20 collection, and drove its expression using the Insc-Gal4 driver (BL8751). In the absence of a working antibody, we have not confirmed the absence or reduction of Mei-38 protein on western blots. RNAi was used for mei-38 as its position on the X chromosome made generating stocks with multiple alleles technically challenging. Moreover, the only available mutant of mei-38 affects a neighbouring gene.”

      I have an issue with the statistics in Figure 1 &2. I realise the t-tests in Figure 1 show the significant differences between g-Tubulin recruitment to centrosomes in interphase and mitosis, in order to demonstrate the difference between the Spd-2;Grip combination line in (B) and the Spd-2; CnnCM1 double mutant in (D). But in doing so, it draws attention to the fact that there is no similar t-test between mitotic g-Tubulin recruitment to centrosomes in WT, Spd-2 and the Spd-2;Grip combination lines. This lack of stats between conditions is further confused by the language used in the text: In the Figure legend, the authors claim mitotic centrosomal g-Tubulin levels between are WT, Spd-2 and the Spd-2;Grip combination lines "similar", and in the text they say: the spd-2 Grip combination line had g-Tubulin "similar to the levels found at spd-2 mutants alone". But then they give numbers - an average of 77% of wild type for spd2 and 66% of wild type for the spd-2 Grip combination. I'm sure if they did a t-test they would find a significant difference between these conditions. This doesn't invalidate the thrust of what they're claiming, but they do need to be consistent in language, analysis and interpretation.

      We agree that we should have performed a statistical comparison between the γ-tubulin levels for “WT mitosis” vs “spd2 mitosis” and for “spd-2 mitosis” vs “spd2,grip71,grip75,grip128,grip163 mitosis” (Figure 1B). We have now done this and found statistically significant differences in both cases. We have included the new p-values in the figure and modified the main text to read: “In fact, the centrosomes in these spd-2,grip71,grip75GCP4,grip128GCP5-RNAi,grip163GCP6 mutant cells had ~66% of the γ-tubulin levels found at wild-type centrosomes, only slightly lower than the levels found at spd-2 mutants alone (Figure 1A,B).”; and we have modified the legend to read: “A one-way ANOVA with a Sidak’s multiple comparisons test was used to make the comparisons indicated by p values in the graph. Note that there is only a small reduction in mitotic centrosomal γ-tubulin levels in spd-2 mutants and in spd-2, grip71,grip75GCP4, grip128GCP5-RNAi,grip163GCP6 mutants, showing that Cnn can still efficiently recruit γ-tubulin complexes to mitotic centrosomes when only γ-TuSCs are present.” Note that due to performing comparisons multiple times with the same data sets, it was necessary to use a one-way ANOVA with a Sidak’s multiple comparisons test (rather than paired t-tests).

      For Figure 1D, we did not compare WT mitosis vs cnn∆CM1,spd-2 mitosis, as the point here was to test whether there was an increase from interphase to mitosis in cnn∆CM1,spd-2 mutants and we wanted to maintain the statistical power of using a paired t-test (one is more likely to detect differences with a paired t-test than with a multiple comparisons ANOVA, making the conclusion that there is no difference between interphase and mitotic cnn∆CM1,spd-2 centrosomes even more solid).

      Similarly, in Figure 2, it would be better to assess any statistically significant difference between mitotic accumulation of g-Tubulin between fly lines, rather than accumulation between interphase and mitosis (which is pretty clear cut). This would help to clarify whether differences between loss of grip subunits are merely additive or synergistic. Again, this doesn't invalidate the overall result that concomitant loss of cnn, grip71 and grip163 completely abolishes mitotic centrosomal accumulation of g-Tubulin, but it is a more complete analysis of the extant data.

      As for Figure 2, we respectfully disagree that we should make comparisons between genotypes instead of, or in addition to, making comparisons between interphase and mitotic centrosomes within the same genotype. This is because we will lose statistical power by performing a multiple comparisons test. Indeed, if we were to compare both within and between selected genotypes (14 comparisons in total), then we lose the statistically significant differences between interphase and mitotic centrosomes in cnn,grip75,grip163 (p=0.04) and cnn,grip71,grip75 (p=0.08) genotypes, when there clearly appears to be a difference (as stated by the Reviewer). Given that the point of this experiment is to elucidate which proteins are required to allow maturation from interphase to mitosis, rather than which combination of mutations has the stronger effect, we feel that maintaining the paired t-test analysis is more appropriate.

      One OPTIONAL experiment that would significantly improve the study would be similar CherryTemp live imaging of the cells lacking both centrosomal g-Tubulin and Msps. Currently the manuscript finishes with a fixed analysis of MT de/repolymerisation in these cells, which provides evidence that Msps has a role in MT nucleation in the absence of centrosomal g-Tubulin-nucleated MTs, but very little else can be concluded.

      We would love to do this experiment but the genetics are complicated. We would have to generate stocks containing a cnn,grip71,GFP-PACT triple allele chromosome II and a grip163,msps,Jupiter-mCherry triple allele chromosome III. While live data would provide interesting insights into the dynamics of microtubules nucleated in the absence of γ-TuRCs and reduction of Msps, our fixed analysis is at least sufficient to implicate Msps in γ-TuRC-independent microtubule organisation.

      1. There is, perhaps surprisingly, no mention of Augmin in the paper. Augmin has been shown to recruit g-TuRC to pre-existing MTs, through the grip71 subunit (Chen et al., 2017). So, presumably, in cnn, grip71, grip163, g-Tubulin cannot be recruited to pre-existing MTs either? This could add impact to the results - in that it implies the MT nucleation seen in the absence of cnn, grip71 and grip163 actually reflects, not just loss of centrosome function, but also loss of Augmin function. Mentioning this in the discussion could help increase the impact of the paper.

      We apologise for this oversight. Indeed, it is perfectly possible that Grip71/Augmin-mediated amplification of microtubules during microtubule re-growth from centrosomes could influence the difference in recovery rates between control and mutant centrosomes. We have now modified the results section to read:

      “Our data suggest that microtubules are more resistant to cold-induced depolymerisation when they have been nucleated independently of γ-TuRCs, but that microtubules are nucleated more efficiently when γ-TuRCs are present. However, it must be considered that, due to the loss of Cnn from centrosomes in the cnn,grip71,grip163 mutant cells, general PCM levels are reduced, likely reducing the levels of any protein involved in γ-TuRC-independent microtubule nucleation. Moreover, Grip71 is necessary for γ-TuRC recruitment to microtubules, most likely via the Augmin complex (Reschen et al., 2012; Chen et al., 2017b; Dobbelaere et al., 2008; Vérollet et al., 2006), enabling microtubules to be nucleated from the sides of pre-existing microtubules. Thus, the potential for Augmin-mediated amplification of centrosome-nucleated microtubules in control cells may also contribute to the increased microtubule recovery speed in control cells. Importantly, however, both of these caveats make it even clearer that microtubules can be nucleated independently of γ-TuRCs from mitotic centrosomes in Drosophila.”

      Minor comments 1. The cnn, grip71, grip163 mutant image in Fig3 B after 40 min cooling appears to have 4 centrioles. Is this a cell that exited and re-entered mitosis?

      Cnn mutant cells often have centrosome segregation problems, resulting in cells with variable numbers of centrioles (Conduit et al., 2010b, Current Biology). We have now mentioned this in the legends for Figure 3, Figure 4, and Figure S4.

      Methods should contain more detail on the de/repolymerisation live imaging analysis (including the numbers of cells contributing to the analysis) and techniques such exponential curve fitting.

      We have now included this information in the methods and updated this information in the figure legend (to include cell numbers, not just centrosome numbers, and to indicate that GraphPad Prism was used to generate the models.

      P5 para 2 - "GPC4/5/4/6" should read "GCP4/5/6"

      We actually use the GCP4/5/4/6 nomenclature throughout as it represents the 2 copies of GCP4 to one copy of GCP5 and GCP6 in the complex, as well as the order of these molecules.

      Fig legend 1 - "error bar" should read "scale bar"

      Thanks, now corrected

      Reviewer #2 (Significance (Required)):

      The experimental approach (genetics and cell biology) taken in this manuscript is very appropriate and the experiments are of high quality. It uses the strengths of Drosophila to cleverly engineer flies to pull apart the relationship between two different ways to recruit the main MT nucleator, g-Tubulin, to mitotic centrosomes. This is an important advance for the specific research field of centrosome biology.

      By generating a fly that completely fails to localise g-Tubulin to mitotic centrosomes, the paper is able to explore whether MTs and the mitotic spindle can form in its absence. Again, there is very high quality imaging and image analysis, using a commercially available (but very cool) fast heating/cooling apparatus - the CherryTemp to explore the dynamics of MT generation. The limitation to this approach, though, is that g-Tubulin itself is still present and presumably able to nucleate MTs in the cytosol or elsewhere, albeit inefficiently. As such, it adds to a body of centrosomal and cell division research, rather than adding a highly significant conceptual advance.

      Similarly, the finding that Msps is involved in nucleating MTs in the absence of centrosomal g-Tubulin, via fixed analysis, supports other work, rather than moving the field forwards.

      Overall, assuming the caveats mentioned in the major comments are dealt with, I see this as a robust and very well carried out piece of research, that will be of interest to those investigating the broad field of cell division

      My field of expertise is Drosophila cell division

      We thank the Reviewer for their thorough and constructive review. We hope that the reviewer may agree with us and the other Reviewers that revealing the complexity of γ-TuRC recruitment and microtubule nucleation at centrosomes, particularly the finding that different types of γ-tubulin complexes are recruited to centrosomes by different tethering proteins, provides an important conceptual advance.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Centrosomes are complex and it has been appreciated for some time that they likely nucleate microtubules by more than one mechanism. However, what these mechanisms exactly are, and which are the most significant has not been clear. A major contributor to centrosomal microtubule nucleation the tubulin isoform gamma-tubulin (g-tubulin), which is present in two complexes, a smaller gTuSC that contains gamma tubulin along with GCP2 and 3 and a larger g-tubulin ring complex (gTuRC) whose assembly additionally requires GCP4/5/6. A second high-level question has been whether the centrosome has any g-tubulin-independent microtubule nucleation mechanisms. In this manuscript, the authors use a collection of mutants and RNAi conditions in the Drosophila brain to generate a picture of centrosomal microtubule nucleation pathways. They show that there are two g-tubulin-dependent and a third g-tubulin-independent microtubule nucleation pathways. They show that the first g-tubulin-dependent pathway depends on the CM1 domain of the centrosomal PCM matrix protein, Centrosomin (Cnn) and on the gTuSC components GCP2/3, but not on the components specifically required for gTuRC assembly. The second g-tubulin-dependent pathway depends on Spd-2 (and not Cnn) and requires the gTuRC-specific components and NEDD1/Grip71. By inhibiting both of these pathways, the authors also show that there is a robust g-tubulin-independent microtubule nucleation pathway. Overall, has the potential to be an impactful contribution from a conceptual point-of-view. I would be excited to recommend publication if the major comments below, particularly points 1 and 2, could be addressed.

      1. The experiment in Fig. 2B examines what is required for Spd-2 to recruit g-tubulin to mitotic centrosomes that lack Cnn. This panel should include a cnn mutant-only control, for which the readers are currently referred to an older paper from 2014. Without repeating this control in parallel to one of the conditions in this panel, it is impossible to say whether the addition of the grip71 mutation has any effect on g-tubulin levels.

      This is a good point. We will perform a cnn vs cnn,grip71 experiment and include this data in the new version of the paper. This will take a couple of months, as this will involve growing up fly lines, performing the necessary crosses, microscopy, data analysis, and manuscript updating.

      1. The experiment in Fig. 2B is in the background of a Cnn loss-of-function mutation in which centrosomal Spd-2 is at just under 40% of its levels in brains with Cnn (according to Fig. 2D). So the Spd-2 doing the recruiting-is the non-Cnn-dependent population. The authors should also do one experiment in the background of their Cnn-CM1delete mutant or their Cnn CM1 g-tubulin recruitment mutant, because these backgrounds would be expected to have normal amounts of Cnn matrix and normal levels of Spd-2. Comparing the amount of g-tubulin recruitment in a cnn loss-of-function mutant to that in a cnn-CM1delete mutant would reveal whether the Cnn-bound Spd-2 can contribute to g-tubulin recruitment in the same way that the Cnn-independent Spd-2 can. These two populations could easily differ in their ability to recruit g-tubulin. Also, is it clear that these two pathways can act in parallel (i.e. that assembly of the Cnn matrix around the centriole does not mask the ability of Cnn-independent Spd-2 to recruit g-tubulin)? Thus, there are three possibilities- all interesting- for the outcome of this experiment. The Cnn-CM1delete mutant/Cnn-CM1 g-tubulin recruitment mutants could: (1) recruit less g-tubulin than the cnn loss-of function mutant (if Cnn matrix assembly inhibits the Cnn-independent Spd-2 pathway), (2) recruit the same amount of g-tubulin as the cnn loss-of-function mutant (if the Cnn matrix does not inhibit the Cnn-independent Spd-2 pathway but Cnn-dependent Spd-2 does not itself recruit g-tubulin), or (3) recruit more g-tubulin than the cnn loss-of-function mutant (if both the Cnn-dependent and Cnn-independent Spd-2 can recruit g-tubulin).

      These are very interesting points that we have not considered before. As the reviewer suggests, we will perform an analysis of γ-tubulin levels at centrosomes in cnnnull vs cnn∆CM1 to test the ability of Cnn-dependent and Cnn-independent populations of Spd-2 to recruit γ-tubulin. This should take ~2 months.

      1. The paper needs a summary model figure that the field can understand. The current model in Fig. 2E does not suffice in this regard. It would be nice to have this model appear at the end of the paper to outline the 3 pathways for centrosomal microtubule nucleation outlined by the work. Maybe have an arc for the centrosome at the bottom of the figure and show arrows from the gTuSC to the Cnn CM1 domain from the gTuRC to the Cnn CM1 domain and the gTuRC to Spd-2 or something like this. How you draw this could be impacted by the experiment outlined above in point 2. Also, there would be a g-tubulin-independent pathway in the figure. Not everyone reads papers carefully, and you want people to be able to get the takeaway message at a glance.

      We have now completely modified the Figure and moved it to the end of the paper (new Figure 5). We thank the Reviewer for this suggestion as it really does provide a clearer message for the reader.

      1. The authors show that this pathway is modulated by loss of Minispindles (Msps)-but as this is a critical microtubule assembly factor, it seems likely that Msps loss might modulate all of the pathways. From the data in Figure 4, my main takeaway would be that Msps is not the central player in the g-tubulin independent nucleation pathway. It might make the paper more impactful to end the story after Fig. 4, move the current Fig. 5 to the supplement and add a nice model figure at the end.

      We agree that Msps may play a role beyond microtubule nucleation, including plus end growth, and that this may also influence the efficiency of spindle formation in cnn,grip71,grip163,msps mutants. Nevertheless, our microtubule regrowth data in original Figure 5A clearly show that Msps is a key player in the g-tubulin independent nucleation pathway at centrosomes. Perhaps the Reviewer missed this point as the data was in Figure 5 and not Figure 4. Moreover, the original Figure 5E shows that the effect of depleting Msps in addition to cnn, grip71 and grip163 is specific to cells containing centrosomes i.e. if Msps played a significant role in microtubule regulation beyond its role at centrosomes, then one would expect spindle formation to be worse when comparing mutant cells that lack centrosomes. Nevertheless, we now realise it would be better to include the microtubule regrowth from centrosomes data for cnn,grip71,grip163 vs cnn,grip71,grip163,msps in Figure 4, and move the spindle assembly data from original Figure 5C-E to a new supplementary Figure (Figure S4). We then end the paper on a model figure in new Figure 5.

      Minor comments: 5. In Fig. 1E the sequence labels are confusing. Please label each sequence on the left with the residue numbers in the corresponding endogenous protein that are shown in the alignment.

      You are absolutely right, I’m not sure why our labelling was like that. Now corrected.

      In Fig. 1F, please label with location of molecular weight markers

      Now added.

      Reviewer #3 (Significance (Required)):

      Repeating my text from above. Centrosomes are complex and it has been appreciated for some time that they likely nucleate microtubules by more than one mechanism. However, what these mechanisms exactly are, and which are the most significant has not been clear. A major contributor to centrosomal microtubule nucleation the tubulin isoform gamma-tubulin (g-tubulin), which is present in two complexes, a smaller gTuSC that contains gamma tubulin along with GCP2 and 3 and a larger g-tubulin ring complex (gTuRC) whose assembly additionally requires GCP4/5/6. A second high-level question has been whether the centrosome has any g-tubulin-independent microtubule nucleation mechanisms. In this manuscript, the authors use a collection of mutants and RNAi conditions in the Drosophila brain to generate a picture of centrosomal microtubule nucleation pathways. They show that there are two g-tubulin-dependent and a third g-tubulin-independent microtubule nucleation pathways. They show that the first g-tubulin-dependent pathway depends on the CM1 domain of the centrosomal PCM matrix protein, Centrosomin (Cnn) and on the gTuSC components GCP2/3, but not on the components specifically required for gTuRC assembly. The second g-tubulin-dependent pathway depends on Spd-2 (and not Cnn) and requires the gTuRC-specific components and NEDD1/Grip71. By inhibiting both of these pathways, the authors also show that there is a robust g-tubulin-independent microtubule nucleation pathway. Overall, has the potential to be an impactful contribution from a conceptual point-of-view. I would be excited to recommend publication if the major comments, particularly points 1 and 2, could be addressed.

    1. The prior for the slope is a lot easier now. We can simply specify a normal distribution with a mean of 0 and a standard deviation equal to the size of the effect we deem likely, together with a lower bound of 0 and upper bound of 1.

      Update: I was wrong on the below, the SD is not 1 here, because it's the SD for the residual term in the linear model, not the SD for the raw outcome variable.

      Previous comment:...

      I’m ‘worried’ that if you give it data you know has sigma=1, but you allow it to choose any combination of beta and sigma, you may be getting it to do give a weird posterior to both of the parameters, in a way you know can’t make sense, in order to find the most likely parameters for the weird geocentric model you imposed.

      on the other hand I would have thought that it would tend to converge to a sigma=1 anyways as the most likely, as that is ‘allowed’ by your model

      my take is that the cauchy prior you impose in that part is heliocentric; well let me expand on this. I think you know that the true std deviation of the ‘standardized heights from this population’ is 1 what you don’t know is whether it is indeed normal (i.e., whether family = gaussian is right here) thus it might be finding ‘a sigma far from 1 is likely’ under this model, because that makes your ‘skewed’ or ‘fat tailed’ data seem more likely under the normal prior A better approach might be to allow a different distribution with some sort of ‘skew’ parameter, but imposing the sd must be 1

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01573

      Corresponding author(s): Helder Maiato and Niels Galjart

      1. General Statements

      The murine microtubule (MT) plus-end tracking protein CLASP1 has been extensively examined in cultured cells, revealing an important function for this protein in mitosis and the regulation of MT dynamics. Here we describe a major in vivo phenotype of Clasp1 knockout (KO) mice: we find that these mice die at birth due to respiratory problems. In the first version of our manuscript we tried to link this in vivo phenotype of the KO mice to CLASP1’s major roles in cultured cells, including mitosis, and we therefore included multiple results, obtained in cultured cells and in different organs.

      We thank the reviewers for their thoughtful and constructive criticisms and for their judgment that our study is - in principle - worthy of publication. Based on suggestions by reviewers #2 and #3 we have decided to focus the revised manuscript on the lung phenotype of the Clasp1 KO mice, and on a possible cause for this phenotype. We believe that our new analysis, which was partly driven by the remarks of the reviewers, is revealing a mechanism for why the mice die at birth. This mechanism suggests a role for CLASP1 in controlling epithelial and endothelial cell differentiation in the neonatal lung, and in particular protein secretion in AT2 alveolar cells.

      2. Description of the planned revisions

      General remarks

      We believe our new RNA-Seq analysis (explained in detail below, in point 3 “Description of incorporated revisions”) strongly suggests that four essential lung cell types (i.e. AT1 and AT2 cells, endothelial cells and immune cells) fail to properly differentiate in Clasp1 KO embryos. In particular AT2 cell differentiation and functioning are hampered in the KO mice.

      Brief summary of planned experiments and table of old and new Figures

      To support our new findings we will stain sections of wild type and KO lung with a selected set of antibodies and other reagents. To help the reviewers we have made a table with original Figures and Figures for the revision.

      Figure Number

      Original Figure

      Fate of original

      Revision Figure

      1

      Targeted inactivation of the Clasp1 allele

      Remains

      Targeted inactivation of the Clasp1 allele

      2

      Clasp1 KO mice show reduced rib-cage and delayed ossification

      Minor revision

      Clasp1 KO mice show reduced rib-cage and delayed ossification (Statistics will be added)

      3

      Innervation of the diaphragm is affected in Clasp1 KO mice from E14.5-E18.5

      Moved to Supp

      Newborn Clasp1 KO lungs show a drastic reduction in air inflation

      4

      Neurite outgrowth, branching capacity and microtubule dynamics are altered in Clasp1 KO neurons

      Removed

      Histological and immunological examination of the Clasp1 KO lungs demonstrating decreased air space

      5

      Histological and immunological examination of the Clasp1 KO lungs demonstrating decreased air space

      Moved Up

      (4)

      Histo-morphological analysis of the developing lung throughout embryonic development (E14.5-PN1)

      6

      Transcriptome analysis of wild type and Clasp1 KO lungs

      Major revision

      Transcriptome analysis of wild type and Clasp1 knockout lungs reveals differentiation defects in four major lung cell types (New data added, old data moved to Supp)

      7

      Loss of Clasp1 alters the ratio of alveolar type I and type II cells in the lungs

      Major revision

      Cellular analysis of Clasp1 knockout lungs (New data will be added)

      8

      -

      -

      Role of Clasp1 in AT2 function (New data will be added)

      S1

      Incidental cell division defects in mouse embryonic fibroblasts derived from Clasp1 knockout mice

      Removed

      Innervation of the diaphragm is affected in Clasp1 knockout mice from E14.5-E18.5

      S2

      Ultra-structural analysis of diaphragms

      Remains

      Ultra-structural analysis of diaphragms

      S3

      Newborn Clasp1 knockout lungs show a drastic reduction in air inflation

      Moved to Main (3)

      Cellular analysis of late stage gestation mouse lungs

      S4

      Histo-morphological analysis of the developing lung throughout embryonic development (E14.5-PN1)

      Moved to Main (5)

      Exogenous administration of glucocorticoids promotes lung maturation and partially rescues postnatal lethality

      S5

      Cellular analysis of late stage gestation mouse lungs

      Moved Up

      (S3)

      Analysis of signature genes and cell type signatures of the mouse and human lung

      S6

      Exogenous administration of glucocorticoids promotes lung maturation and partially rescues postnatal lethality

      Moved Up (S4)

      Transcriptome analysis of wild type, Clasp1, and Mll3 knockout E18.5 lungs

      Below we react to specific comments of the reviewers, describing in more detail which experiments will be carried out and why we will do these experiments.

      Specific remarks to the comments of the reviewers

      Reviewer #1.

      Comment:

      p.17: Aqp5 expression was decreased in mutant lungs as shown by RNA-seq data and RT-qPCR. However, immunolabelling with T1a does not show a decrease in the number of Type I pneumocytes (Fig. 7D). According to the data presented, it is difficult to conclude that CLASP1 is involved in Type I pneumocyte differentiation.

      A cell count should be done for Figure 7D. Immunolabeling with more markers for Type I pneumocytes, including AQP5 Ab, should be performed to determine if the decreased Aqp5 RNA expression correlates with less Type I cells. GSEA signature has to be confirmed by additional analyses.

      Answer:

      Given the flat appearance of the T1a-positive cells (see old Figure 7E) it is difficult to carry out a quantification for T1a (which is Pdpn). We will perform new IF experiments to examine AT1/2 cell numbers using additional markers (e.g. Hopx for AT1).

      Comment:

      p.17: The same comments can be made for Type II pneumocytes and SpC expression.

      Answer:

      We actually did do an Sftpc (Pro-SPC) count (see old Figure 7E), which reveals that the number of Sftpc-expressing cells is up in the Clasp1 KO. At first sight this seems surprising, given that Chil1 (a top AT2 signature gene at E18.5) is virtually absent from Clasp1 KO lungs. However, our new GSEA analysis (shown in the new Figure 6) shows that of all the E18.5 AT2 signature genes (403 genes in total) the majority is down-regulated, including Chil1 and 4 other top signature genes, but some genes are up, including Sftpc (see new Figure 6). Combined with the fact that we observe more Pro-SPC-expressing cells in the Clasp1 KO lung we hypothesise that AT2 cell numbers are up compared to wild type, giving rise to higher mRNA counts of some genes in the RNA-Seq. Differentiation of AT2 cells is significantly hampered, giving rise to lower expression of many AT2 signature genes in the RNA-Seq. By contrast, all AT1 signature genes are either down or not affected (see new Figure 6). We interpret this as evidence that AT1 cell numbers are down. The same goes for endothelial cells (EC, see new Figure 6). We will perform additional IF experiments to examine this hypothesis.

      Reviewer #3.

      Comment:

      T1α-positive cells should be quantified (Figure 7D). From the images, the number of T1α+ cells in Clasp1 KO is not consistent with the qPCR result showing markedly reduced Aqp5 transcript levels in Clasp1 KO. It is unclear whether the reduction in Aqp5 is due to impaired water channel function as the authors suggest or instead due to reduced number of AT1 cells, further investigation should be conducted.

      Answer:

      Please see our answer to reviewer #1 above. To summarise, we now have evidence that AT1 cell numbers are down. We will perform additional IF experiments to examine this hypothesis.

      Comment:

      Additional AT1 markers (Hopx, Ager, Clic5 and Rage) should be assessed by qPCR and immunostaining to determine the effect of Clasp1 knockout on AT1 cells.

      Answer:

      Please see our answer to reviewer #1 above. To summarise, we will perform new IF experiments to examine AT1/2 cell numbers using additional markers (e.g. Hopx for AT1).

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      General remarks

      As explained in detail below, we believe that our new RNA-Seq analysis has uncovered a mechanism underlying the severe lung phenotype of Clasp1 KO mice, and that it has revealed the major cell types affected in embryonic Clasp1 KO lungs.

      Brief summary of experiments

      In the first version of the manuscript we used Gene Set Enrichment Analysis (GSEA, see https://www.gsea-msigdb.org/gsea/index.jsp) to compare our RNA-seq results to publicly available scRNA-Seq datasets of cell type signature gene sets, which contain cluster marker genes for cell types identified in single-cell sequencing studies of human tissue. As stated in our manuscript, this revealed “enrichment of alveolar epithelial type I cells and lung capillary intermediate cells in WT lungs ….”. However, the analysis was restricted to what is available in the Gene Set Enrichment Analysis database of the University of San Diego. Thus, we could only compare our embryonic mouse lung data to adult human lung scRNA-Seq data.

      We recently discovered publicly available scRNA-Seq datasets of the mouse lung (see https://research.cchmc.org/pbge/lunggens/mainportal.html and https://lungcells.app.vumc.org). The data in these portals are not part of the common GSEA sets of the University of San Diego. In particular the LGEA web portal is very easy to use and data can be downloaded for individual applications. In the new version of our manuscript we compared our RNA-Seq data to scRNA-Seq data of the embryonic mouse lung, focussing on E18.5. We first overlaid differentially expressed genes in Clasp1 KO lungs with LGEA E18.5 scRNA-seq gene signatures for different cell types, and we subsequently compared all the genes in our dataset with the gene signature lists, using custom-built gene signature sets and the GSEA software. In addition, we interrogated LGEA to find out which signature genes are specifically turned on from E16.5-E18.5 in the different cell types in the developing mouse lung. We found, for example, that Chil1, which is the most severely down-regulated gene in our Clasp1 KO RNA-Seq, is a very prominent AT2 signature gene; Chil1 is hardly expressed at E16.5 and prominently comes up at E18.5.

      Our combined analysis strongly suggests that four cell types (AT1, AT2, endothelial cells (EC), and immune cells (IC)) are affected in their differentiation in the Clasp1 KO lung, and that this defect occurs in the later stages of lung development (from E16.5 onward). As the top five differentially down-regulated genes in KO lungs (including Chil1) are all top signature genes of AT2 cells, these data strongly suggest that it is this cell type that is most affected in the KO. A Metascape analysis (which includes a GO enrichment analysis, see also our specific answer to comments of reviewer #3 below) is consistent with the scRNA-Seq comparison and suggests, among others, that the secretory pathway might be hampered in the Clasp1 KO. This analysis furthermore indicates that cholesterol metabolism might be affected in the Clasp1 KO, which bears relevance to our dexamethasone rescue experiments.

      Specific remarks to the comments of the reviewers

      Reviewer #1.

      • *

      Comment:

      p.6: What is the justification to mention Nfib, Pdpn and Ndst1 mutant mice in the introduction? Do these genes have any cellular/molecular/functional relation with CLASP1?

      Answer:

      We initially wanted to provide examples of genes important for lung maturation, whose absence in knockout mice leads to lung collapse. Of the examples provided Pdpn (which is equal to the marker T1a) bears a relation with our data in that it is down-regulated in Clasp1 KO lungs (see Table S2, RNA-Seq); furthermore, we examined T1a localisation in IF stainings (see old Figure 7E). In the new version of the manuscript we modified this Introduction section, to better align with our recent results, and to introduce the papers mentioned by reviewer #3 (Nelson et al., 2017; doi:10.1242/dev.154823, Li, J. et al., Dev Cell, 44, 297-312 e5.), who points out that pressure plays an important role in lung development. In the Li et al manuscript Pdpn is mentioned as being expressed at E16.5 in so-called Id2+ cells, together with Sftpc. These cells are proposed to be the precursors of the AT1/2 epithelial cells that arise later.

      Comment:

      p.8: It is mentioned that CLASP1 is expressed in secretory cells of the lung. Which ones? Is CLASP1 expressed in nerves, muscle cells and/or fibroblasts of the diaphragm? These information are important according to the phenotypes described.

      Co-immunolabelling experiments should be done.

      Answer:

      We apologize for our incorrect phrasing. With respect to the lung, we now state that “CLASP1 is expressed in the endothelium of blood vessels, as well as in all cells lining the airways of mouse lungs at E18.5 (Fig. 1A)”.

      Comment:

      p.11: To identify the cause of the respiratory failure, the authors looked at the innervation pattern of the phrenic nerve in the diaphragm. Mutants present decreased branching but larger nerve extensions covering a wider innervated area and less neuromuscular junctions. Despite the decreased innervation of the diaphragm, its morphology is normal as well as the ultra-structure of the sarcomeres suggesting a mild phenotype rather than the cause of death of the mutants as suggested by the authors (p.20).

      Diaphragmatic muscle activity should be measured to establish if the contractile activity of the diaphragm is affected. This might support the statement of the authors.

      Answer:

      We thank the reviewer for these observations. We agree with the reviewer and have toned down our conclusions in this section. We now simply describe the innervation pattern because we believe it is interesting, and we tentatively conclude that it may contribute to the severe respiratory phenotype which is primarily due to impaired AT1/2, EC, and IC differentiation.

      Comment:

      p.13: The authors examined lung from mutants. Mutant lungs do not float and they are collapsed at birth. However, lung morphology appears normal and myofibroblasts, ciliated cells and Club cells are present as shown by IHC labeling. No difference in proliferation and apoptosis was reported.

      It would have been more informative to do BrdU/EdU immunolabeling for proliferation in order to see if differences occur in specific cell types of the lung. It is not clear why the authors have limited their IHC analysis to these three specific cell types. A complete analysis should be done.

      Answer:

      As described above (general remarks), we compared our RNA-Seq data to publicly available scRNA-Seq data from the developing mouse lung (see new Figure 6). These comparisons reveal which cell types are affected in the Clasp1 KO lung (AT1/2, EC, IC), and which process might be hampered.

      Comment:

      p.14: The authors proposed a delay in lung development according to lung morphology that appears more collapsed starting at E15.5.

      Measurement of branching would allow to quantify this delay. Since cell differentiation occurs ~E16.5, analysis of the onset of cell types can also support a delay in lung development.

      Answer:

      As described above (general remarks), we compared our RNA-Seq data to publicly available scRNA-Seq data from the developing mouse lung (see new Figure 6). This not only revealed which cell types are affected in the Clasp1 KO lung, but also suggest that a differentiation block occurs at E16.5 to E18.5. For example, Chil1, a top AT2 signature gene of E18.5, is hardly expressed at E16.5 and is strongly upregulated at E18.5. This gene fails to become up-regulated in the Clasp1 KO, indicating that epithelial precursor cells have problems differentiating to AT2 type cells. By contrast, Id2, a marker of precursor epithelial cells, is normally expressed in the Clasp1 KO, and two genes that are co-expressed with Id2 in these precursor cells (Pdpn and Sftpc) are slightly down and up, respectively, in the Clasp1 KO. Thus, while our lung morphology studies might suggest early defects, our RNA-Seq indicates that specific defects occur during the late terminal saccular stage, i.e. from E16.5 onward. We therefore agree with with Negretti et al (2021, doi: 10.1242/dev.199512, Discussion section) who state: the developmental stages of the lung are largely founded on histologically descriptive features. While this is important, such a categorization often results in debate regarding the function and identity of cell types within the boundaries of each stage. By contrast transcriptome analysis suggests that different cell types commit to change asynchronously during development, suggesting that the timing of the saccular-to-alveolar transition is fluid and highly cell-type specific.

      As shown by Li et al (2018, doi.org/10.1016/j.devcel.2018.01.008) mechanical forces contribute to embryonic lung alveolar epithelial cell differentiation. Interestingly, RNA-Seq data from Nelson et al (2017; doi:10.1242/dev.154823) suggest that CLASP1 is a “pressure sensing gene” (see also below, our answer to comments of reviewer #3). Thus, Clasp1 KO lungs might fail to properly sense pressure, which could explain, at least in part, the observed failure in epithelial differentiation.

      Comment:

      p.15: Finally, the authors conclude this section by "these data support a direct role for CLASP1 in lung maturation".

      Which direct role? How? This sentence appears premature according to the data presented. The authors should look at microtubule dynamics in lung cells from mutant embryos to see if a link exists between the proposed role of the protein and the lung phenotype observed.

      Answer:

      The reviewer is correct, knockout studies can not demonstrate a direct role of a protein in a perturbed process. We have therefore removed the word “direct” from this phrase.

      Comment:

      p.15: The authors attempted to rescue the defective lung maturation phenotype by treating pregnant females with dexamethasone at late gestational stages. Around 10% of mutants survive for more than 45 minutes to 2 hrs compared to 20-30 minutes for mutants obtained from untreated mothers (p.9). Even though it is an intriguing result, the very small numbers of "survivors" makes very difficult to reach a conclusion.

      This section should be shortened.

      Answer:

      Our new Metascape analysis, which will be presented in the new Figure 8, suggests that cholesterol metabolism is affected in the Clasp1 KO mice. Cholesterol is an important component of mammalian cell membranes, of both alveolar and lamellar body surfactant, and it is a precursor of vitamin D and steroid hormones. A cholesterol defect would explain the partial rescue by dexamethasone in the Clasp1 KO, i.e. dexamethasone can rescue a steroid hormone defect but it cannot rescue other defects (e.g. surfactant production). Given these new results we decided to leave the section on glucocorticoids as it is and come back to it when we discuss the Metascape result in the revised manuscript.

      Comment:

      p.16: To determine which molecular mechanisms are responsible for the lung defect, the authors performed RNA-seq analysis on E18.5 lung specimens. The number of genes with significant differential expression was low and the highest scores were cathepsin E for the upregulated gene and chitinase-like 1 for the downregulated gene.

      Are these two genes known for their role in lung development? Please describe.

      Answer:

      The Ctse gene, which encodes Cathepsin E, is indeed the most upregulated gene in the Clasp1 KO. Although it is up-regulated in all three KO mice, Ctse expression is quite low (normalised counts: ~2 in KO, up from ~0.2 in WT). Based on the comment of this reviewer we examined Ctse expression in the scRNA-Seq lung repositories, but we could not find any description, presumably because its expression is too low (scRNA-Seq has difficulty catching low abundance genes), consistent with our data. Furthermore, there is not much literature on the role of Cathepsin E in the lung. We therefore decided to remove any mention of Ctse in the manuscript. By contrast, the expression and function of Chil1 are described in detail.

      Comment:

      p.16: Except for the fact that Chil1 is also downregulated in mutant lungs for the H3K4 methyltransferase Mll3 gene, it is not clear why the authors compared these 2 sets of data.

      Can CLASP1 and MLL3 interact together? How? Did the authors looked at the list of genes that are commonly differentially expressed? Does it provide some clues on the mechanisms? The RNA-seq data should be analyzed more deeply.

      Answer:

      The reviewer is correct, we compared the Mll3 (i.e. Kmt2c) RNA-Seq dataset because Chil1 is down-regulated in the Mll3 KO lung at E18.5, like in the Clasp1 KO. To examine a possible relation between Mll3 and Clasp1 in more detail, we overlaid the differentially expressed genes from the Mll3 dataset with the custom-built gene signature dataset of E18.5 lung (described above). The data suggest that Mll3 knockout affects AT1 differentiation (see new Supplementary Figure S6C). This mode of action is clearly different from that of CLASP1, and since Mll3 is nuclear and CLASP1 is cytoplasmic we do not believe these proteins interact. Given our new and exciting data on the Clasp1 KO lung phenotype, we moved the Mll3 data to the new Supplementary Figure 6, and only briefly we touch upon these data in the manuscript.

      Comment:

      p.16: There is also a Clasp2 gene with a more restricted expression pattern. Clasp2 mutant mice either die from hemorrhages or survive. It is not clear why the RNA-seq data of the lungs from Clasp2-/- mice are presented since no lung phenotype is mentioned for these mice. How the lack of change in Chil1 expression in Clasp2 mutant lungs is informative?

      This should be clarified or the data should be removed.

      Answer:

      The reviewer is correct, i.e. in light of our new findings (Chil1 is a top signature gene of E18.5 AT2 cells) it makes little sense to include the Clasp2 KO RNA-Seq data, as these were generated in adult mouse lungs. We therefore removed these data from the manuscript.

      Comment:

      p.31: The authors mentioned a role for CLASP1 in the mesenchyme.

      What are the experiments and data that support this sentence?

      Answer:

      We thank the reviewer for this remark, we have no evidence for a role of CLASP1 in the mesenchyme and have removed this phrase.

      Comment:

      How do the authors reconcile their observation of CLASP1 expression in lung secretory cells (p.8) with their conclusion of defective Type I cell differentiation (p.17)?

      Answer:

      We apologize for our incorrect phrasing. With respect to the lung, we now state that “CLASP1 is expressed in the endothelium of blood vessels, as well as in all cells lining the airways of mouse lungs at E18.5 (Fig. 1A)”.

      Reviewer #2.

      Comment:

      Fig. 3. There is not a lot of detail how the analysis in B-E was done, and no primary data for the synaptic defects.

      Answer:

      We have removed these data from the manuscript.

      Reviewer #3.

      Comment:

      1. The authors showed significant reduction in the rib cage size and abnormal diaphragm innervation in Clasp1 KO. Mechanical properties play a crucial role in regulating lung development and maturation. So changes in intrathoracic space and pressure are a major limiting factor that impairs lung development and maturation (Nelson et al., 2017; doi:10.1242/dev.154823, Li, J. et al., Dev Cell, 44, 297-312 e5.). Answer:

      We thank the reviewer for these interesting papers and observations.

      Nelson et al (2017; doi:10.1242/dev.154823) devised a method to culture lung-on-a-chip where they can induce pressure in culture. They apply this to examine lung development and they also do RNA-Seq. Interestingly, they find that Clasp1 is down-regulated at high pressure compared to low pressure (log2FC 0.5, Clasp1 goes down ~1.5 fold in high pressure). Thus Clasp1 appears to be a “pressure-responsive gene”. However, Nelson et al examine gene expression at much earlier time points than we do (E12-14 versus E18.5). In our view it therefore makes little sense to compare RNA-Seq data.

      Li et al (2018 doi.org/10.1016/j.devcel.2018.01.008) show that mechanical forces help to control embryonic lung alveolar epithelial cell differentiation. More specifically, mechanical force from amniotic fluid inhalation ensures AT1 cell differentiation, whereas FGF10-mediated ERK1/2 signaling induces a protrusive structure in some cells that protects from mechanical force-caused flattening to specify AT2 fate. They conclude that future AT2 cells can “embed” into mesenchyme by exerting an acto-myosin based force and hence they can keep their cuboidal shape. The differentiation of the two cell types occurs at different time points, E16.5 for AT2, and E17.5 for AT1. In this manuscript they also mention that Id2+ tip cells express pro-SPC and Pdpn (which are up and down, respectively, in Clasp1 KO). These Id2+ cells would be the AT1/2 progenitors.

      We believe that a smaller ribcage in the Clasp1 KO does not necessarily have to be a cause of increased pressure on the lung, if the lung is also smaller. Nonetheless, since CLASP1 is a “pressure-responsive gene”, Clasp1 KO lungs might experience aberrant pressure sensing (in addition to a possible pressure difference due to a smaller ribcage). This different sensing predicts altered differentiation pathways, which is exactly what we see. We have modified the revised version of the manuscript to reflect these thoughts and observations.

      Comment:

      Since CLASP1 was found to be highly expressed in the lung endothelium (Figure 1A), this suggests the importance of CLASP1 in the lung vasculature. GSEA analysis also showed significant downregulation of genes from the lung capillary intermediate 1 cell signature gene set in Clasp1 KO (Figure 7G). Extensive crosstalk between the lung endothelium and other lung cell types is critical for the regulation of lung development. However, no further investigation was carried out to elucidate this.

      Answer:

      We have performed a new comparison, which is extensively discussed above and shows that EC are affected in the Clasp1 KO lungs, as predicted by this reviewer. We will discuss crosstalk between cell types in the new version of the manuscript.

      Comment:

      Analysis of RNA-Seq data needs to be re-written. Pathway or GO enrichment was not performed. Although the authors have identified a number of key DEGs, only Chil1 was investigated. It is also unclear how it led the authors to identify Mll3 KO experiment on the Omnibus repository. A list of overlapped genes between Mll3 KO dataset and Clasp1 KO dataset were not provided. Aqp5 (AT1 marker gene) that authors claimed to be significantly reduced in Clasp1 KO is not on the DEGs list (Table S2).

      Answer:

      We initially focused on Chil1 because its expression is almost completely abrogated in all three Clasp1 KO lungs. The identification of the Mll3 dataset was coincidental; we mentioned it because Chil1 is also affected in these KO mice. A Venn diagram of overlapping significantly deregulated genes in both datasets is shown in the new Figure S6 of the revised manuscript. However, this analysis has been superseded by the new comparison with scRNA-Seq data from the LGEA web portal. As extensively explained above this new analysis provides a satisfying explanation for the lack of Chil1 in Clasp1 KO lungs. We also performed a Metascape analysis (which includes pathway and GO enrichment analyses), which will be included in the revised version of this manuscript. Finally, the reviewer is correct that Aqp5 is not in the DEGs list, this is because the adjusted p-value did not reach the required significance. We nevertheless showed its RNA-Seq values, first because the p-value is significant, second, because RT-PCR experiments confirm it to be down-regulated, and third, because Aqp1 (another AT1 marker) is also deregulated (with an adjusted p-value that is significant). In the revised manuscript we will examine Aqp5 levels by IF staining.

      Comment:

      There is a lack of cohesion between the experimental findings presented in the paper and the RNA Seq data analysis. Pathway or GO enrichment was not performed for the DEGs the authors identified. This would help identify the key functions of the deregulated genes in Clasp1 KOs and provide a fuller picture of what pathways/biological processes are dysregulated in the absence CLASP1. Instead, the authors have focused on one single gene, Chil1 in the subsequent analysis. The authors infer that overlapped DEGs between Mll3 KO and Clasp1 KO mean that same cell types or signalling pathways are affected in embryonic lungs of Mll3 and Clasp1 KO, this is an overinterpretation. A list showing the overlap in DEGs between Mll3 KO dataset and Clasp1 KO dataset should be provided.

      Answer:

      We have improved our RNA-Seq analysis and we have performed a Metascape analysis, which includes pathway and GO enrichment analyses. Results are shown in the new Figures 6 and 8. The Metascape analysis indicates which pathways/biological processes are deregulated in the absence CLASP1. We observe, for example, defects in endocytosis, and cholesterol metabolism. Given the new data, we decided to pay less attention to the Mll3-CLASP1 comparison.

      Minor comments:

      1. Figure 1A - please label the specific cell types to aid visualisation.
      2. Figure 6B - present the Log2FC for KO vs WT instead of WT vs KO to facilitate data visualisation and interpretation
      3. Figure 6E - provide the overlapping genes in a list and include it as a supplementary table
      4. Figure 7D and 7F - Quantification is needed
      5. The statistical tests used should be added to the figure legends.
      6. There is some wording in the manuscript that is either unclear or inaccurate, please carefully check the manuscript. e.g. manuscript refers to alveolization- I would recommend changing this to the more widely used terms alveolarization or alveologenesis. The manuscript refers to 'catastrophe rate'- this term needs to be defined. Answers:

      7. This has been done.

      8. This has been done.
      9. This panel has been moved to a Supplementary Figure, as the analysis is less relevant now we will not provide the list.
      10. This will be done.
      11. This has been/will be done.
      12. This has been done. The term “catastrophe rate” has been removed.
      13. *

      4. Description of analyses that authors prefer not to carry out

      General remarks

      Based on the comments of reviewers #2 and #3 we have decided to fully focus our revised manuscript on the lung phenotype of the Clasp1 KO mice. We still do show the results on the ribcage (Figure 2) and diaphragm (Figure S2) because they might enhance the severity of the lung phenotype. We have decided not to carry out extra “non-lung” experiments.

      Specific remarks to the comments of the reviewers

      Reviewer #1.

      Comment

      p.10: Homozygous mutants are smaller. The authors reported minor skeletal phenotypes small rib cage and delayed ossification in sternum and occipital bone.

      The number of specimens analyzed was not mentioned rendering difficult to establish if these observations are important or not. Stats should be included.

      Answer:

      Whereas the results of Figure 1H, I (growth deficits at E15.5 and PN1) are based on analysis of multiple animals, the embryonic skeleton data presented in Figure 2 are based on single mouse comparisons, i.e. one WT and one KO. Given the obvious growth deficit in the KO (Figure 1H, I) and the fact that gross morphological observation did not reveal a specific body part in the KO mice that is affected (Figure 1G), we were of the opinion that a representative comparison of the skeleton is allowed and we therefore kept Figure 2 intact. Since we focus in the revision on the lung phenotype, we have decided against examining the skeletons of more mice. We are willing to remove Figure 2, or make it Supplemental, if the reviewer feels that the skeletal phenotype is too prominently displayed.

      Comment:

      p.10: The authors established MEF used to study cell division. Multipolar spindles and additional centrosomes were detected in mutant cells.

      No stats were provided to establish if the differences in numbers are significant. According to the authors, the cell division defects may explain the smaller size of mutants. The authors should check proliferation in MEF. The sentence of conclusion is not well supported according to the data presented.

      Answer:

      Based on the advice of reviewer #2, who states “I think it would be best to better focus the paper on the lung phenotype”, we have decided to remove the mitotic data on MEFs.

      Comment:

      p.12: The authors looked at the growth capacity of motor neurons and dorsal root ganglion neurons and showed a reduced growth in both cases.

      How do the authors reconcile the observation made in the diaphragm in which nerve extensions are larger with the reduced growth capacity of neurons?

      Answer:

      We thank the reviewer for this remark, which is difficult to address, as CLASPs are expressed at different levels in neurons and as different isoforms, which may even have antagonistic functions. For example, in our recent publication (Sayas et al, 2019, DOI: 10.3389/fncel.2019.00005) we find through RNA-Seq that in cultured hippocampal neurons (3DIV) Clasp2β/γ levels are increased compared to Clasp2α-mRNA and that both in hippocampal and in DRG neurons Clasp2 mRNA levels are higher than Clasp1. As CLASP2b/g have a different function compared to CLASP2a, it is conceivable that absence of CLASP1 leads to different effects due to different CLASP2 activities. However, we recognize that these are speculations. Because of this and because reviewer #2 advices against inserting the neuronal data, we have decided to completely remove these results from the manuscript.

      Comment:

      p.12: The authors used cultured hippocampal neurons for imaging microtubule growth. According to the authors, the loss of CLASP1 deregulates microtubule dynamics.

      No explanation was provided to justify the use of hippocampal neurons. What is a catastrophe rate? What is the justification to study this parameter? What does it tell us about microtubule dynamics?

      Answer:

      Although we have decided to remove the neuronal data from the revised manuscript, we would like to address this comment nonetheles. Hippocampal neurons are often used in the field, hence they represent a “golden standard”. Furthermore, the techniques to examine microtubule dynamics are well established in this system. Dynamic microtubule behaviour is described using five parameters: growth rate of microtubules, shrinkage rate of microtubules, catastrophe and rescue frequencies (the conversion of growth to shrinkage or from shrinkage to growth, respectively), and pauzing times. The marker used in our studies (EB3-GFP) accumulates at the ends of growing microtubules, allowing us to measure growth rate and the duration of a growth event. The latter is the inverse of the catastrophe frequency. Hence, using EB3-GFP we are able to examine two of the five parameters. Although this is not complete the parameters do allow us to draw (speculative) conclusions. For example, a higher growth rate indicates that free tubulin concentration is higher, as tubulin concentration is a main determinant of growth rate. This in turn means that there are less microtubules (tubulin must come from somewhere). If this correlates with the catastrophe frequency (which should be higher) than one can conclude that CLASP1 is a microtubule-stabilising protein.

      Reviewer #2.

      Comment:

      Fig. S1. It would be good to indicate the number of cells / experiments analyzed. In panel D, there is only one multi-nucleated cell, which without further analysis does not mean much. The authors correlate this mitotic defect with smaller animal size although this connection is not at all conclusive. If both CLASPs are important for mitosis, do CLASP2 KOs have similar size defects? It is also mentioned above that CLASP1 KOs show microcephaly. Are there fewer neurons that might also be linked to a stem cell division defect? I understand that this is not the central point of the paper and important to include given previous work on CLASPs, but it would be good to discuss a little clearer. It seems the authors do not think this is the/a cause of the lung phenotype, but can that be completely excluded?

      Answer:

      Based upon suggestions of this reviewer (for example: “I think it would be best to better focus the paper on the lung phenotype”) we will not address this comment beyond a statement that Clasp2 knockout mice are indeed also smaller.

      Fig. 4. Please indicate n of cells / experiments and statistics in the figure legend. In panel B and C, it would help to include the time on the figure itself and to scale the y-axis the same to better illustrate differences. It is very hard to see much in panel D. The quantifications in E and F do not make sense. How can the total neurite length (average of many neurons?) be larger than the longest neurite length?

      The switch to MT dynamics in Fig. 4 is very abrupt and the relevance is unclear. Where were these kymographs located in the neuron (growth cones or neurites)? Primary data needs to shown here. The changes in catastrophe frequency are not that large and I doubt this can be accurately measured from kymographs as shown. Yes, MTs are important in neurite growth, but the potential link here is very vague. Are similar changes in MT dynamics also seen in the MEFs?

      Minor:

      Answer:

      See above, we will not address these comments, since we will remove these data.

      Reviewer #3.

      Comment:

      The lung morphological difference and disrupted lung cell differentiation in Clasp1 KO could be secondary to the biomechanical defects. This is crucially important but is not addressed in this study, ex vivo lung culture may help to answer this question.

      Answer:

      While the experiments suggested by this reviewer are interesting, we do not have sufficient expertise (nor the equipment) to carry out such specialised experiments.

      Comment:

      CLASPs are known to regulate directed cell migration (Myer and Myers 2017, doi: 10.1242/bio.028571) and this is a key process required for lung morphogenesis. Experiments to address whether directed cell migration is affected should be conducted in Clasp1 KO mice.

      Answer:

      We agree that migration assays would be interesting to perform. However, again, we do not have the expertise to do such assays in the developing lung. Experiments in MEFs are possible, and indeed, we previously showed a role for CLASP2 in directed cel migration in MEFs (DOI: 10.1016/j.cub.2006.09.065). However, lung epithelial cells are different from MEFs, and we have shown that CLASPs have cell type- (and isoform-)specific functions. Reviewer #2 actually advised us to focus on the lung phenotype.

      Comment:

      Higher magnification images of staining for microtubule associated proteins in neurons is required to show the details of the defects.

      Answer:

      Based on the reviewers’ advice we decided to take out the neuronal data and focus the manuscript on the lung phenotype.

    1. Author Response

      Reviewer #2 (Public Review):

      The authors performed a series of impressive experiments to systematically establish each part of their CRISPRi method. They provided one of the most compact design of CRISPRi dual-guideRNA library, with a genome-wide coverage; they confirmed prior finding on the optimal repressor domain to generate a set of useful vectors for expressing the repressor; they showcased the usage of the system in multiple common cancer cell lines. The authors also took an important step towards providing a detailed and well-annotated protocol (in the supplementary materials) to help users of their methods. The items listed below would be helpful to further improve this work:

      First, while the dual guideRNA design is a useful development, the author also noted the significant rate (~30%) recombination between the two sgRNAs. This should be further discussed and evaluated in the manuscript to help readers understand the implication of this high recombination rate. For example, across replicate experiments or across cell types tested, would the recombination be stochastic, or there may be some bias of which guide would be recombined? Are there any cell-type dependencies here in terms of the recombination rate? This would also help future users to decide if they would need to check for this effect during functional screening.

      We agree that recombination is an important limitation of dual-sgRNA screens. We included additional analyses and data in the revised manuscript to help readers understand the implications of the observed recombination.

      First, we performed growth screens using dual-sgRNA libraries in two additional cell lines (RPE1 and Jurkat) to address the potential cell type specificity of lentiviral recombination. We cloned a dual-sgRNA library targeting DepMap Common Essential genes (n=2291 dual-sgRNA elements). We transduced cells with this library, harvested cells at day 7 post-transduction, amplified sgRNA cassettes from extracted genomic DNA, and sequenced to quantify sgRNA recombination rates. We found similar recombination rates of dual-sgRNA constructs isolated from these three cell types (observed K562 recombination rate 29%; observed RPE1 recombination rate 26%; observed Jurkat recombination rate 24%).

      Next, we compared the recombination rates of each dual-sgRNA element. Our expectation was that lentiviral recombination would be largely stochastic per element based on the known mechanism of lentiviral recombination previously discussed in Adamson et al. 2018 (https://www.biorxiv.org/content/10.1101/298349v1.full) given that the constant region between sgRNAs (400bp) far exceeds the length of sgRNA targeting regions (20bp). However, we would also expect apparent recombination rates to be artificially inflated for dual-sgRNAs with strong growth phenotypes, as the stronger growth phenotypes of unrecombined dual-sgRNAs compared to recombined dual-sgRNAs will lead to dropout of unrecombined dual-sgRNAs. To account for this bias, we began by comparing the recombination rate for non-targeting control dual-sgRNAs excluding those with growth phenotypes across replicates of our K562 screens. There was only a weak correlation between the recombination rate for non-targeting control dual-sgRNAs (r = 0.30; Figure 1 – Figure Supplement 1E). We next compared the recombination rates of all dual-sgRNA elements (both targeting and non-targeting) across replicates of our K562 screens. As expected, we observed that the recombination rate of elements was correlated across replicates (r = 0.77; Figure 1 – Figure Supplement 1F), and the recombination rate was strongly anticorrelated with the growth phenotype of dual-sgRNAs in K562 cells (r = -0.84; Figure 1 – Figure Supplement 1G). We have integrated these data into the manuscript.

      Second, on the repressor development and evaluation. As the author mentioned in the text, the expression level of the repressor can confound their conclusion on fitness/efficiency comparisons of CRISPR repressor. Thus, it would be helpful to perform protein level validation using the cell lines they generated, such as a WesternBlot comparison to rule out this potential issue.

      We agree that differences in expression levels of the effectors can confound comparisons and that Western Blotting for such differences would be valuable. That said, any such analyses would not substantively alter the main claim of our paper, which is that Zim3-dCas9 provides excellent on-target knockdown in the absence of non-specific effects on cell growth or gene expression. This finding is of immediate practical use to the community. By no means are we claiming that we eliminated all possible confounding factors nor do we think that it is possible to do so. To avoid overstating our findings, we had acknowledged in the discussion that expression levels may indeed be a confounding factor, we had noted in the methods section that the dCas9-MeCP2 effector uses a different coding sequence for dCas9, which may contribute to differences in expression, and we had noted that other effectors may prove useful in some settings. We have further emphasized that differences in expression levels may contribute to our results in the revised manuscript.

      This work would also benefit from including cell proliferation/viability measurement using the selected Zim3-dCas9 repressor in multiple cell lines, as it seems this was only done initially in K562 cells. As authors noted, the fitness effects of the CRISPR repressor would be a major concern when performing functional genomics screening, so such validation of fitness-neutrality of the repressor can be very helpful for potential users of their method and approach.

      To address this point, we assessed the proliferation of HepG2, HuTu-80, and HT29 cells expressing Zim3-dCas9. Expression of Zim3-dCas9 did not have a negative impact on proliferation in any of these cell types, providing further evidence that the Zim3-dCas9 will be broadly useful. We included these data in Figure 4 – Figure Supplement 2 in the revised manuscript. That said, we cannot rule out that expression of Zim3-dCas9 may be detrimental in other cell types. Indeed, we want to emphasize that users should evaluate both on-target knockdown and lack of non-specific effects of effectors in new cell models before proceeding to large-scale experiments. The assays and protocols we describe are ideally suited for this purpose. We have further emphasized this point in the discussion section to guide users.

      Third, a major resource from this work, as the authors noted, is a suite of useful Zim3-dCas9 cell lines. The authors have performed a set of experiments to demonstrate the knockdown efficiency with dozens of guideRNAs. While this is a good initial validation, to really ensure the cell lines are performing as expected, a small scale screening in pooled fashion will be more convincing. This would be a setting more relevant for potential readers, given that pooled screening would likely be the most powerful application of these cell lines.

      While conducting the work described in this manuscript, we had used the Zim3-dCas9 RPE1 cell line for a large-scale pooled screen with single-cell RNA-seq readout (Perturb-seq, Replogle et al. 2022). Across greater than 2000 target genes, the median knockdown was 91.6%, which provides strong validation that Zim3-dCas9 performs as expected in this cell line. We had noted this point in the discussion section of our manuscript.

    1. Author Response

      Reviewer #1 (Public Review):

      The authors focused on linking physiological data on theta phase precession and spike-timing-dependent plasticity to the more abstract successor representation used in reinforcement learning models of spatial behavior. The model is presented clearly and effectively shows biological mechanisms for learning the successor representation. Thus, it provides an important step toward developing mathematical models that can be used to understand the function of neural circuits for guiding spatial memory behavior.

      However, as often happens in the Reinforcement Learning (RL) literature, there is a lack of attention to non-RL models, even though these might be more effective at modeling both hippocampal physiology and its role in behavior. There should be some discussion of the relationship to these other models, without assuming that the successor representation is the only way to model the role of the hippocampus in guiding spatial memory function.

      We thank the reviewer for the positive comments about the work, and for the detailed and constructive feedback. We agree with the reviewer that the manuscript will benefit from significantly more discussion of non-RL models, and we’ve detailed below a number of modifications to the manuscript to better incorporate prior work from the hippocampal literature, including the citations the reviewer has listed. Since our goal with this paper is to contextualise hippocampal phenomena in the context of an RL learning rule, this is really important and we appreciate the reviewers recommendations. We have added text (outlined in the point-by-point responses below) to the introduction and to the discussion that we hope better demonstrates the connections between the SR and existing computational models of hippocampus, and communicates clearly that the SR is not unique in capturing phenomena such as factorization of space and reward or capturing sequence statistics, but is rather a model that captures these phenomena while also connecting with downstream RL computations. Existing RL accounts of hippocampal representation often do not connect with known properties of hippocampus (as illustrated by the fact that TD learning was proposed in prior work to be the learning mechanism for SRs, even though this doesn’t have an obvious mechanism in HPC), so the purpose of this work is to explore the extent to which TD learning effectively overlaps with the well-studied properties of STDP and theta oscillations. In that sense, this paper is an effort to connect RL models of hippocampus to more physiologically plausible mechanisms rather than an attempt to model phenomena that the existing computational hippocampus literature could not capture.

      1) Page 1- "coincides with the time window of STDP" - This model shows effectively how theta phase precession allows spikes to fall within the window of spike-timing-dependent synaptic plasticity to form successor representations. However, this combination of precession and STDP has been used in many previous models to allow the storage of sequences useful for guiding behavior (e.g. Jensen and Lisman, Learning and Memory, 1996; Koene, Gorchetchnikov, Cannon, Hasselmo, Neural Networks, 2003). These previous models should be cited here as earlier models using STDP and phase precession to store sequences. They should discuss in terms of what is the advantage of an RL successor representation versus the types of associative sequence coding in these previous models.

      We agree that the idea of using theta precession to compress sequences onto the timescale of synaptic learning is a long-standing concept in sequence learning, and that we need to be careful to communicate what the advantages are of considering this in the RL context. We have added these citations to the introduction:

      “One of the consequences of phase precession is that correlates of behaviour, such as position in space, are compressed onto the timescale of a single theta cycle and thus coincide with the time-window of STDP O(20 − 50 ms) [8, 18, 20, 21]. This combination of theta sweeps and STDP has been applied to model a wide range of sequence learning tasks [22, 23, 24], and as such, potentially provides an efficient mechanism to learn from an animal’s experience – forming associations between cells which are separated by behavioural timescales much larger than that of STDP.” and added a paragraph to the discussion as well that makes this clear:

      “That the predictive skew of place fields can be accomplished with a STDP-type learning rule is a long-standing hypothesis; in fact, the authors that originally reported this effect also proposed a STDP-type mechanism for learning these fields [18, 20]. Similarly, the possible accelerating effect of theta phase precession on sequence learning has also been described in a number of previous works [22, 55, 23, 24]. Until recently [40, 41], SR models have largely not connected with this literature: they either remain agnostic to the learning rule or assume temporal difference learning (which has been well-mapped onto striatal mechanisms [37, 56], but it is unclear how this is implemented in hippocampus) [54, 31, 36, 57, 58]. Thus, one contribution of this paper is to quantitatively and qualitatively compare theta-augmented STDP to temporal difference learning, and demonstrate where these functionally overlap. This explicit link permits some insights about the physiology, such as the observation that the biologically observed parameters for phase precession and STDP resemble those that are optimal for learning the SR (Fig 3), and that the topographic organisation of place cell sizes is useful for learning representations over multiple discount timescales (Fig 4). It also permits some insights for RL, such as that the approximate SR learned with theta-augmented STDP, while provably theoretically different from TD (Section 5.8), is sufficient to capture key qualitative phenomena.”

      2) On this same point, in the introduction, the successor representation is presented as a model that forms representations of space independent of reward. However, this independence of spatial associations and reward has been a feature of most hippocampal models, that then guide behavior based on interactions between a reward representation and the spatial representation (e.g. Redish and Touretzky, Neural Comp. 1998; Burgess, Donnett, Jeffery, O'Keefe, Phil Trans, 1997; Koene et al. Neural Networks 2003; Hasselmo and Eichenbaum, Neural Networks 2005; Erdem and Hasselmo, Eur. J. Neurosci. 2012). The successor representation should not be presented as if it is the only model that ever separated spatial representations and reward. There should be some discussion of what (if any) advantages the successor representation has over these other modeling frameworks (other than connecting to a large body of RL researchers who never read about non-RL hippocampal models). To my knowledge, the successor representation has not been explicitly tested on all the behaviors addressed in these earlier models.

      We agree – a long-standing property of computational models in the hippocampal literature is a factorization of spatial and reward representations, and we have edited the text of the paper to make it clear that this is not a unique contribution of the SR. We have modified our description of the SR to better place it in the context of existing theories about hippocampal contributions to the factorised representations of space and goals, and included all citations mentioned here by adding the following text.

      We have added a sentence to the introduction:

      “However, the computation of expected reward can be decomposed into two components – the successor representation, a predictive map capturing the expected location of the agent discounted into the future, and the expected reward associated with each state [26]. Such segregation yields several advantages since information about available transitions can be learnt independently of rewards and thus changes in the locations of rewards do not require the value of all states to be re-learnt. This recapitulates a number of long-standing theories of hippocampus which state that hippocampus provides spatial representations that are independent of the animal’s particular goal and support goal-directed spatial navigation[27, 28, 23, 29, 30]”

      We have also added a paragraph to the discussion:

      “The SR model has a number of connections to other models from the computational hippocampus literature that bear on the interpretation of these results. A long-standing property of computational models in the hippocampal literature is a factorisation of spatial and reward representations [27, 28, 23, 29, 30], which permits spatial navigation to rapidly adapt to changing goal locations. Even in RL, the SR is also not unique in factorising spatial and reward representations, as purely model-based approaches do this too [26, 25, 67]. The SR occupies a much more narrow niche, which is factorising reward from spatial representations while caching long-term occupancy predictions [26, 68]. Thus, it may be possible to retain some of the flexibility of model-based approaches while retaining the rapid computation of model-free learning.”

      3) Related to this, successes of the successor representation are presented as showing thebackward expansion of place cells. But this was modeled at the start by Mehta and colleagues using STDP-type mechanisms during sequence encoding, so why was the successor representation necessary for that? I don't want to turn this into a review paper comparing hippocampal models, but the body of previous models of the role of the hippocampus in behavior warrants at least a paragraph in each of the introduction and discussion sections. In particular, it should not be somehow assumed that the successor representation is the best model, but instead, there should be some comparison with other models and discussion about whether the successor representation resembles or differs from those earlier models.

      We agree this was not clear. This is a nuanced point that warrants substantial discussion, and we have added a paragraph to the discussion (see the paragraph in the response to point 1 that begins “That the predictive skew of place fields can be accomplished…”).

      4) The text seems to interchangeably use the term "successor representation" and "TD trained network" but I think it would be more accurate to contrast the new STDP trained network with a network trained by Temporal Difference learning because one could argue that both of them are creating a successor representation.

      We now refer to these as “STDP successor features” and “TD successor features”. We have also replaced all references of “true successor representation/features” to “TD successor representation/feature” and have edited the text at the beginning of the results section to reflect this:

      “The STDP synaptic weight matrix Wij (Fig. 1d) can then be directly compared to the temporal difference (TD) successor matrix Mij (Fig. 1e), learnt via TD learning on the CA3 basis features (the full learning rule is derived in Methods and shown in Eqn. 27). Further, the TD successor matrix Mij can also be used to generate the ‘TD successor features’...”

      Reviewer #2 (Public Review):

      The authors present a set of simulations that show how hippocampal theta sequences may be combined with spike time-dependent plasticity to learn a predictive map - the successor representation - in a biologically plausible manner. This study addresses an important question in the field: how might hippocampal theta sequences be combined with STDP to learn predictive maps? The conclusions are interesting and thought-provoking. However, there were a number of issues that made it hard to judge whether the conclusions of the study are justified. These concerns mainly surround the biological plausibility of the model and parameter settings, the lack of any mathematical analysis of the model, and the lack of direct quantitative comparison of the findings to experimental data.

      While the model uses broadly realistic biological elements to learn the successor representation, there remain a number of important concerns with regard to the biological plausibility of the model. For example, the model assumes that each CA3 cell connects to exactly 1 CA1 cell throughout the whole learning process so that each CA1 cell simply inherits the activity of a single CA3 cell. Moreover, neurons in the model interact directly via their firing rate, yet produce spikes that are used only for the weight updates. Certain model parameters also appeared to be unrealistic, for example, the model combined very wide place fields with slow running speeds. This leaves open the question as to whether the proposed learning mechanism would function correctly in more realistic parameter settings. Simulations were performed for a fixed running speed, thereby omitting various potentially important effects of running speed on the phase precession and firing rate of place cells. Indeed, the phase precession of CA1 place cells was not shown or discussed, so it is unclear as to whether CA1 cells produce realistic patterns of phase precession in the model.

      The fact that a successor-like representation emerges in the model is an interesting result and is likely to be of substantial interest to those working at the intersection between neuroscience and artificial intelligence. However, because no theoretical analysis of the model was performed, it remains unclear why this interesting correspondence emerges. Was it a coincidence? When will it generalise? These questions are best answered by mathematical analysis of the model (or a reduced form of it).

      Several aspects of the model are qualitatively consistent with experimental data. For example, CA1 place fields clustered around doorways and were elongated along walls. While these findings are important and provide some support for the model, considerable work is required to draw a firm correspondence between the model and experimental data. Thus, without a quantitative comparison of the place field maps in experimental data and the model, it is hard to draw strong conclusions from these findings.

      Overall, this study promises to make an important contribution to the field, and will likely be read with interest by those working in the fields of both neuroscience and artificial intelligence. However, given the above caveats, further work is required to establish the biological plausibility of the model, develop a theoretical understanding of the proposed learning process, and establish a quantitative comparison of the findings to experimental data.

      Thank you for the positive comments about the work, and for the detailed and constructive review. We appreciate the time spent evaluating the model and understanding its features at a deep level. Your comments and suggestions have led to exciting new simulation results and a theoretical analysis which shed light on the connections between TD learning, STDP and phase precession.

      We have incorporated a number of new simulations to tackle what we believe are your most pressing concerns surrounding the model’s biological plausibility. As such, we have extended the hyperparameter sweep (Supp. Fig 3) to include the phase precession parameters you recommended, as well as three new multipanel supplementary figures satisfying your recommendations (Supp. Figs. 1, 2 & 4). Collectively, these figures show that the specifics of our results, which as you pointed out might have been produced with biologically implausible values (place cell size, movement speed/statistics, weight initialisation, weight updating schedule and phase precession parameters), do not fundamentally depend on the specific values of these parameters: the mechanism still learns predictive maps close in form to the TD successor features. In the hyperparameter sweep, we do find that results are sensitive to specific parameter values (Supp. Fig 3), but that interestingly, the optimal values of these parameters are remarkably close to those observed experimentally. We have also written an extensive new theory section analysing why theta sequences plus STDP approximates TD learning. In addition the methods section has been added to and reordered to make some of the subtler aspects of our model (i.e. the mapping of rates-to-rates and weight fixing during learning) more clear.

      At a high level, regarding our claim of biological plausibility, we like to clarify our intended contribution and give context to some responses below. We have added the following paragraph to the discussion in order to accurately represent the scope of our work:

      “While our model is biologically plausible in several respects, there remain a number of aspects of the biology that we do not interface with, such as different cell types, interneurons and membrane dynamics. Further, we do not consider anything beyond the most simple model of phase precession, which directly results in theta sweeps in lieu of them developing and synchronising across place cells over time [60]. Rather, our philosophy is to reconsider the most pressing issues with the standard model of predictive map learning in the context of hippocampus (e.g., the absence of dopaminergic error signals in CA1 and the inadequacy of synaptic plasticity timescales). We believe this minimalism is helpful, both for interpreting the results presented here and providing a foundation for further work to examine these biological intricacies, such as the possible effect of phase offsets in CA3, CA1 [61] and across the dorsoventral axis [62, 63], as well as whether the model’s theta sweeps can alternately represent future routes [64] e.g. by the inclusion of attractor dynamics [65].”

    1. In our way of delivering orders we emphasise explaining the context two levels up. I may tell my soldiers to raid a compound, but I would also tell them that the reason for this is to create a distraction so that the Colonel can divert the enemy away from a bridge, and that the reason the Brigadier wants the Colonel to divert the enemy is so that the bridge is easier to cross. Not only do the soldiers then know why it’s important to raid the compound (so that others can cross the bridge), but they know that if for some reason they can’t raid the compound, creating any other diversion or distraction will do in a pinch, and if they can’t do that they can still try to do something to make it easier to cross the bridge. It lets everyone adapt to change as it happens without additional instruction if they aren’t able to get in touch with me. Again I think tech could possibly learn from that.

      def

    1. Author Response:

      eLife assessment

      This paper reports a useful set of results that uses a reduced network model based on a previously published large-scale network model to explain the generation of theta-gamma rhythms in the hippocampus. Combining the detailed and reduced models and comparing their results is a powerful approach. However, the evidence for the main claim that CCK+ basket cells play a key role in theta-gamma coupling in the hippocampus is currently incomplete.

      We thank the reviewers for their thorough and thoughtful notes, and we are pleased that there is acknowledgement of the combination of models as a powerful approach.  We agree with many of the comments made and we intend to address them in subsequent revisions. 

      In particular, we think that our ‘narrative’ as presented was perhaps not as clear as it could have been, based on the somewhat different comments from the reviewers (R#1 and #3).  That is, we created a reduced population rate model based on the theta/gamma generation hypotheses from the detailed model and then explored the PRM in more detail to predict cellular contributions.  The goal was not to validate the original detailed model per se (R#1) nor to do a fitting of parameters in the PRM directly from the detailed model (R#3).  Rather, it was to obtain a set of parameter values in PRM that would be in accordance with the hypotheses of the detailed model that could be fully explored to derive cellular-based predictions that could help design experiments to understand theta/gamma rhythms.

      Responses specific to the Reviewers are given below.

      Reviewer #1 (Public Review):

      This paper investigates potential mechanisms underlying the generation of hippocampal theta and gamma rhythms using a combination of several modeling approaches. The authors perform new simulation experiments on the existing large-scale biophysical network model previously published by Bezaire et al. Guided by their analysis of this detailed model, they also develop a strongly reduced, rate-based network model, which allows them to run a much larger number of simulations and systematically explore the effects of varying several key parameters. The combined results from these two in silico approaches allow them to predict which cell types and connections in the hippocampus might be involved in the generation and coupling of theta and gamma oscillations.

      In my view, several aspects of the general methodology are exemplary. In the current work as well as several earlier papers, the authors are re-using a large-scale network model that was originally developed in a different laboratory (Bezaire et al., 2016) and that still represents the state-of-the-art in detailed hippocampal modeling. Such model reuse is quite rare in computational neuroscience, which is rather unfortunate given the amount of time and effort required to build and share such a complex model. Very often, and also, in this case, the original publication that describes a detailed model provides only limited validation and analysis of model behavior, and the re-use of the same model in later studies represents a great opportunity to further examine and validate the model.

      Combining detailed and simplified models can also be a powerful approach, especially when the correspondence between the two is carefully established. Matching results from the two models, in this case, allow strong arguments about key mechanisms of biological phenomena, where the simplified model allows the identification and characterization of necessary and sufficient components, while the detailed model can firmly anchor the models and their predictions to experimental data.

      On the other hand, I have several major concerns about the implementation of these approaches and the interpretation of the results in the current study. First of all, the detailed model of Bezaire et al. is considered strictly equivalent, in all of its relevant details, to biological reality, and no attempt is made to verify or even discuss the validity of this assumption, even when particular details of the model are apparently critical for the results presented. I see this as a fundamental limitation of the current work - the fact that the Bezaire et al. model is the best one we have at the moment does not automatically make it correct in all its details, and features of the model that are essential for the new results certainly deserve careful scrutiny (preferably via detailed comparison with experimental data).

      An important case in point is the strength of the interactions between specific neuronal populations. This is represented by different quantities in the detailed and simplified model, but the starting point is always the synaptic weight (conductance) values given by Bezaire et al. (2016), also listed in Tables 2 and 3 of the current manuscript. Looking at these parameters, one can identify a handful of connections whose conductance values are much higher than those of the other connections, and also more than an order of magnitude higher (50-100 nS) than commonly estimated values for cortical synapses (normally less than about 5 nS, except for a few very special types of synapse such as the hippocampal mossy fibers). Not surprisingly, several of these connections (such as the pyramidal cell to pyramidal cell connections, and the CCK+BC to PV+BC connections) were found to be critical for the generation and control of theta and gamma oscillations in the model. Given their importance for the conclusions of the paper, it would be essential to double-check the validity of these parameter values. In this context, it is worth noting that, unlike the anatomical parameters (cell numbers and connectivity) that had been carefully calculated and discussed in Bezaire and Soltesz (2013), biophysical parameters (the densities of neuronal membrane conductances and synaptic conductances) in Bezaire et al. (2016) were obtained by relatively simple (partly manual) fitting procedures whose reliability and robustness are mostly unknown. Specifically for synaptic parameters in CA1, a more systematic review and calculation were recently carried out by Ecker et al. (2020); their estimates for the synaptic conductances in question are typically much lower than those of Bezaire et al. (2016) and appear to be more in line with widely accepted values for cortical (hippocampal) synapses.

      Furthermore, some key details concerning the construction of the simplified rate model are unclear in the current manuscript. The process of selecting cell types and connections for inclusion in the rate model is described, and the criteria are mostly clear, although the results are likely to be heavily affected by the problems discussed above, and I do not understand why the strength of external input was included among the selection criteria for cell types (especially if the model is meant to capture the internal dynamics of the isolated CA1 region). However, the main issue is that it remains unclear how the parameters of the rate model (the 24 parameters in Table 4) were obtained. The authors simply state that they "found a set of parameters that give rise to theta-gamma rhythms," and no further explanation is provided. Ideally, the parameters of the rate model should be derived systematically from the detailed biophysical model so that the two models are linked as strongly as possible; but even if this was not the case, the methods used to set these parameters should be described in detail.

      An important inaccuracy in the presentation of the results concerns the suggested coupling of theta and gamma oscillations in the models. Although the authors show that theta and gamma oscillations can be simultaneously present in the network under certain conditions, actual coupling of the two rhythms (e.g., in the form of phase-amplitude coupling) is not systematically characterized, and it is therefore not clear under what conditions real coupling is present in the two models (although a probable example can be seen in Figure 1C(ii)).

      The Discussion of the paper states that gamma oscillations in the model(s) are generated via a pure interneuronal (ING) mechanism. This is an interesting claim; however, I could not find any findings in the Results section that directly support this conclusion.

      Finally, although the authors write that they can "envisage designing experiments to directly test predictions" from their modeling work, no such experimental predictions are explicitly identified in the current manuscript.

      As noted above, our goal was not to validate the original detailed model but to carry out further analysis of the Bezaire model in its re-use, since as noted by this Reviewer, the original publication was limited in validation and analysis.  Further validation/extensions of Bezaire et al can be carried out given their acknowledged limitations (some as mentioned by the Reviewer).  However, as noted, more detailed models of CA1 microcircuitry now exist (Ecker et al 2020), and it would be interesting to examine whether and how these more detailed models might express theta/gamma rhythms.  In essence, we completely agree that all the details of the Bezaire et al model are not automatically correct.  We were using it as a biological proxy, albeit imperfect.  However, it is able to produce theta/gamma rhythms using parameter values that are experimentally derived in many ways (Bezaire & Soltesz 2013), and with minimal tuning, and thus our assumption is that it captures a potential ‘biological balance’ to generate these rhythms.  Hence, we carried out additional simulations and explorations to derive generation hypotheses that are “applied” in the development of the reduced population rate model (PRM).  The “ING” aspect is due to CCK+BCs and PV+BCs firing coherent gamma rhythms that are imposed onto the PYR cell population as mentioned in the Results.  Without PYR input, they still fire coherent gamma rhythms.  Experiments in which theta/gamma rhythms are characterized (CFC, frequencies)  with and without the presence of CCK+BCs would allow the main prediction of the modeling work to be explored – i.e., whether CCK+BCs are essential for the existence of these coupled rhythms.  We know from Dudok et al that there are alternating sources of perisomatic inhibition, but how they might control theta/gamma rhythms has not been explored to the best of our knowledge.

      We will more fully describe our process for PRM parameters in subsequent revisions as well as formally apply CFC metrics.

      Reviewer #2 (Public Review):

      The goal of this study is to find a minimal model that produces both theta and gamma rhythms in the hippocampus CA1, based on the full-scale model (FSM) of Bezaire et al, 2016. The FSM here is treated as equivalent to biological data. This seems to be a second part of a study that the same authors published in 2021, and is extensively cited here. The study reduces the FSM to a neural rate model with 4 neurons, which is capable of producing both rhythms. This model is then simulated and its parameter dependencies are explored.

      The authors succeed in producing a rate model, based on 4 neuron types, that captures the essence of the two rhythms. This model is then analyzed at a descriptive level to claim that the synapse from one interneuron type (CCK) to another (PV+) is more effective than its reciprocal counterpart (PV+ to CCK synapse) to control theta rhythm frequency.

      The results fall short on several fronts:<br /> The conclusions rely exclusively on the assumption that the FSM is in fact able to faithfully reflect the biological circuits involved, not just in its output, but in response to a variety of perturbations. Although the authors mention and discuss this assumption, in the end, the reader is left with a (reduced) model of a (complex) model, but no real analysis based on this reduction. In fact, the reduced model is treated in a manner that could have been done with the full one. Thus the significance of the work is greatly reduced not by what the authors do, but by what they fail to do, which is to properly analyze their own reduced model. Consequently, the impact of this study on the field is minimal.<br /> Related to the first point, throughout the manuscript, multiple descriptive findings, based on the authors' observations of the model output, are presented as causal relationships. Even the main finding of the study (that one synapse has a larger effect on theta than another) is not quantified, but just simply left as a judgment call by the authors and reader of comparing slopes on graphs.

      We agree with this Reviewer that analysis of the PRM is needed and is currently underway.  It will hopefully help us understand what ‘balances’ are essential for theta/gamma rhythm expression.  However, the overall goal of this work was not to “find” a minimal model per se, but rather to determine how theta/gamma rhythms in the hippocampus are generated (hence building on previous works).  However, it was important to use the detailed model (biological proxy – albeit imperfect – see response to Reviewer#1) to obtain hypotheses on which the PRM is based.  We do not envisage the minimal model as a `replacement’ for the detailed model in general, but rather, to show that using a combination approach (detailed and/or experimental observations with ‘derived’ reduced models) allows us to gain insight into cellular contributions to rhythm generation. Quantification of observations will be applied in subsequent revisions.

      Reviewer #3 (Public Review):

      While full-scale and minimal models are available for CA1 hippocampus and both exhibiting theta and gamma rhythms, it is not fully clear how inhibitory cells contribute to rhythm generation in the hippocampus. This paper aims to address this question by proposing a middle ground - a reduced model of the full-scale model. The reduced model is derived by selecting neural types for which ablations show that these are essential for theta and gamma rhythms. A study of the reduced model proposes particular inhibitory cell types (CCK+BC cells) that play a key role in inhibitory control mechanisms of theta rhythms and theta-gamma coupling rhythms.

      Strengths:<br /> The paper identifies neural types contributing to theta-gamma rhythms, models them, and provides analysis that derives control diagrams and identifies CCK+BC cells as key inhibitory cells in rhythm generation. The paper is clearly written and approaches are well described. Simulation data is well depicted to support the methodology.

      Weaknesses:<br /> The derivation methodology of the reduced model is hypotheses based, i.e. it is based on the selection of cell types and showing that these need to be included by ablation simulations. Then the reduced model is fitted. While this approach has merit, it could "miss" cell types or not capture the particular balance between all types. In particular, it is not known what is the "error" by considering the reduced model. As a result, the control plots (Fig. 5 and 6) might be deformed or very different. An additional weakness is that while the study predicts control diagrams and identifies CCK+BC cell types as key controllers, experimental data to validate these predictions is not provided. This weakness is admissible, in my opinion, since these recordings are not easy to obtain and the paper focuses on computational investigation rather than computationally guided experiments.

      This Reviewer has provided a succinct description of our work which we will leverage in subsequent revisions as we more fully describe our process – thank you.  We agree with the Reviewer that we could ‘miss’ cell types and not capture particular balances etc., as we based our PRM on hypotheses from the detailed model.  Our PRM and its reference parameter values are ‘designed’ based on hypotheses from our set of explorations of the detailed model, and we were able to determine particular predictions that can be experimentally explored.  Subsequent theoretical analyses will help us understand the required ‘balances’ but as noted above (see response to Reviewer#2), we are not aiming for a minimal model (in general), but rather to use such a combined approach (detailed model and/or experimental observations with ‘derived’ reduced models) to come up with (cellular-based) predictions underlying theta/gamma generation.  As noted by this Reviewer, specific inhibitory cell recordings are not easy to obtain and we hope our work would help with computationally guided experiments – i.e, even though the reduced model may ‘miss’ other aspects, it would hopefully capture some aspects that are biologically salient for consideration in experimental design and future detailed model explorations.

    1. Author Response

      Reviewer #1 (Public Review):

      This paper describes the results of a MEG study where participants listened to classical MIDI music. The authors then use lagged linear regression (with 5-fold cross-validation) to predict the response of the MEG signal using (1) note onsets (2) several additional acoustic features (3) a measure of note surprise computed from one of several models. The authors find that the surprise regressors predict additional variance above and beyond that already predicted by the other note onset and acoustic features (the "baseline" model), which serves as a replication of a recent study by Di Liberto.

      They compute note surprisal using four models (1) a hand-crafted Bayesian model designed to reflect some of the dominant statistical properties of Western music (Temperley) (2) an ngram model trained on one musical piece (IDyOM stm) (3) an n-gram model trained on a much larger corpus (IDyOM ltm) (4) a transformer DNN trained on a mix of polyphonic and monophonic music (MT). For each model, they train the model using varying amounts of context.

      They find that the transformer model (MT) and long-term n-gram model (IDyOM stm) give the best neural prediction accuracy, both of which give ~3% improvement in predicted correlation values relative to their baseline model. In addition, they find that for all models, the prediction scores are maximal for contexts of ~2-7 notes. These neural results do not appear to reflect the overall accuracy of the models tested since the short-term n-gram model outperforms the long-term n-gram model and the music transformer's accuracy improves substantially with additional context beyond 7 notes. The authors replicate all these findings in a separate EEG experiment from the Di Liberto paper.

      Overall, this is a clean, nicely-conducted study. However, the conclusions do not follow from the results for two main reasons:

      1) Different features of natural stimuli are almost always correlated with each other to some extent, and as a consequence, a feature (e.g., surprise) can predict the neural response even if it doesn't drive that response. The standard approach to dealing with this problem, taken here, is to test if a feature improves the prediction accuracy of a model above and beyond that of a baseline model (using cross-validation to avoid over-fitting). If the feature improves prediction accuracy, then one can conclude that the feature contributes additional, unique variance. However, there are two key problems: (1) the space of possible features to control for is vast, and there will almost always be uncontrolled-for features (2) the relationship between the relevant control features and the neural response could be nonlinear. As a consequence, if some new feature (here surprise) contributes a little bit of additional variance, this could easily reflect additional un-controlled features or some nonlinear relationship that was not captured by the linear model. This problem becomes more acute the smaller the effect size since even a small inaccuracy in the control model could explain the resulting finding. This problem is not specific to this study but is a problem nonetheless.

      We understand the reviewer’s point and agree that it indeed applies not exclusively to the present study, but likely to many studies in this field and beyond. We disagree, however, that it constitutes a problem per se. We maintain that the approach of adding a feature, observing that it increases crossvalidated prediction performance, and concluding that therefore the feature is relevant, is a valid one. Indeed, it is possible and even likely that not all relevant features (or non-linear transformations thereof) will be present in the control/baseline model. If a to-be-tested feature increases predictive performance and therefore explains relevant variance, then that means that part of what drives the neural response is non-trivially related to the to-be-tested feature. The true underlying relationship may not be linear, and later work may uncover more complex relationships that subsume the earlier discovery, but the original conclusion remains justified.

      Importantly, we wish to emphasize that the key conclusions of our study primarily rest upon comparisons between regression models that are by design equally complex, such as surpriseaccording-to-MT versus surprise-according-to-IDyOM and comparisons across different context lengths. We maintain that the comparison with the Baseline model is also important, but even taking the reviewer’s worry here into account, the comparison between different equally-complex regression models should not suffer from it to the same extent as a model-versus-baseline comparison.

      2) The authors make a distinction between "Gestalt-like principles" and "statistical learning" but they never define was is meant by this distinction. The Temperley model encodes a variety of important statistics of Western music, including statistics such as keys that are unlikely to reflect generic Gestalt principles. The Temperley model builds in some additional structure such as the notion of a key, which the n-gram and transformer models must learn from scratch. In general, the models being compared differ in so many ways that it is hard to conclude much about what is driving the observed differences in prediction accuracy, particularly given the small effect sizes. The context manipulation is more controlled, and the fact that neural prediction accuracy dissociates from the model performance is potentially interesting. However, I am not confident that the authors have a good neural index of surprise for the reasons described above, and this limits the conclusions that can be drawn from this manipulation.

      First of all, we would like to apologize for any unclarity regarding the distinction between Gestalt-like and statistical models. We take Gestalt-like models to be those that explain music perception as following a restricted set of rules, such as that adjacent notes tend to be close in pitch. In contrast, as the reviewer correctly points out, statistical learning models have no such a priori principles and must learn similar or other principles from scratch. Importantly, the distinction between these two classes of models is not one we make for the first time in the context of music perception. Gestalt-like models have a long tradition in musicology and the study of music cognition dating back to (Meyer, 1957). The Implication-Realization model developed by Eugene Narmour (Narmour, 1990, 1992; Schellenberg, 1997) is another example for a rule-based theory of music listening, which has influenced the model by David Temperley, which we applied as the most recently influential Gestalt-model of melodic expectations in the present study. Concurrently to the development of Gestalt-like models, a second strand of research framed music listening in light of information theory and statistical learning (Bharucha, 1987; Cohen, 1962; Conklin & Witten, 1995; Pearce & Wiggins, 2012). Previous work has made the same distinction and compared models of music along the same axis (Krumhansl, 2015; Morgan et al., 2019a; Temperley, 2014). We have updated the manuscript to elaborate on this distinction and highlight that it is not uncommon.

      Second, we emphasize that we compare the models directly in terms of their predictive performance both of upcoming musical notes and of neural responses. This predictive performance is not dependent on the internal details of any particular model; e.g. in principle it would be possible to include a “human expert” model where we ask professional composers to predict upcoming notes given a previous context. Because of this independence of the relevant comparison metric on model details, we believe comparing the models is justified. Again, this is in line with previously published work in music (Morgan et al., 2019a), language, (Heilbron et al., 2022; Schmitt et al., 2021; Wilcox et al., 2020), and other domains (Planton et al., 2021). Such work compares different models in how well they align with human statistical expectations by assessing how well different models explain predictability/surprise effects in behavioral and/or brain responses.

      Third, regarding the doubts on the neural index of surprise used: we respond to this concern below, after reviewer 1’s first point to which the present comment refers (the referred-to comment was not included in the “essential revisions” here).

      Reviewer #2 (Public Review):

      This manuscript focuses on the basis of musical expectations/predictions, both in terms of the basis of the rules by which these are generated, and the neural signatures of surprise elicited by violation of these predictions.

      Expectation generation models directly compared were gestalt-like, n-gram, and a recentlydeveloped Music Transformer model. Both shorter and longer temporal windows of sampling were also compared, with striking differences in performance between models.

      Surprise (defined as per convention as negative log prior probability of the current note) responses were assessed in the form of evoked response time series, recorded separately with both MEG and EEG (the latter in a previously recorded freely available dataset). M/EEG data correlated best with surprise derived from musical models that emphasised long-term learned experiences over short-term statistical regularities for rule learning. Conversely, the best performance was obtained when models were applied to only the most recent few notes, rather than longer stimulus histories.

      Uncertainty was also computed as an independent variable, defined as entropy, and equivalent to the expected surprise of the upcoming note (sum of the probability of each value times surprise associated with that note value). Uncertainty did not improve predictive performance on M/EEG data, so was judged not to have distinct neural correlates in this study.

      The paradigm used was listening to naturalistic musical melodies.

      A time-resolved multiple regression analysis was used, incorporating a number of binary and continuous variables to capture note onsets, contextual factors, and outlier events, in addition to the statistical regressors of interest derived from the compared models.

      Regression data were subjected to non-parametric spatiotemporal cluster analysis, with weights from significant clusters projected into scalp space as planar gradiometers and into source space as two equivalent current dipoles per cluster

      General comments:

      The research questions are sound, with a clear precedent of similar positive findings, but numerous unanswered questions and unexplored avenues

      I think there are at least two good reasons to study this kind of statistical response with music: firstly that it is relevant to the music itself; secondly, because the statistical rules of music are at least partially separable from lower-level processes such as neural adaptation.

      Whilst some of the underlying theory and implementation of the musical theory are beyond my expertise, the choice, implementation, fitting, and comparison of statistical models of music seem robust and meticulous.

      The MEG and EEG data processing is also in line with accepted best practice and meticulously performed.

      The manuscript is very well-written and free from grammatical or other minor errors.

      The discussion strikes a brilliant balance of clearly laying out the interim conclusions and advances, whilst being open about caveats and limitations.

      Overall, the manuscript presents a range of highly interesting findings which will appeal to a broad audience, based on rigorous experimental work, meticulous analysis, and fair and clear reporting.

      We thank the reviewer for their detailed and positive evaluation of our manuscript.

      Reviewer #3 (Public Review):

      The authors compare the ability of several models of musical predictions in their accuracy and in their ability to explain neural data from MEG and EEG experiments. The results allow both methodological advancements by introducing models that represent advancements over the current state of the art and theoretical advancements to infer the effects of long and shortterm exposure on prediction. The results are clear and the interpretation is for the most part well reasoned.

      At the same time, there are important aspects to consider. First, the authors may overstate the advancement of the Music Transformer with the present stimuli, as its increase in performance requires a considerably longer context than the other models. Secondly, the Baseline model, to which the other models are compared, does not contain any pitch information on which these models operate. As such, it's unclear if the advancements of these models come from being based on new information or the operations it performs on this information as claimed. Lastly, the source analysis yields some surprising results that don't fit with previous literature. For example, the authors show that onsets to notes are encoded in Broca's area, whereas it should be expected more likely in the primary auditory cortex. While this issue is not discussed by the authors, it may put the rest of the source analysis into question.

      While these issues are serious ones, the work still makes important advancements for the field and I commend the authors on a remarkably clear and straightforward text advancing the modeling of predictions in continuous sequences.

      We thank the reviewer for their compliments.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Rodríguez-Real, Huertas and colleagues here explore the roles of centrosomes in DNA damage responses, focussing on DNA repair activities. They show that centrosome depletion by PLK4 inhibition leads to reduced levels of homologous recombination and increased nonhomologous end-joining, along with altered level of nuclear focus formation by DNA repair proteins. Knockdown of genes that encode components of centriolar subdistal appendages (SDAs) cause reduced levels of RPA foci, with CRISPR-generated CEP170 heterozygotes also showing defects in focus formation. Knockdown of CEP170 impairs homologous recombination, although NHEJ activities are unaffected. Some increase in sensitivity to DNA damaging agents is seen in CEP170- or centriole-deficient cells, albeit with a modest effect size. CEP170 status is shown to affect mutational signatures and patient prognosis in different cancer samples.

      While the experiments are generally well-presented and controlled, the effects seen are not large, so that the the conclusions that the authors draw are not entirely substantiated by the data presented, even without the suggestion of a mechanism. There are several additional experiments and clarifications that I consider necessary to provide appropriate support for the phenomenon.

      Major points

      1. The lack of cell cycle arrest or phenotype in the U2OS cells after a week's treatment with centrinone is somewhat surprising, given their p53 status. The initial description of centrinone showed a distinct impact on U2OS proliferation, albeit after 2 weeks' treatment (although the present paper shows robust impact on centriole numbers after only 1 week in centrinone). It would be useful to know the percentage of mitotic cells, or if there is any increased cell death observed at this stage of treatment.
      2. In the I-SceI assays, were transduction efficiencies or apoptosis within the experiment impacted by centrinone treatment? If not, it would be useful to state that this was examined and that there were no confounding effects; having only normalised data does not allow the reader to exclude these potential confounding factors.
      3. The authors present binary data for a given type of nuclear focus (positive or negative for RPA/ BRCA1/ RAD51), while the supporting images show altered numbers/ intensities. For example, the BRCA1 signals shown in Fig. 3D are less readily distinguished than they are in Fig. 1D. These data should be reconsidered: it is possible that these observations reflect different kinetics of focus formation, rather than a change in IRIF formation capacity. Numbers and a timecourse should be provided, with details of how these are quantitated provided in the Methods.
      4. Are the BRCA1 and RAD51 results seen with centrinone treatment of U2OS cells recapitulated in the Saos-2 and RPE1 lines?
      5. Some additional analysis is needed of the extent to which cells are sensitised to DNA damaging treatments by CEP170 deficiency or centrinone treatment. It should be confirmed that these experiments were performed in biological triplicate, rather than a technical triplicate (within a single experiment); if this is not the case, these experiments should be done in triplicate. Analysing p53-deficient hTERT-RPE1 clones, Kumar et al. (NAR Cancer 2020 PMID: 33385162) showed <10% survival with 100 ng/ml NCS. Hustedt et al. (Genes Dev 2019 PMID: 31467087) showed just over 50% survival with 10 nM CPT treatment, although their data for IR were comparable to the current study. Given the wide variation that these assays seem to incur, the extent to which a ≈20% difference in clonogenic survival is biologically significant may be limited. A rescue of the CEP170 siRNA, and/ or washout in the centrinone experiment would make these data more convincing. The knockdown of CEP170 in Figure 4 should be correctly labelled (not as CEP170+/-); given that the authors have generated CEP170 heterozygotes in Figure 2, this is potentially confusing.
      6. Direct data for the (centrosomal) phosphorylation of CEP170 are limited; it has not been demonstrated that the S637A mutants are fully functional in terms of the centrosome functions of CEP170, so that the conclusion regarding a requirement for centrosomal CEP170 phosphorylation is not sufficiently supported by the available data. The CEP170-dependent changes in RPA focus positive cell percentages shown in Figure 3 are not very marked. The relevant sections should be revised, or the authors should include additional experiments showing directly a phosphorylation of CEP170.
      7. It is difficult to interpret the mutational spectrum data and their significance. These should be compared with data for mutations in NDEL1 mutant cells, and/or other SDA components.
      8. The Kaplan-Meier curves data are intriguing, but their interpretation is highly speculative, given that there are no data on treatment groups included in this study. It is unclear whether other genes that affect SDAs might also impact survival (in the same, or different cancers), so the presentation of those patient groups where CEP170 status impacted survival seems selective, given the ubiquity of HR and centrosomes. These data would be better included as Supplemental information.
      9. The independence of p53 status/ responsiveness of the system is a crucial aspect of this study. Sir et al. (JCB 2013 PMID: 24297747) showed no DNA repair defect in centrosome-deficient chicken DT40 cells. This paper is very relevant to the current study and should be discussed. Similarly, the work by Lambrus et al (JCB 2015 PMID: 26150389) should also be considered.

      Minor points

      1. References for the RPE1 TP53/ SAS6 mutant cell lines should be provided (or controls for their generation presented).
      2. Fig S1K should correct its x-axis to reflect the time intervals correctly.
      3. Fig 2D should show blow-ups of the centrosomes.
      4. To avoid any potential confusion, it would be helpful to indicate in the Figure proper which cells are used for the various analyses.
      5. The 'basal side' of the centriole is not a standard term- this should be clarified. This may be confusing, given the role of centrioles in the basal body.
      6. The consideration of Seckel syndrome seems somewhat speculative at this stage in the exploration of this phenomenon.

      Referees cross commenting I think the comments from Reviewers #2 and #3 are reasonable and justified; there is good convergence between the comments that we all made and I have no issues to raise in this cross-commentary.

      Significance

      Strengths: Much previous work linking centrosomes and DNA damage responses has addressed cell cycle and checkpoint roles of the centrosome, so that a direct role in (nuclear) DNA repair is intriguing. Limitations:The present study shows a relatively moderate impact of centrosome defects on DNA repair, without a clear mechanism. There are some technical details that should be addressed. The relatively limited sensitization to DNA damaging treatments caused by centrosome deficiency questions the biological significance of the phenomenon.

      Advance: The current study presents some new findings that potentially show DNA repair defects resulting from the loss of centrioles (or SDA proteins). This has not been demonstrated to date.

      Audience: The idea of subdistal appendage components contributing to homologous recombinational repair of DNA damage is of potential interest to several fields, ranging from basic centrosome biology through translational to clinical cancer research.

      Reviewer's expertise: basic/ cell biology.

    1. Author Response

      eLife Assessment:

      This manuscript follows the still unanswered concept of 'original antigenic sin' and shows the existence of a 24-year periodicity of the immune response against influenza H3N2. The valuable work suggests a long-term periodicity of individual antibody response to influenza A (H3N2) within a city. But, to substantiate their argument, the authors would need to provide additional supporting data.

      Thank you for your comments. We have performed additional analyses and included those results in the revision to support our findings.

      Specifically, we included a sensitivity analyses that predicting phases by fitting models with 35- and 6-years periodicity, which were found to provide poorer predictions than the 24-year periodicity used in our main results (Figure 4 – figure supplementary 1).

      We also generated a antigenic map with the locations of our tested strains shown in the map. We also compared the paired antigenic distance of A(H3N2) strains (including our tested strains). These results (Figure 1 – figure supplementary 3) suggested that the tested strains that we used spanned the circulation of A(H3N2) since its emergence and well covered the antigenic space of the virus.

      Reviewer #1 (Public Review):

      The authors suggest that there is a long-term periodicity of individual antibody response to influenza A (H3N2). The interesting periodicity may be surely appeared. Though the authors assume that the periodicity is driven by pre-existing antibody responses, the authors could provide more supportive data and discuss some possibilities.

      Thank you for your comments and please find our point-to-point responses below.

      1) The authors can investigate whether the periodicity reflects an epidemic/invasion record of A(H2N3) within Guangzhou or the surrounding city, e.g., the numbers of flu-infected people yearly can be referred to.

      Thank you for your comments. We aimed to investigate the periodicity in individual level antibody responses, so we made several efforts to minimize the impacts of population level A(H3N2) activity in our analyses. In particular, we have removed the average activity at population level (i.e., strain-specific intercepts), to minimize the impact of higher circulation of a certain stain on the periodicity.

      In our simulations, we tested models that only incorporated population level activity but not including cross-reactions (Figure 3B, I), which did not recover the observed periodicity. In the models that including both population level activity and cross-reactions, we found that less predictable population level activities (i.e., less regular annual epidemics) would increase the variations in individual-level long-term periodicity (Figure 3G-H). We also found that measured periodicities did not vary substantially when comparing those measured at baseline compared to those measured at follow up (~3-4 years later). These results suggested that the local epidemics may only have limited impacts on the observed periodicity in individual’s antibody responses, while the cross-reactions between previous exposed and currently circulating strains may be the main drivers.

      To address this comment, we added a paragraph in discussion (lines 336-342):

      “In this study, we did not explore the interactions between individual level antibody responses with population level A(H3N2) activity (e.g., epidemic sizes). We minimized the impacts from population level by performing the Fourier analysis with individual departures from population average and validating the results with data from the Vietnam cohort. Simulation results further suggested that the population level virus activity alone was not able to recover the observed periodicity, though epidemics with less regularity seemed to increase the variability in individual-level periodicity in the presence of broad cross-reactions (Figure 3G-H).”

      2) The authors can consider whether the participants are recently/previously vaccinated and/or infected with flu. The remaining antibodies may reflect a long memory but may show a recent activation.

      Thank you for your comments. We agree with the reviewer that the observed seroconversion of the circulating strains may reflect responses recent re-exposures. Given the low influenza vaccine coverage in our cohort (1.3%, 10 out of 777) and in China in general (<5% [3, 4]), we believe that our observed periodicity and seroconversion patterns were unlikely to be caused by to recent influenza vaccinations.

      We think that the pervasive exposure to A(H3N2) could be a driver to the observed seroconversions to circulating strains between our baseline and follow-up were likely due to the pervasive exposures (or reinfections for those who developed into infections). Using the same data set, we previously reported 98% and 74% of participants experienced 2- and 4-fold rise to any of the 21 tested A(H3N2) strains [5].

      As the reviewer and previous studies suggested, the antibody responses could reflect long term memories that were activated after recent exposures [1, 6]. We generated our hypothesis based on this features, and to characterize the periodicity that may arose from the interactions between long term memories and newly generated antibodies.

      We incorporate the re-infection mechanism in our simulations, with and without subsequent cross-reactions with previously exposed distant strains (Figure 3I). Results indicate that reinfection alone cannot recover the observed long-term periodicity (Figure 3A), while reinfection plus the resulting cross-reactions can recover such long-term periodicity (Figure 3D). Therefore, we believe that the repeated exposures or re-infections would not affect our reported periodicity, while they may be drivers of continuous formulation of the life-course antibody profiles and the observed periodicity. Of particular note is the consistency of measured periodic behaviour at baseline and follow up (~3-4 years later).

      To address this comment, we reported the vaccination status of our participants when introducing the data (lines 127-129) and in the discussions (lines 280-282 and 313-315):

      “Only 0.6% (n = 5) of participants self-reported influenza vaccinations between the two visits, therefore, the observed changes in HI titers between the two visits were likely due to natural exposures.”

      “Due to the low influenza coverage in our participants and in China in general, the observed seroconversions likely reflected antibody responses after natural exposures during the study period.”

      “Particularly, our simulation results suggested that model including repeated exposures or population level A(H3N2) activity alone did not recover the long-term periodicity (Figure 3).”

      3) The strains inducing high HI titers may have similar mutations and may be reactive to the same antibodies. What are the mutation frequencies among 21 A(H3N2) strains?

      Thank you for your comments. We selected the 21 tested strains to cover the span of the circulation of A(H3N2) strains since 1968 and antigenic diversity. We prioritized with the strains that were included in the vaccine formulation and tested to create the antigenic map by Fonville et al. [1].

      We reproduced the antigenic map (up to strains isolated in 2010) by Fonville et al. [1] and compared the antigenic locations of our tested A(H3N2) strains (Figure 1—figure supplement 3). The 21 strains (or their belonging antigenic clusters if the strains were not used for the map) largely tracked the antigenic evolution of A(H3N2) since its emergence in 1968, with a reportedly mutation rate of 0.778-unit changes in antigenic space per year [1, 2].

      We further calculated the paired antigenic distance of strains tested in the antigenic map, which was highly correlated with the time intervals between the isolation of the two strains. The figure also suggested our tested strains cover the time spans and antigenic distances that were shown in the original antigenic map. In addition, our observed periodicity was identified in individual time series of residuals, which has removed the shared virus responses or assay measurements (Figure 1). Therefore, we believe that the impact of specific mutations may have limited impacts on our findings.

      To address this comment, we included the reproduced antigenic map showing the locations of the tested strains and their pair-wise antigenic distance in Figure 1—figure supplement 3 and referenced in the main text (line 127).

      Reviewer #2 (Public Review):

      This is a well-thought-out, clearly exposed article. It builds upon the platform of 'original antigenic sin' (OAS), a notion first developed from studying individuals infected with influenza. According to OAS, the initial infection will set the dominant immune response targets (antigens) that immune cells will recognize, such that infection with a related strain will cause a strong response focused mainly against the initially infecting strain, that then goes on to protect against the new-infecting strain. This study builds off this idea, showing that as strains become increasingly antigenically distant as inferred by the time between strain appearance, the cross-protection can drop to a point where it needs to be invigorated with a potentially new response. The potential biological mechanisms behind this aren't discussed, but a model is built that conveys the potential for 'relative risk' of an individual over the course of the life, based essentially on when one was born.

      Thank you for your comments. We expanded our introduction hoping to include more biological mechanisms, especially those related with original antigenic sin.

      “Antibodies mounted against a specific influenza virus decay (in either absolute magnitude or antigenic relevance) after exposure until re-exposure or infection to an antigenically similar virus occurs, whereupon back-boosting of antibodies acquired from previous infections (e.g., activation of memory B cells) can occur, as well as updating antigen specific antibodies to the newly encountered infection (e.g., activation of naïve B cells.” (lines 80-84)

      “Original antigenic sin (OAS) is a widely accepted concept describing the hierarchical and persistent memory of antibodies from the primary exposure to a pathogen in childhood. Recent studies suggested that non-neutralizing antibodies acquired from previous exposures can be boosted and may blunt the immune responses to new influenza infections.” (lines 92-97)

      The basic premise was to measure from serum influenza haemagglutinin-inhibition (HI) titers of 21 strains of influenza A (H3N2) - related strains causing disease at various times over a period of some 40 years- from a diverse set of ≈800 participants of various ages, at two time points, spaced 2 yr apart. The authors then calculated the HI titer for the 21 strains for each individual. From this, each participant's age, their age at the time of a strain's development, and when a strain emerged were used to assess whether there was periodicity to immune responses by performing a splined Fourier transform for each individual and then examining the composite pattern across time for HI titers. The authors propose that on average there is a 24-year periodicity to immune responses to influenza strains, such that after the initial infection, cross-reactivity reduces to the point where it may be less meaningful for protection over around 24-year, and suggests activation of a 'new' immune response might be required to control the more distant strain involved in the response at that time. The periodicity was longer than would be predicted if age were not a factor involved in the HI titer patterns across time. Further, variability in the periodicity was shown to involve broad cross-reactivity between strains and narrow cross-reactivity in more highly-related (closer in time) strains, individual HI titer, and periodic population fluctuations. In the literature, viral strains are estimated to mutate to the point of losing 50% cross-reactivity with a T1/2 of approximately 2.5 yr, which would make the inferred lifespan plausible but perhaps surprisingly long, implying there are immune feedback parameters that influence periodicity. The authors also use an independent cohort of approximately 150 individuals from a separate, published, study to validate some findings revealed in the primary data set.

      Thank you for your comments and sorry for the confusion. We agree with the reviewer that the onward protection from the cross-protection should be shorter than 24-year periodicity that was identified in the retrospective antibody responses. We hope to clarify that we identified long-term periodicity by retrospectively investigating the individual antibody profiles, which were results of multiple previous exposures and immunity and cross-reactions that arose from these previous exposures. Therefore, the long-term periodicity is a retrospective characterization, and should not be directly interpretated as the duration of onward protection.

      As shown in Figure 4A, the 24-year periodicity consists of phases when individuals’ titers are higher (phase I & II) and lower (phase III & IV) than the population average. As such, the duration of onward protection may be shorter than the entire periodicity. Assuming the protection decreasing with lower titer levels, the onward protection is expected to decrease in phase II and take 1-6 years to drop from the furthest to population average. This is consistent with findings that homotypic cross-protection against PCR-confirmed infections up to about five seasons (lines 291-293), but whether such protection is driven by the declining of cross-reactions still need further investigations.

      To address this comment, we rephrased our discussion and make the interpretation less confusing. (lines 285-287):

      “Of note, the long-term periodicity is a retrospective characterization of individual antibody profiles that arose from multiple exposures and cross-protection, which should not be directly interpreted as the duration of onward protection conferred by the existing antibodies.”

      Strengths: Overall, the study is well executed and the patterns that are visually apparent in Figure 1A (the 'raw' data) are built on to inform a model of the potential breadth of cross-reactivity in a given individual at any given time after birth, integrated with the influenza strains to which they are most likely to have been first exposed. It is a complex thing to make sense of data involving many individuals who could be infected or vaccinated at any and variable points in time over the course of their life, but the authors derive a model that probabilistically accounts for possible infection events, so controls for this nicely, or at least to a degree that is practicable.

      Thank you for your supportive comments. We hope to clarify that we identified the long-term periodicity using the residuals of individual HI titers after extracting the population activity that is visually noticeable in Figure 1A. By doing this, we hope to minimize the impacts of population level A(H3N2) activity and laboratory measurements on individual antibody responses (Figure 1C; detailed methods in lines 396-412).

      Questions related to the main limitation: The level of math in this paper makes it hard for a basic biologist to critique the approach, but the argued points are intriguing. Foremost, in the final part of the paper the authors move from building a model to testing its potential to predict HI titers in the final quarter strains of the study period, placing individuals into one of four phases: I) early increasing to high titer response, II) waning response phase where they are returning back to the average population-level response against a strain, III) sub-par response against a strain and then reinitiation of HI titers in phase IV. Pleasingly this shows a good correlation between individuals' ages and their predicted phase. However, while the fit predicts phase well in Fig 4C and 4D, it looks to perform less adequately in Fig 4B.

      1) Why is this?

      Thank you for your comments and sorry for the confusion. In Figure 4B, we aimed to characterize and predict the position instead of the amplitude in the individual time series of residuals. Therefore, we fitted the model using only harmonic terms (i.e., sine and cosine functions; Equation 12 on page 26) [7], while we believe there may be other factors that could affect the observations but were not included in the model. The perditions from the model inform the position and velocity of harmonic oscillators rather than the amplitude or extent of the wave, therefore, the predictions did not exactly fit the observations.

      To address this comment, we expand the corresponding methods hoping to make it clear (lines 661-663):

      “Of note, we fitted the model aiming to estimate the position of the harmonic oscillators and did not consider for other non- harmonic factors, therefore the model may not fully capture the variations of the data.”

      2) Another point for consideration is that the time between samplings (2010-2012) is comparatively short, given a 24-yr predicted periodicity. What would happen to the predictions if the periodicity were 35-yr or 6-yr? Would the model fail to call individuals accurately in these cases?

      Thank you for your comments. We repeated our predictions in Figure 4F-G by assuming a 35-year and 6-year periodicity respectively as suggested. Results suggested that model predictions with either 35-year or 6-year did not outcompete the model predictions assuming a 24 years old (Figure 4—figure supplement 1). For instance, the observed proportion of seroconversion to circulating strains in each cohort have correlation coefficients of 0.49 (p-value = 0.05), 0.63 (p-value = 0.02) and -0.12 (p-value = 0.69) with the predicted proportion of phase IV when assuming a 35-, 24- and 6-year periodicity, respectively.

      We also hope to clarify that we investigated the prediction potentials of long-term periodicity from two perspectives. Except for using the periodicity to predict the seroconversions between baseline and follow-up, we also predict the phase of each individual in the year of 2012 only using HI titers against strains that were isolated before 2002. Our results suggested our 10-years ahead predictions well correlated with observations (Figure 4C).

      To address this comment, we also included the results of analyses using alternative 35- and 6-year periodicity as Figure 4—figure supplement 1, and reported in the main text (lines 262-264).

      3) Similarly, if the samples were taken further apart, would the model still be effective at predicting phase?

      Thank you for your comments. We hope to clarify that we collected two cross-sectional serum samples, while we identified the long-term periodicity and predicted phase with serums collected from each visit, separately. For instance, in our sensitivity analysis that using serum collected in follow-up (Figure 1—figure supplement 1), we revealed similar long-term periodicity (baseline in Figure 1) with that identified using the baseline serums, despite pervasive exposures during this time period (time separating samples varied from 3-4 years). In addition, the Vietnam data collected sera from six consecutive years. These data showed a similar long-term periodicity (Figure 2—figure supplement 5).

      For the phase prediction, we used residuals of HI titers against 14 historical strains that were isolated between 1968 and 2002, and predicted the phase of strain that was isolated in the year 2012. This prediction was derived purely by depending on the periodic pattern of the time series and without information for strains isolated 10 years prior to 2012. Therefore, the prediction was 10 years ahead and was well correlated with observations from the complete time series, further supporting that there may be an intrinsic cycling in individual antibody responses and that this cycle is fairly stationary and predictable.

    1. This unique positioning made us view our work with renewed purpose, greater creativity, and a sense of urgency. The experience also underscored what we have always known—that diversity, equity, and inclusion are core to the realization of the institutional mission on each one of our campuses.

      Even though the pandemic set us as a world back it also helped us realize how much more needs to be done. It also allowed us to think deep and develop ways to address other inequities we may face a a nation.

    1. Author Response

      Reviewer #1 (Public Review):

      This is a very interesting paper trying to quantify excess deaths due to the COVID-19 pandemic in the USA. The paper is roughly divided into two main sections. In the first section, the authors estimate age and cause-specific excess mortality. In the second section, using their excess mortality estimates, the authors attempt to disentangle the impact of SARS-CoV-2 infection (direct impact) vs. the impact of NPIs on this excess mortality (indirect impact). I have some concerns, particularly with respect to the second section.

      The model used to estimate excess mortality is quite clear. The authors adjust the baseline model to account for low influenza circulation (and deaths) during the COVID-19 pandemic, to avoid underestimating the number of deaths caused by COVID-19. While this makes sense if the authors are trying to estimate the total number of deaths caused by COVID-19, I'm not sure it needs to be accounted for if the authors want to estimate excess/added deaths. A counterfactual scenario would've included influenza. It also raises the question of whether (conceptually) they should be adjusting for other causes of deaths that may have also decreased during the pandemic. The authors briefly acknowledge this in the discussion ("we can't account for changes in baseline respiratory mortality due to depressed circulation of endemic pathogens other than influenza") but my comment goes beyond respiratory diseases. Analyses of excess mortality from other settings have suggested, for example, decreased deaths due to fewer traffic accidents (not in the US) or due to decreased air pollution, and not accounting for these would also lead to an underestimate of the total deaths caused by COVID-19. I understand that it is not feasible to account for all potential factors, so I wonder if they should focus on reporting excess deaths as compared to a counterfactual with influenza.

      Thanks. We think it is helpful to “single out” influenza as it causes major fluctuations in mortality from multiple causes in regular years and is a useful reference to contrast the pandemic impact. But the reviewer’s point is well taken. We have clarified our assumptions about the meaning of the baseline in this analysis (methods p 5), discussed the depressed circulation of other pathogens in depth, and mentioned air pollution (p 12-13). We have also slightly reworked our comparison between COVID19 and influenza so that excess mortality estimates are comparable and now cover periods of the same duration (Nov 2017-Mar 2018 for flu and Nov 2020-Mar 2021 for COVID19, see Figure S11).

      The second section, trying to estimate direct vs. indirect effects is also very interesting. However, more details are required about the regression model used and, importantly, what the assumptions and limitations of the approach are. Specifically:

      • Please provide a bit more information on the regression used for direct vs. indirect effects. I'd like to see explicit discussion of the assumptions and limitations of the approach but also of the stringency index used. Does this model include an intercept? Was the association between stringency index and excess deaths assumed to be linear? Or were different functional forms considered? It is also not clear how well the model fits the data.

      Thanks for these comments which helped us improve this section. We have provided more details about the stringency index in methods (it captures the “sum” of interventions), described the model in methods and supplement, and discussed limitations in caveats section, especially regarding effectiveness of these interventions (p13). We had tried different linear models with and without intercepts but elected to use models with intercepts so as not to overly constrain the relationship between interventions, COVID19 activity and excess mortality. These models also incorporate lags in the predictors that are determined by cross-correlation analysis (as detailed in supplement). In the revised version, we now use gam models, where the relationships between excess mortality and predictors do not have to be linear. We can do so since we were able to add several weeks of data (the regression is now based on 96 pandemic weeks from March 1, 2020 to January 1, 2022). The models are described in detail in supplement p 4-5, and we now specify that they have intercepts. We have also provided additional plots of model fits in main text and supplement (Figures 4 and S16-19).

      • Related to the above, please provide more details on how the results of the regressions were translated into the results presented. The main text reports percentages, but the methods only briefly explain how numbers of direct deaths were calculated, and the supplementary tables report coefficients. It is not clear if these estimates of direct and indirect deaths were somehow constrained to add up to the total number of excess deaths, but it doesn't seem like it since point estimates cross 100% in some cases.

      As discussed in response to one of the editor’s questions, estimates are not constrained to 100%. We have provided more details in the supplement on how we estimate the direct impact of the pandemic. Briefly, we calculate expected deaths in the gam model with all predictors set to their observed values and again with the COVID19 predictor to zero. The direct impact is the difference between the two predictions, divided by the predictions of the full model.

      We note that while some of the estimates derived from gam model exceed 100% (and are similar to the linear model estimates presented in the initial analysis, before revision), these estimates echo the findings from a more empirical analysis, in which we compare all-cause excess deaths with official COVID19 deaths tallies. There, in the two oldest age groups, we find more official COVID19 deaths than estimated by the excess mortality models. Hence both analyses point to an underestimation of the direct burden of COVID19 by the excess mortality approach, specific to the oldest age groups. We return to this point in depth in the discussion (p 12-13) and consider the possible effects of harvesting, depressed circulation of non-SARS pathogens, and inaccurate coding of official statistics (as pointed by reviewer #3).

      • Please discuss the potential limitations of using the stringency index to quantify NPIs.

      Several limitations have been added to caveats (p 13); major issues include aggregation of multiple interventions into a single index, which does not consider the actual implementation nor the effect of interventions. The index is solely based on mandates in place in different locations and time periods. We also assume that the effectiveness of these interventions, for a given level of stringency, does not change over time.

      • When estimating direct and indirect effects, the paper assumes that the estimated parameter is time-invariant? Indirect effects might have changed over the course of the epidemic by factors not necessarily captured by the stringency index used, particularly since the index doesn't take into account the implementation of the measures. Have the authors tested this assumption?

      This is an interesting point, which we have explored further. The non-linear relationships we find between NPIs and chronic condition excess mortality may suggest that the reviewer is right. We discuss the role of NPIs in the results section much more deeply than we were previously (bottom of p8).

      “At lower levels of interventions (Oxford index between 0 and 50), representing the early stages of the lockdown in March 2020, excess mortality rose with interventions. Later in the pandemic, increased interventions were estimated to have a beneficial effect on excess mortality, driven by comparison between the period when interventions were strengthened in response to increasing COVID19 activity in late 2020 (Oxford index above 60) to the period when interventions were relaxed in 2021 (Oxford index between 50 and 60).”

      We cannot run an analysis over different time windows because NPI and time are highly conflated (for instance NPI rise from 0-50% in the very early part of the lockdown period, and then stays above 50% for the rest of the study, so we cannot compare the effect of a 25% level in 2020 and 2021). We have added this limitation in the caveat section p.13.

      • The authors state "In contrast, the indirect impact of the pandemic measured by the intervention term was highest in youngest age groups, decreased with age, and lost significance in individuals above 65 years" - I'm not entirely sure of where this statement comes from? For example Table S3 suggests that the indirect effect (multivariate or univariate) is higher in 25-64 yo than in <25s? The same table also suggests negative impacts (protective effects?) in >75s in the multivariate model. Please clarify.

      There are fewer deaths in the under 25 yo so this is why the coefficients were lower overall in table S3. Yet we find that the proportion of variance explained by interventions is higher in the under 25 yrs than in 25-44 yrs.

      We have now changed our modeling strategy to use gam so Table S3 is no longer relevant but the main conclusion that interventions explain a larger relative portion of excess mortality in the under 25 yrs than in the other age groups, and than other covariates, remains valid. The NPI term is now significant is in all groups (although the relative contribution of NPI still declines with age, as in the prior analysis), so we have rephrased this sentence: “In contrast, the relative contribution of indirect effects, via the intervention variable, was highest in youngest age groups and decreased with age”.

      • How do the authors interpret "Percents of excess deaths" over 100%? Similarly, I don't fully understand how to interpret "The upper bound of the 95% confidence interval for heart diseases was above 100% (158%), suggesting that for every excess death from heart disease estimated by our model, up to 1.58 death from heart disease could be directly linked to SARS-CoV-2 infection.

      We have rephrased this section although the overall conclusions remain unchanged. GAM estimates of the direct COVID 19 impact is statistically significantly above 100% in the 85 yo and over, suggesting that our excess mortality approach is too conservative and does not estimate enough COVID19 excess deaths in this age group. We draw a similar conclusion from a more empirical analysis, in which we compare all-cause excess death estimates with official COVID19 deaths tallies. In this analysis, we find more official COVID19 deaths than estimated by the excess mortality models in the two oldest age groups (point estimates above 100% in the 75-84 and 85+ yrs). Hence both analyses point to an underestimation of the direct burden of COVID19 in the oldest age groups by excess mortality approaches.

      Rephrased results section bottom of p.9: “We estimate that the direct contribution of COVID-19 to excess mortality increases with age, from negative and non-statistically significant in individuals under 25 yrs to over 100% in those over 85 years, echoing the gradient seen in official statistics (Table 4). It is also worth noting that our excess mortality estimates may be too conservative (too high) as we did not account for missed circulation of endemic pathogens. This could explain why our estimates of direct COVID-19 contribution exceed 100% in the oldest age group.“

      We return to this point in depth in the discussion and consider the possible effects of harvesting and depressed circulation of non SARS pathogens (p 12-13).

      • Table 3: The signs of the point estimate vs CI for vehicle accidents are inconsistent.

      Thanks, this was a typo. It should have been 4300 (-700, 9300) excess deaths from accidents. This has been updated with more recent data.

      Reviewer #3 (Public Review):

      Authors examine mortality data in the US and use time-series approaches to estimate excess mortality during the COVID-19 pandemic.

      Major comments:

      I would encourage authors to discuss the two different concepts of excess mortality:

      (#1) what deaths were caused, directly or indirectly, by the pandemic. This is what the authors have aimed to assess, and I have no major concerns with the methodology

      (#2) how many additional deaths occurred during the pandemic, compared to what would have been expected in the absence of a pandemic. For such an analysis I think expected annual influenza deaths should be added back to the baseline (or subtracted from the excess)? Some of the discussion seems to relate more to an impression of #2 rather than #1 but I would be interested in the authors' thoughts.

      We have added more details about the approach, in particular why we think that #1 is the proper analysis here (see methods p 5). Given the sheer magnitude of COVID19 excess deaths (over 1 million excess deaths at the end of our study), adding back influenza deaths (up to 52,000 deaths in a recent severe season with a mismatched vaccine, as in 2017-18) would not make a large difference. We have also provided a more direct comparison of the impact of influenza and COVID19.

      1. Authors estimate fewer excess COVID deaths in the elderly than there were confirmed deaths (Table 3). Could this be an indication of some confirmed deaths being "deaths with COVID" rather than "deaths from COVID"? I'm not sure how to interpret the %s in the final column when they exceed 100%. The authors suggested a harvesting effect but I would suggest "deaths with COVID" might be a more likely explanation? This issue can be a limitation of confirmed-death data.

      This is a good point. We have added a comment along these lines in discussion in the middle of p 12. Still, we think harvesting and/or the depressed circulation of endemic pathogens, which would have inflated our baseline, are more likely explanations for these findings. This is because we find similar estimates (exceeding 100%) in gam models that ignore official statistics and rely on COVID19 case data, or COVID19 hospital occupancy data, and this suggests that other mechanisms, beyond coding of official mortality statistics, are at play.

      Yet, as more detailed official statistics become available, a tabulation of confirmed deaths by presence of a primary vs secondary COVID (U07) code may be revealing and get more directly at the reviewer’s question.

    1. Author Response*

      Reviewer #1 (Public Review):

      ARL3 is a small GTPase that localizes to the primary cilium and plays a role in regulating the localization of some specific ciliary membrane proteins, including PDEδ and NPHP3. Mutations in this gene cause Joubert syndrome, a type of ciliopathy characterized by cerebellar malformation, and retinal degeneration. While the majority of the diseases occur in an autosomal recessive manner, two mutations in ARL3 (D67V and Y90C) have been reported to cause autosomal dominant retinal diseases. In the current paper, Travis et al. sought to understand the pathogenesis of the diseases caused by the two autosomal dominant mutations. They found that D67V acts as a constitutive active mutation, whereas Y90C is a fast-cycling mutant, which can be activated in a guanine nucleotide exchange factor (GEF) independent manner. Since the fast-cycle mutant did not bind to the effector proteins in vitro (likely because the guanine nucleotide falls off from the mutant ARL3, which has a lower affinity to GDP/GTP), they developed a method to snapshot the interaction between ARL3 and its effector. Using this method, they showed that the Y90C mutant indeed has increased interaction with the effectors, suggesting that Y90C is an overactive form of ARL3. They then addressed how photoreceptor cells are affected by these two mutations using a mouse model and found that the mutations disrupt the proper migration of the photoreceptor cells.

      Strengths:

      • The paper is well written, and it was easy to understand what the authors did from the figure legends and the methods section.

      • It was easy to find out what is known or unknown, as the paper has accurate references.

      • The authors developed a method to analyze a snapshot of the interaction between ARL3 and its interactors.

      • The paper has an in vivo model and connects the biochemical characteristics of ARL3 to in vivo cellular phenotypes.

      Weaknesses:

      (1) I understand that authors focused on nuclear migration defect as the phenotype was first described in ARL3-Q71L transgenic mice. The similar phenotype observed in RP2 knockout mice further supports the idea that the defect is caused by the hyperactivation of ARL3. Indeed, the defect is not reported in the ARL3 knockout mice, however, I feel that it does not necessarily mean that the defect is not caused by loss of function. Although it has not been assessed, ARL3 knockout mice might have the same defect. Therefore, I think analyzing both the migration defect and trafficking defect would be more informative, rather than focusing on the migration defect. The fact that the relationship between nuclear migration defect and the retinal degeneration phenotype is not entirely clear further enhances the importance of analyzing the trafficking defect.

      Does the expression of ARL3-Y90C also cause the trafficking defect? If it is the case, you can separate the nuclear migration phenotype from the one caused by the trafficking defect. Would the expression of lipidated cargo(s) rescue the trafficking defect as well?

      I think many questions can be addressed by analyzing the localization of the lipidated cargos, such as PDEδ and GRK1.

      The effect of Arl3-Y90C expression on trafficking of lipidated cargos is an interesting question. Previous papers showed mislocalization of lipidated outer segment proteins in Arl3-KO rods and down-regulation or subtle mislocalization in Arl3-Q71L overexpressing rods. So, this was one of the first things we investigated; however, we never observed mislocalization of ciliary or outer segment lipidated cargos (i.e. GRK1, transducin, Rab28, and PDE) in wild type mature rods that were overexpressing Arl3 mutants, and many were tested. It was through these experiments that we first identified the pronounced nuclear migration defect. Rod photoreceptor nuclear migration is a developmental process that is completed by P10, so Arl3-Y90C overexpression is causing a developmental defect. When rods are positioning their nuclei in the ONL, they are still “immature” as their primary cilium has not begun to elaborate disc membranes for light capture. All our analysis was performed in mature rods, so it is not surprising that we did not observe any lipidated trafficking defects at this timepoint. Since the developmental timing of the nuclear migration defect is important for our manuscript, we have added this to our introduction. Additionally, we use “immature” photoreceptors for the cartoon diagrams showing how Arl3 activity is altered by different mutation and rescue experiments, since formation of the mature outer segment occurs post-migration.

      (2) I am not quite sure if the nuclear migration was assessed properly. Based on the pictures in Fig.1, some of the FLAG-negative cells also seem to be migrating to INL (please see Fig.1C and Fig.1D). Is this biologically normal during development? Could this analysis be affected by the thickness of OPL, the layer between ONL and INL? Also, the picture is cut out in the middle of INL. Could authors include more layers, such as IPL, of the retina in the picture, so that we can evaluate INL and OPL better? Taking this into account, I think it is worth measuring the nuclear position of FLAG-negative cells as a negative control in all the experiments.

      Our electroporation technique results in a small population of rods that express our constructs of interest (~5-15% with a patch). All the experiments were performed in wild type retina which develop normal retinal layers, so analysis of the nuclear position of FLAG-positive cells with the retina is cell autonomous. Migration defects are assessed by differences in the skew of FLAG-labeled rods relative to the boundaries of the wild type ONL, which is marked by Hoechst nuclear stain (also a measure of the FLAG-negative rods). Wild type photoreceptors nuclei are not found within the INL, the nuclei in that layer belong to either horizontal cells or bipolar cells both of which are not targeted by our electroporation approach. As a control, we show that when wild type Arl3-FLAG was expressed FLAG-labeled rods were never observed within the INL. We have now included the % of displaced nuclei in Table 1.

      (3) The way that the authors showed the Y90C mutant of ARL3 is a fast-cycling mutant is not very compelling. In Figure 2C, the authors showed that ARL3 Y90C can bind to PDEδ, its effector, once it is pre-loaded with GTP. The authors also showed that the mutant can bind to its effector even without EDTA as long as an excess amount of GTP is added. The authors used endogenous ARL3 as a control to compare the effects between wild-type and mutants. I see that this experiment has multiple pitfalls. First, ideally, this type of experiment needs to be done with a purified protein using fluorescent guanine nucleotide/radioactive guanine nucleotide (e.g. nucleotide loading assay or nucleotide exchange assay) to directly access the kinetics of nucleotide exchange. However, I do understand that this is out of the authors' expertise. In the authors' experimental setting, I am not sure loading the protein with GTP in the presence of the EDTA means anything more than confirming that the protein is intact. Theoretically, wild-type and a fast-cycling mutant can load GTP with similar efficiency in the presence of EDTA. Then during immuno-precipitation, GTP falls off from the Y90C mutant faster than wild-type (because a fast-cycling mutant theoretically has a lower affinity to guanine nucleotides), assuming that GTP was not added during immuno-precipitation (GTP addition was not mentioned in the method, but could authors confirm this?). But in this case, the kinetic of GTP dissociation can be affected by many factors, including the presence of GAP in the reaction, the dissociation constant of Y90C, the volume of the buffer used, and the number of washing steps. Thus, it is not very easy to estimate the difference between wild-type and Y90C. Besides, using endogenous ARL3 rather than ARL3-wild type FLAG as a control can be dangerous. I have experienced that a tagged protein is cleaved to a protein that has a similar size to endogenous protein. (I expressed GFP-protein X in knockout cells lacking protein X, and saw the band at the position where the endogenous protein is observed in wild-type cells). So, the endogenous band that the authors showed could come from the cleaved FLAG-Arl3. (Authors can easily confirm this by having wild-type not expressing FLAG-tagged ARL3, though).

      An alternative experiment that I would suggest is doing immuno-precipitation in the buffer containing: 1) no guanine nucleotide, 2) 10mM GDP, or 3) 10mM GTP in the cells expressing the following protein: 1) ARL3 wild-type FLAG, 2) ARL3 Y90C FLAG, or 3) ARL3 D129N FLAG. 10mM guanine nucleotide should be added throughout the process including washing. This experiment might also be affected by many factors, but variability should be lower than the experiment presented in Fig 2C. ARL3-wild type FLAG is also a better control here than endogenous protein.

      Variability due to the factors you mention is a concern, but we were able to repeatedly obtain the same results using our method—admittedly our method is testing whether the mutated Arl3 can exchange under a certain condition more than exactly how. We know that we are not providing precise kinetics or elucidating the underlying mechanism for how these mutations lead to what we are calling fast cycling. While that information is important, it is outside the scope of this paper.

      As you mention, an important conclusion from the PDEδ binding experiments is that we confirm the Arl3-Y90C protein is intact by showing it can indeed bind nucleotide as long as there is an excess of GTP (Fig 2B. The interesting finding from these experiments is that Arl3-Y90C binds GTP even in the presence of magnesium, a behavior not observed for wild type Arl3. We feel that showing that endogenous Arl3 is not activated in the presence magnesium in each of our preparations is a lovely internal control. However, we agree that showing wild type Arl3-FLAG in these assays is an important negative control and have now included this blot as Fig 2-Sup Fig 1.

      (4) In Fig.3, the authors attempted to take a snapshot of the interaction between ARL3 and multiple effector proteins. The three bands that were enriched in the Q71L cells were found as RP2, UNC119, and BART by mass spec (Fig.3B). These bands were used as a readout for the subsequent experiments. I am not quite sure why the authors used this approach rather than using the cell line that expresses both FLAG-ARL3 and GFP tagged protein of interest, just like what the authors did in Fig3G. The reasons why I prefer the latter approach are the following: FLAG bands that correspond to the three proteins (RP2, UNC119, and BART) in wild-type cells are very close to the detection limit, 2) authors failed to confirm that the lowest band actually comes from BART, 3) authors cannot access some important effector proteins, such as PDEδ because 293 cells might not express them. All of the problems can be solved by using the approach that was taken in Figur 3G.

      If the authors chose the former approach because of some specific reason, I would appreciate it if the authors could explain that in the main text of the paper.

      In vitro crosslinking experiments were performed to test whether overexpression of Arl3 mutants resulted in an active cellular Arl3 without artificially changing any components of the GTPase cycle. We feel these experiments are highly elegant as they allow us to take a snapshot of native Arl3 activities without compromising the analysis by artificially altering GAP/GEF/effector interactions through overexpression or during lysis (as we show that the concentration of GTP/Mg could alter interactions in Fig 2). While AD293T cells are not rod photoreceptors, we are able to use this system to better understand how the Arl3 mutants alter the level of activity within the cell. Yes, this experimental assay is novel, but we confirmed the identity of the effectors by Western and mass spec, used positive and negative controls in each experiment, and show that the method is highly reproducible. We agree with Reviewers 2 and 3 that using this method to study the cellular activity of fast cycling Arl3 mutants is a strength of our paper.

      (5) ALR3 Y90C causes nuclear migration defect. Given that Y90C is a fast-cycling mutant (hyperactive) and has a high affinity to ARL13B, the nuclear migration defect might come from either the increased activity of ARL3 or sequestration of ARL13B, which can act as a GEF for ARL3 but potentially have other functions. If my understanding is correct, the authors concluded that the defect caused by ARL3-Y90C is likely due to hyper-activation of the protein, as Y90C/T31N mutant, which cannot bind to effectors but still retains the ability to capture ARL13B, did not cause migration defect. But I am a little confused by the fact that Y90C/R149H, which is unable to bind to ARL13B (Fig.2C) but still retains the ability to interact with the effectors (Fig.3F), did not have migration defect (Fig.7B). Wouldn't this mean that the sequestration of ARL13B could contribute to the phenotype?

      If my understanding is correct, the authors are trying to say that both hyper-activation of cytosolic ARL3 and the defect in endogenous ARL3 activation in cilium is necessary to cause migration defect. I am not very convinced by this hypothesis, and still think that the defect could be caused by sequestration of ARL13B to the cytoplasm.

      Then why Y90C/T31N did not cause the defect even though they can sequester ARL13B? This might be explained by the localization of the ARL13B mutants. If Y90C can localize to the cilium while the double mutant, Y90C/T31N, does not, then only Y90C might be able to inhibit the ARL13B function in the cilium. This could explain the lack of the defect in the cells expressing Y90C/T31N.

      It would be helpful to understand how exactly the fast-cycling mutant causes the defect if the authors can provide more information, including localization of ARL3 (wild-type and mutants) as well as key proteins, such as ARL13B and the effector proteins. Assessing ARL13B defect seems to be particularly important to me because ARL13B deficiency has been connected to neuronal migration defect (Higginbotham et al., 2012)

      What I am trying to say here is that how the defect is caused is likely very complex. So, providing more information without sticking to one specific hypothesis might be important for readers/authors to accurately interpret the data.

      Our data shows that for the fast cycling Arl3-Y90C mutation both features: blocking endogenous Arl3 activation in the cilium (through Arl13B binding) and increasing activity of Arl3-Y90C in the cell body are required to produce a nuclear migration defect. We find that we can rescue migration defects by either restoring activation in the cilium or reducing GTP activity outside the cilium. As long as there is more Arl3-GTP activity in the cilium, then the rod can handle aberrant Arl3-GTP activity in the cell body. The Y90C/R149H was a critical result that led to our hypothesis that there is a gradient between the two compartments that is used for proper migration. One interesting point is that absence of any activity does not produce the migration phenotype, further suggesting that an imbalance in the gradient is important.

      We performed new experiments to investigate whether Arl3-Y90C is sequestering Arl13B away from the cilium but found that localization of Arl13B (both endogenous and overexpressed) is not altered by expression of Arl3-Y90C – see Fig 3-SupFig 1-2.

      It is an interesting question as to how different Arl3-FLAG constructs are localized within the photoreceptor. Sadly, we did not analyze the data in a way that would allow us to draw any conclusion about the localization of different Arl3-FLAG constructs. In general, we observed FLAG localization throughout the photoreceptor cell and focused our imaging on the FLAG staining around the nucleus so we could further analyze ONL position. Looking back through our images, some of mutants might have a more prominent localization within a specific subcellular compartment (e.g. Arl3-D67V is more prominent in the inner segment than outer segment and Arl3-Y90C appears to have dominant outer segment localization). Likely, these differences represent each mutant binding a particular effector: D67V to RP2 and Y90C to Arl13B, which we show biochemically. Ideally, Arl3 mutant localization would be analyzed during development to provide a more direct link to the nuclear migration defect, a future direction for our lab. We have updated our manuscript to be more transparent about the potential differences in rod localization of Arl3 mutants.

      (6) The rescue experiments that the authors presented in Fig.5-6 are striking and would build a base for future therapy of the diseases caused by ARL3 defects. However, I believe more examinations are needed to accurately interpret the data. The authors did this rescue experiment by co-injecting ARL3-FLAG and chaperons/cargos if I understand the method section correctly. But I feel we can interpret this data correctly only when ARL3-FLAG and chaperons/cargos are co-expressed in the same cells. I think a better way to analyze the data might be by comparing the nuclear migration phenotype between ARL3-FLAG only and ARL3-FLAG;chaperons/cargos double-positive cells.

      Our lab has found that the initial estimates by the Cepko Lab that co-injection of two plasmids results in above 90% of rods expressing both proteins is accurate (see reference Matsuda and Cepko PNAS 2004). Since we only assess nuclear position of FLAG-labeled rods, it is true that a small percentage of cells in this analysis express the Arl3-FLAG mutant and not the chaperone/cargo; however, inclusion of these cells really only bolsters our findings as complete rescue would likely be even more robust than measured.

      Reviewer #2 (Public Review):

      The small GTPase Arl3 (Arf-like 3) is a well-characterized component of primary cilia, including the outer segment of photoreceptors, which contain specialized cilia. Arl3 is critical for the import of multiple lipid-modified proteins into cilia that are vital to ciliary function. Human mutations in Arl3 are reported to cause both autosomal recessive and dominant inherited retinal dystrophies, but the mechanisms through which these mutations disrupt photoreceptor development are not known. Here the authors show that two dominant Arl3 mutants, Arl3-D67V and Arl3-Y90C exhibit increased activity, but for different reasons. Arl3-D67V is constitutively active (unable to hydrolyze GTP), whereas Arl3-Y90C is a classic rapid-cycling mutant, able to bind GTP spontaneously (independent of its guanine nucleotide exchange factor Arl13) but still able to complete the GTPase cycle by hydrolyzing GTP. Expression of either mutant in developing murine retinas results in a nuclear migration defect, specifically aberrant localization of rod nuclei to the inner rather than outer nuclear layer. In this sense, they phenocopy another well-characterized constitutively active mutant, Arl3-Q71L. Normal nuclear distribution could be restored by overexpression of Arl3 effectors, suggesting that the active mutants disrupt nuclear migration, at least in part, by sequestering Arl3 effectors.

      While the data are reasonably clear and convincing, there are several instances where the conclusions drawn are either confusing or problematic. Specifically:

      1) Although retinal rod cells are ciliated in their outer segment, the authors never actually examine ciliation here. Their only morphological readout is nuclear migration. How does nuclear migration failure impact ciliogenesis in the outer segment?

      Imaging was performed in mature retinas at P21 after outer segment formation is completed. Electroporation only targets a small population of cells for which we observed normal outer segments structures in all conditions tested — therefore we conclude that ciliogenesis is unaffected. Previous literature has also showed that defects in rod nuclear migration do not affect ciliation of the outer segment.

      2) The Arl3-Y90C mutant seems to act physiologically more like a dominant-negative than an activated mutant. A second mutation in Y90C (R149H) that blocks binding to the GEF Arl13 abrogates the nuclear migration defect, suggesting that Y90C is preventing activation of endogenous Arl3 by sequestering the GEF. Yet overexpression of effectors or cargos still rescues nuclear migration in the presence of Y90C, suggesting that it also sequesters effectors. How do the authors explain this?

      We agree with this interpretation. We have now included language about Arl3-Y90C’s role as a dominant negative in that it blocks Arl13B activity. The interesting caveat to this black and white usage is that blocking Arl13B would suggest a reduction in endogenous Arl3 activity in rods (which we find to be true, see Fig 5A). However, the migration defect phenotype mimics overly active Arl3 (Arl3-Q71L) and not a loss of function in Arl3 (Arl3-T31N). Using in vivo crosslinking experiments, we show that the fast cycling nature of Arl3-Y90C also causes GEF-independent activation of Arl3 (Fig 4D-E) that leads to the migration defect. Our rescue data shows that only the combination of both effects – reduced Arl3 activity in the cilium and GEF-independent Arl3 activation outside the cilium - is enough to disrupt the ciliary gradient and produce the migration defect.

      3) Fig. 1 suggests that an Arl3-T31N mutant has no phenotype. This is a canonical mutation in small GTPases that typically renders them dominant negative. The lack of phenotype is surprising since most dominant-negative mutants act by sequestering their GEFs, thereby preventing activation of the endogenous GTPase. Fig. 2C suggests that this may not be the case for Arl3-T31N, which binds Arl13 only weakly. Some of this confusion may arise from the fact that Arl13 is not a typical GEF. It is very unusual for one GTPase to directly promote nucleotide exchange on another. Does Arl3-T31N affect ciliation in the rod outer segment, or in other ciliated cells? Some discussion of this point is warranted here.

      Our paper finds that Arl3 mutants must produce an aberrant activity outside the cilium, whether through constitutive activity (seen for D67V and Q71L) or fast cycling (seen for Y90C and D129N) to cause the migration defect. Since T31N does not cause excess Arl3 activity in cells (see Fig 4) even if it does have some dominant negative activity toward Arl13B, then it is still not enough to cause the migration phenotype. This was directly tested in Fig 5, where we increase T31N binding to Arl13B by introducing Y90C/T31N and still do not see migration defect. Our results are also in line with a previous study showing that despite rapid photoreceptor degeneration in a retina-specific conditional Arl3 knockout mouse the outer segments were initially formed, in contrast the retina-specific conditional Arl13B knockout mouse did disrupt photoreceptor ciliogenesis leading to a more rapid degeneration (Hanke-Gogokhia, JBC 2017). Since complete loss of Arl3 activity did not disrupt ciliogenesis, it is unlikely that expression of Arl3-T31N in wild type retinas would alter outer segment formation, and we observed that outer segments formed in all Arl3 mutants.

      4) Oddly, Arl3-Y90C does robustly bind Arl13 (Fig. 2C), while at the same time binding to effectors (Fig. 3D/E), although less strongly than the canonical Q71L constitutively active mutant (Fig. 2A). As noted in point #2, the Y90C/R149H double mutant, which fails to bind Arl13, abrogates the nuclear migration defect observed with Y90C alone. Although the authors refer to Y90C as "rapid cycling" its phenotype is more similar to a dominant-negative than an activated mutant.

      We agree with this interpretation. We have now included language about Arl3-Y90C’s role as a dominant negative in that it blocks Arl13B activity. However, the rapid cycling behavior is important to cause the phenotype.

      5) The authors also mention that loss of Arl3 has no phenotype in their assay, however, Arl3 knockout mice exhibit severe retinal degeneration. How do they explain this?

      Our study finds that not all human Arl3 mutations will target the same cellular process even though they all result in degeneration. Arl3 knockout mice show drastic alterations in lipidated protein trafficking to the rod outer segment in mature retinas, a phenotype that we did not observe by expressing the dominant Arl3 mutants in wild type rods. Since our tools are not designed to study degeneration of rods, the precise mechanisms of degeneration caused by loss of function or dominant mutations remains to be determined. We outline some ideas in the discussion, but more work needs to be done before making any big statements regarding this. We hope that our manuscript will inspire clinicians to take a closer look at human patients to determine if there are subtle differences between disease presentation for dominant and recessive forms Arl3 inherited mutations. This is beyond the scope of our expertise.

      Reviewer #3 (Public Review):

      This work provides mechanistic insights into two recently described dominant variants of Arl3, a small GTPase, namely mutations D67V and Y90C. The authors identified a phenotype of these dominant variants during the development of rod photoreceptors by in vivo experiments in mice. They specifically observed a defect in rod nuclear migration to their final outer nuclear layer. This phenotype has been previously observed in another constitutively active variant of Arl3, Q71L. The authors performed a series of extensive and thorough biochemical assays to clarify the mode of action of these variants, mostly the Y90C variant, comparing the behavior of these variants to previously described mutants and combining multiple variants by mutagenesis. They also developed a new in vivo crosslinking strategy to be able to identify transient states of protein-protein interactions. They finally performed phenotypic rescue experiments by co-expression of various relevant proteins interacting/involved with Arl3. They finally propose a model based on differential subcellular compartmentalization of Arl3 activation which when disrupted leads to rod nuclei misplacement. These data add to the current understanding of contribution of different Arl3 variants causing human retinal degeneration, which has strong potential translational implications.

      Strengths:

      Relevance of Arl3 dominant variants to human retinal degeneration. Identification of Y90C variant as a "fast cycling" GTPase, and not as a predicted destabilizer of the protein structure.

      New method of crosslinking to enable snapshots of endogenous protein-protein interactions.

      Weaknesses:

      • The relevance of this study is justified by the fact that newly identified dominant variants of Arl3 have been associated to retinal degeneration. However, the authors never assess a degeneration phenotype.

      Electroporation technique allows for rapid expression of constructs, but the sparse expression makes it a poor means to study retinal degeneration. This is important to examine in the future using robust genetic mouse models.

      • The authors show new dominant variants of Arl3, namely Y90C and D67V, cause rod nuclear mislocalization. This phenotype is interesting but this was previously observed with other constitutively active mutation of Arl3, Q71L, and therefore is not novel.

      Yes, the Q71L paper is well cited in our manuscript and set the basis for many of our experiments.

      • The main claim of this paper is that subcellular compartmentalization of Alr3 activation to the cilium (the so called gradient by the authors) is required for proper rod nuclear migration to their final outer nuclear layer destination. The authors provide multiple experiments to support this model, but this is never directly demonstrated.

      We are not aware of any methods that could be done to directly show the subcellular localization of active Arl3-GTP within rod photoreceptors. We agree that we have provided many experiments that support our hypothesis that altering the Arl3-GTP gradient between cilium and cell body produces a nuclear migration defect. Some of our favorites include Fig 6, where we find that the migration phenotype is only rescued with expression of ciliary cargos and not rescued by non-ciliary cargos. Also, the new data requested by reviewers showing Arl13B expression in the cilium can restore the Y90C defect further supports that the Arl3 ciliary gradient is necessary for proper nuclear migration.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01707

      Corresponding author(s): Sarah Butcher, Richard Lundmark

      1. General Statements [optional]

      We thank the reviewers for their insightful comments. The inclusion of the points raised by the referees have strengthened the manuscript. However, some of the reviewer suggestions are beyond the scope of the work (see below), but will doubtlessly be touched upon in future studies by the authors. In addition to incorporating changes relevant to answering the reviewers’ comments, we have edited the manuscript for increased clarity and precision.

      2. Description of the planned revisions

      1. Liposome flotation assay Reviewer #1 suggested that we should perform a liposome floatation assay to separate possible C protein aggregation from membrane binding: "I would strongly recommend supplementing the current liposome sedimentation assay by liposome flotation assay. In contrast to liposome co-sedimentation, the flotation assay can discriminate protein aggregates from proteins bound to liposomes. Although the SDS PAGE shown in Fig. 1A looks pretty convincing, a faint protein band in the „P" lane of the middle panel for the (-) sample is evident. Therefore, C protein aggregation cannot be ruled out and it would be indistinguishable from liposome binding examined by mere co-sedimentation assay”

      Response: We agree that this is a necessary control experiment to add, and we will perform it with liposomes containing 40 % POPS. As we detected complete C protein co-sedimentation with this lipid composition, performing the flotation experiment with the same composition will prove that the earlier result indicates lipid binding and not protein aggregation.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      1. Reviewer #1
      2. In addition, it needs to be clarified which TBEV C protein construct, whether full-length or truncated, was used for co-sedimentation fragmentation.

      Response: We have clarified in this section of the manuscript that the full-length C protein construct was used for the liposome co-sedimentation assays by adding “full-length” prior to instances of “C protein” e.g. in the paragraph starting line 118.

      1. How to understand the finding that „the C protein forms a very rigid layer when adsorbed to the membrane". Can the aggregation of C-protein be ruled-out? Following the 1M NaCl wash of C-protein-bound to SLB, the authors stated: „This shows that initial membrane recruitment of C protein is strongly dependent on its interactions with the negatively-charged lipid headgroups. However, once bound, the C protein-membrane interaction is complemented with non-electrostatic interactions such as membrane insertion or protein oligomerization": does it mean that there are several layers of C protein, the first held by electrostatic interactions, overlayed by non-electrostatically bound C protein? If yes, the illustration of single-layered C-protein adsorbed onto SLB in Fig. 2A, B is not correct.

      Response: We understand the confusion regarding the term “rigid” which was used as a way to describe how we interpret the relatively minor change in the dissipation upon membrane binding. What we intended to describe was that this indicates that the protein is attached in a stable way that does not add viscoelastic properties to the system. These data indicate that the protein does not form large aggregates that non-specifically attach to the membrane in different protrusive orientations. We have clarified this in the manuscript and specified that the as there is no dissipation change, there is no aggregation. We added the following to line 168 “This, in turn, indicates that the C protein does not bind as non-specific aggregates as these would have changed the viscoelastic properties of the system.”

      We do not mean that there are several layers of C protein. We consider, due to the highly charged nature of C, that the most likely explanation is that there are multiple modes of C binding but the result is only one layer, with multiple C-proteins interacting with each other within that layer. We have modified the text at line 184 to: “However, once bound, the C protein-membrane interaction is complemented with non-electrostatic interactions such as membrane insertion or protein oligomerization within the bound layer.”

      1. The sentence: “To confirm that the C protein is biologically active, we investigated its ability to bind RNA" seems to be a little odd because it suggests the model membrane binding assays do not require biological active proteins. However, considering that the interactions leading to binding either negatively-charged lipid or negatively-charged RNA are electrostatic - this sentence must be rewritten.” Response: We thank the reviewer and have now rephrased this sentence to the following at line 249 “Since RNA binding is crucial for the NC assembly, we investigated the C protein’s ability to perform this function.”

      2. “The authors´ statement in the Abstract: „....we investigate nucleocapsid assembly..." is too speculative because the assembly was not studied in their work. It needs to be reformulated.” Response: We agree, and the statement has been removed from the abstract.

      3. Despite this clear and valuable methodological contribution, the authors' contribution to our knowledge of the coordination of the nucleocapsid components to the sites of assembly and budding is not so obvious. Contrary to the earlier idea that the flavivirus is asymmetrically charged (that is, hydrophobic on one side (α2) and positively charged on the other side (α4), recent studies show that the entire surface of the protein is highly electropositive (Mebus-Antunnes et al., 2022). Therefore, a well-ordered neutralization of the flaviviral C proteins' highly positive surface seems critical for the proper organization and assembly of nucleocapsid. I am afraid that the authors do not shed much light on this issue.” Response: The recent structure of the TBEV C protein, published after we submitted the manuscript, shows that indeed the C protein is highly positively charged on all surfaces (updated Supplementary Figure 1 and Selinger et al., 2022). The recruitment of C protein to the membrane, that we demonstrate is dependent on negatively-charged head groups, provides a biologically relevant mechanism for charge neutralization on the C protein surface that interacts with the lipids. The remaining surface charge can be then neutralized by RNA recruitment. Mebus-Antunnes et al. made their observations with just RNA and C protein from Dengue virus in the context of artificial surfaces e.g. mica. However, our experiments utilize the TBEV C protein and specifically include a membrane, the third critical component of NC assembly. Thus, we build upon the work of Mebus-Antunnes et al. by adding a second biologically relevant charge-neutralising component and comparing with a distantly-related virus. We have changed the discussion section of the manuscript to reflect this new structure and to emphasize the advance here. Starting from line 371 we changed the text to: “Recently, it has been shown that the neutralization of the C protein surface positive charge is important for RNA binding in the distantly-related Dengue virus (DENV) (Mebus-Antunes et al, 2022). The recruitment of C protein to the membrane, that we demonstrate is dependent on negatively-charged head groups, provides a biologically relevant mechanism for charge neutralization on the C protein surface that interacts with the lipids. The remaining surface charge can be then neutralized by RNA recruitment.”

      Reviewer #2 1. “What results demonstrate C protein inserts into membrane? The current results support the C protein interacts with membranes with positive charge, but do not seem to demonstrate membrane insertion. If the C protein inserts into the membrane, which regions (helices) play this role?”

      Response: The Langmuir-Blodgett trough tensiometry experiments with monolayers directly measure the insertion of a protein into the monolayer. By determining the maximum insertion pressure of the C protein constructs, we also show that the membrane insertion can occur in bilayers. We show that the N-terminus is not inserting into the membrane, further work, beyond the scope of this manuscript, is needed to pinpoint the residues responsible for insertion, for instance by hydrogen-deuterium exchange or FRET measurements that would not affect folding. To clarify the use of the LB trough, we added the following at line 216: “To investigate if the C protein membrane binding includes insertion into the membrane after the initial electrostatic binding, we used Langmuir-Blodgett trough monolayer experiments. In this approach, the insertion of a protein into a lipid monolayer can be detected by following the pressure (π) of the monolayer after protein injection into the aqueous subphase, with increases in π corresponding to protein injection (Brockman, 1999; Liu et al, 2022).“

      1. The authors should discuss several previous papers reporting the effect of partial deletions of the C gene on the replication of TBEV, West Nile virus, and other flaviviruses.” Response: We agree that this is a necessary addition, and have now added a paragraph in the discussion section starting at line 333: “N-terminally truncated flaviviral C proteins have been shown to be assembly competent and in vitro, able to bind RNA, which is consistent with our results with N-terminally truncated TBEV C protein (Khromykh & Westaway, 1996; Kofler et al, 2002; Patkar et al, 2007; Schlick et al, 2009). One role of C is in the modulation of host responses to infection and the N-terminus maybe involved in that (Yang et al, 2002; Limjindaporn et al, 2007; Colpitts et al, 2011; Bhuvanakantham & Ng, 2013; Katoh et al, 2013; Urbanowski & Hobman, 2013; Samuel et al, 2016; Slomnicki et al, 2017; Fontaine et al, 2018). The membrane insertion directly detected in our experiments is central to C protein function. Other studies have found that deletions in the hydrophobic region of the α2 helix significantly impair particle assembly (Kofler et al, 2002; Patkar et al, 2007; Schlick et al, 2009). In the light of this evidence, we consider that the α2 helix could be responsible for membrane insertion (Markoff et al, 1997; Kofler et al, 2002; Nemésio et al, 2011, 2013).”

      Reviewer #3 1. “In Figure 4, the band (256:1) that are supposedly in the wells (red arrow) is not clear- it is only slightly darker than the other wells.”

      Response: This confusion was the result of unclear wording. We have now revised the figure legend at line 278 to : “The black arrow indicates the bands of freely-migrating RNA, and the red arrow the wells. On lanes 624:1 and 256:1, RNA has been immobilized in the wells.”

      1. Figure S1A, the N-terminal end (which is truncated in the mutant) should be colored on the cyan molecule.” Response: We have coloured the truncated part of the cyan molecule in the figure (now S1B) according to the reviewer’s comment.

      Other 1. As the nuclear magnetic resonance structure of the truncated TBEV C protein has recently been released (Selinger et al, 2022), we have updated the manuscript and Figure S1 to include the information from this structure. We have also generated a new homology model of the full-length TBEV C protein using this structure as a template and included that in Figure S1.

      4. Description of analyses that authors prefer not to carry out

      1. Reviewer #1
      2. However, we do not know whether in the infected cells, the C protein is pre-bound to ER membrane or to viral RNA. Having such a unique assay in their hands, I wonder whether the authors could use the pre-bound C protein with genomic RNA (i.e. the experiment shown in Fig. 4A) ribonucleoprotein complex in the SLB binding assay. If doable, this experiment would be exciting and could bring some important information about NC assembly.”

      Response: We agree that it would be very interesting to decipher if the C-protein first binds to RNA or to membranes using the QCM-D methodology. Yet, our data on pre-incubated C-protein and RNA suggests that large aggregates are formed which would hamper the interpretation of the QCM-D data. Furthermore, based on the suggested experiment, we will not be able to firmly conclude whether or not the C-protein first binds to RNA or to membranes since the time of the experiment will allow rearrangement of preformed complexes between C-protein and RNA. Additionally, the QCM-D measurement cannot differentiate if the preformed complexes bind on their own, or if excess unbound C protein binds the membrane and then recruits the complex. Therefore, addressing this question would require major adjustments to the RNA model system and methodology that we feel are beyond the scope of this study.

      Reviewer 2 1. “The authors should use the lipids detected in the virions to confirm C protein binding experiments.”

      Response: In the mass spectrometry characterization of the TBEV virions, we detected lipids from 9 classes (Car, PE, PS, PI, PG, PC, Cer, HexCer & TG). We have tested four of them (PE, PS, PI, PC) in the liposome sedimentation assay. Additionally, we tested GalCer, which, like HexCer, are cerebrosides. Our liposome binding experiments clearly demonstrate that the C protein does not bind to a specific lipid class, but instead to lipids with negatively-charged headgroups. Therefore, we would argue that doing additional sedimentation experiments with Car, PG, Cer, and TG would not add extra insight to the manuscript.

      Additionally, while the population of lipid species in the TBEV envelope is diverse, the diversity mostly comes from differences in the lipid tails, which do not generally affect the head group-mediated binding of proteins. Therefore, performing additional lipid binding experiments with varying tail lengths would not likely lead to new observations.

      Finally, to perform the authentic experiment of testing C protein binding to liposomes formed from lipids extracted from purified virions would require orders of magnitude more virus sample than our research laboratory is capable of producing. Therefore, we argue that this experiment is beyond the scope of this study.

      1. The study may be strengthened by performing virus mutagenesis experiments.” Response: While we agree that, ultimately, experiments on virus and cells would help to understand the role of the C protein in the biological context, we think these experiments are beyond the scope of this study. For virus mutagenesis, candidate residues should be first identified with biochemical and biophysical studies, which is already beyond the scope of this work. Additionally, the C protein has multiple functions in the host cell in addition to NC assembly, and interpreting the effect on the mutations on e.g. virus titer is difficult.

      Reviewer #3 1. “In all figure legends, authors should write a conclusion line after the description of the experiments - what conclusion is drawn from each experiment.”

      Response: While we agree that adding such a conclusion line would make it easier for the reader to understand each figure, the format of the figure legends is highly subject to journal policy. Therefore, we think that the addition of such lines will be an editorial decision and will depend on the journal. We have, however strived to make the figure titles as informative as possible in lieu of such concluding lines.

  6. Nov 2022
    1. Author Response

      Reviewer #1 (Public Review):

      In this manuscript, the author characterizes the lattice of kinesin-decorated microtubule reconstituted from porcine tubulins in vitro and Xenopus egg extract using cryo-electron tomography and subtomogram averaging. Using the SSTA, they looked at the transition in the lattice of individual microtubules. The authors found that the lattice is not always uniform but contains transitions of different types of lattices. The finding is quite interesting and probably will lead to more investigation of the microtubule lattice inside the cells later on for this kind of lattice transition.

      The manuscript is easy to read and well-organized. The supporting data is very well prepared.

      Overall, it seems the conclusion of the author is justified. However, the manuscript appears to show a lack of data. Only 4 tomograms are done for the porcine microtubules. Increasing the data number would make the manuscript statistically convincing.

      One tomogram can contain one to several tens of microtubules. For example, 64 microtubules were analyzed in the Xenopus-DMSO dataset obtained on 5 tomograms, versus 24 microtubules for the GTP-dataset obtained on 4 tomograms (see Table 1). Hence, taking the number of tomograms to assess the statistical relevance of our work cannot be considered as a valid criterion. Tomograms are taken randomly on the EM-grid sample, solely based on ice quality and the covering of microtubules in the holes as determined at low magnification before tomographic acquisition. No prior knowledge of the structure and lattice-type organization of the microtubules can be obtained before acquisition. It appears to us that a more pertinent criterion is the number of events that we characterized, specifically lattice-type transitions along individual microtubules. In the dataset mentioned by the referee (see Figure 2-figure supplement 3-4 and Table I), 24 microtubules were analyzed and further divided into 195 segments, providing an equivalent number of individual 3D reconstructions. For each 3D reconstruction, almost all lateral interactions could be characterized in terms of lattice-type, i.e., 2091 of the B-type, 460 of the A-type, and 112 not determined (essentially at transition regions). Most importantly, we document in this specific dataset 119 transitions in lattice-type, which we think is sufficient to characterize such molecular events and provide solid statistics for this dataset. Adding the GMPCPP and Xenopus data, we end-up with 938 individual 3D reconstructions (not including the full-length microtubule volumes), 12 463 lateral interactions analyzed (A-, B-, or ND-type), and the observation of 172 lattice-type transitions. Therefore, we respectfully disagree with the referee stating that our work lacks data.

      To highlight the quantity of data used in our work, we have modified the following sentences: L124-131: ' Analysis of 24 microtubules taken on 4 tomograms, representing 195 segments of ~160 nm length (i.e., 2664 lateral interactions), allowed us to characterize 119 lattice type transitions with an average frequency of 3.69 µm-1 (Table 1), but with a high heterogeneity' L160-164: ' Analysis of 31 GMPCPP-microtubules taken on 6 tomograms, representing 338 segments of ~150 nm in length (i.e., 3236 lateral interactions), and using the same strategy as in the presence of GTP (Figure 5—figure supplement 1-2) revealed a transition frequency of 1.25 µm-1 (Table 1), i.e., ~3 fold lower than microtubules assembled in the presence of GTP.' L200-203: ' A total of 64 microtubules taken on 5 tomograms were analyzed in the Xenopus-DMSO dataset (i.e., 419 segments from which we characterized 5446 lateral interactions), and 15 microtubules taken on one tomogram for the Xenopus Ran-dataset (i.e., 86 segments from which we characterized 1118 lateral interactions), (Table 1).'

      In addition, having the same transition with the missing wedge orientation randomly from different subtomograms will allow a better average of transition without the missing wedge artifact.

      In this work, we did not aim at averaging transitions. Transitions in lattice-types are highly heterogeneous in nature, and we wonder what additional information an averaging strategy would have provided. Conversely, each transition is a unique event that we characterized to obtain useful statistics, and the missing data at high angle inherent to electron tomography were not an obstacle to fulfill this task.

      Another thing that I found lacking is the mapping of the transition region/alignment in the raw data.

      In Figure 4, we clearly show the correspondence between the segmented sub-tomogram averages (SSTA) and the raw filtered images at the transition region. This is also the case in Figure 5 where the SSTA (Figure 5A) are compared with the raw tomogram (Figure 5B), and where we clearly visualize the holes that result from the transitions in lattice types.

      However, it is not easy for me or the reader to understand how each segment is oriented relative to each other apart from the simplified seam diagrams in the figures, and also the orientation of the seam corresponding to the missing wedge in the average. With these improvements, I think the conclusion of the manuscript will be better justified.

      The segmentation process is explained in Figure 2-figure supplement 2 and in the Materials and Methods section, which shows that each segment is linearly related to the next. Small rotations can happen between individual segments, and it is important to check that the same protofilaments are followed during the initial modeling (see the online tutorial referenced in the manuscript for full-length microtubules). The segment models are derived from that of the full-length microtubule, as explained in the Materials and Methods section, using a new routine (splitIntoNsegments) implemented into the PEET program. In addition, a detailed protocol describing our SSTA strategy will be submitted following publication of our manuscript.

      Reviewer #2 (Public Review):

      Differences in protofilament and subunit helical-start numbers for in vitro polymerized and cellular microtubules have previously been well characterized. In this work, Guyomar et al. analyze the fine organization of tubulin dimers within the microtubule lattice using cryo-electron tomography and subtomogram averaging. Microtubules were assembled in vitro or within Xenopus egg cytoplasmic extracts and plunge frozen after addition of a kinesin motor domain to mark the position of tubulin dimers. By generating subtomogram averages of consecutive sections of each microtubule and manually annotating their lattice geometry, the authors quantified changes in lattice arrangement in individual microtubules. They found in vitro polymerized microtubules often contained multiple seams and lattice-type changes. In contrast, microtubules polymerized in the cytoplasmic extract more frequently contained a single seam and fewer lattice-type transitions.

      Overall, their segmented subtomogram averaging approach is appropriately used to identify regions of lattice-type transition and quantify their abundance. This study provides new data on how often small holes in the lattice occur and suggests that regulators of microtubule growth in cells also control lateral tubulin interactions. However, not all of the claims are well supported by their data and the presentation of their main conclusions could be improved.

      1 - We have corrected approximative claims and conclusions where necessary. In particular, we now discuss separately the Xenopus-DMSO and the Xenopus-Ran egg extract samples, and have modified our conclusions accordingly. We also deposited onto the EMPIAR all tomograms and PEET models to reproduce the 938 segmented sub-tomogram averages analyzed in this study (see new Supplementary file 2).

      Reviewer #3 (Public Review):

      Protofilament number changes have been observed in in vitro assembled microtubules. This study by Guyomar and colleagues uses cryo-ET and subtomogram averaging to investigate the structural plasticity of microtubules assembled in vitro from purified porcine brain tubulin at high concentrations and from Xenopus egg extracts in which polymerization was initiated either by addition of DMSO or by adding a constitutively active Ran. They show that the microtubule lattice is plastic with frequent protofilament changes and contains multiple seams. A model is proposed for microtubule polymerization whereby these lattice discontinuities/defects are introduced due to the addition of tubulin dimers through lateral contacts between alpha and beta tubulin, thus creating gaps in the lattice and shifting the seam. The study clearly shows quantitatively the lattice changes in two separate conditions of assembling microtubules. The high frequency of defects they observe under their microtubule assembly conditions is much higher than what has been observed in vivo in intact cells. Their observations are clear and supported by the data, but it is not at all clear how generalizable they are and whether the defect frequencies they see are not a result of the assembly conditions, dilutions used and presence of kinesin with which the lattice is decorated. The study definitely has implications for mechanistic studies of microtubules in vitro and raises the question of how these defects vary for protocols from different labs and between different tubulin preparations.

      1 - High tubulin concentration: It has been documented by many laboratories since the discovery of tubulin and the characterization of its assembly properties that a sufficient concentration of free tubulin is necessary to self-assemble microtubules. This is called the critical concentration for self-assembly (the CC, i.e., the critical concentration to overcome the nucleation barrier), and has been reported to be in the range 14~25 µM in the presence of GTP depending on laboratories. For example, in the seminal work of Mitchison and Kirschner the CC was estimated at 14 µM (Fig. 5 of ref. (Mitchison & Kirschner, 1984b)) and self-assembly was induced at concentrations in the range 32-59 µM (Mitchison & Kirschner, 1984a). Our own estimate of the CC for porcine brain tubulin was 21 µM (Fig 2C of (Weis et al., 2010)), and we routinely use a tubulin concentration slightly above the CC when we aim at robust microtubule self-assembly. Hence, we argue that 40 µM, which is ~twice the CC, cannot be considered as a "very high" tubulin concentration to induce microtubule self-assembly.

      2 - Protofilament number and lattice-type transitions in cells: While microtubules with protofilament numbers different than 13 have been observed in different cell types and species (reviewed in (Chaaban & Brouhard, 2017)), we are aware of only one recent study where changes in protofilament numbers along individual microtubules have been reported in cells (Foster et al., 2021), but with no statistics concerning their frequencies. Hence, we cannot compare changes in protofilament number frequencies in Xenopus egg extracts with those that occur in intact cells. Concerning lattice-type transitions, we are not aware of any previous study that documented such features, whether in vitro or in cells.

      3 - Generalization of our results, source of tubulin and protocols: Multi-seams in microtubules assembled in vitro have been reported by several groups in the past (see our Introduction, L49-62), starting from (Kikkawa et al., 1994), the Milligan group (Dias & Milligan, 1999; Sosa et al., 1997), and more recently by the Sindelar group (Debs et al., 2020). In Kikkawa et al. (1994), the authors purified tubulin from porcine brain by three cycles of assembly/disassembly followed by phosphocellulose chromatography. Assembly was carried out at 24 µM in the presence of Taxol. In Sosa and Milligan (1996-1997), the authors used a commercial source (Cytoskeleton) and assembled the microtubules at 30 µM in the presence of Taxol. In Debs et al. (2020), the authors used tubulin purified from porcine brain according to (Castoldi & Popov, 2003), as we did, to assemble GMPCPP microtubules, and bovine brain tubulin (Cytoskeleton) to assemble Taxol-stabilized microtubules. Noticeably, they used an initial tubulin concentration of 100 µM to initiate microtubule polymerization and then added Taxol to continue the reaction.

      We add to these previous studies that microtubules with different numbers of seams are not unique ones, but that both the number and location of seams can vary within individual microtubules. The reason why this was not observed before is that the analytical tools used in those previous studies were not suited to reveal this structural heterogeneity within individual microtubules. By contrast, the SSTA approach that we designed was specifically developed towards this aim. Even in the recent work by Debs et al. (2020) that provides the most comprehensive characterization of multi-seams in microtubules assembled in vitro and that obtained a seam distribution very similar to ours (compare their Figure 3C with our new Figure 10C for GDP microtubules, dark blue bars), their protofilament-based approach could not reveal changes in the number and location of seams within individual microtubules. Yet, they probably could have done it if they had asked whether segments with different seam numbers had been extracted from the same microtubules.

      Here, we designed a specific approach to tackle the structural heterogeneity of individual lattices that permitted this discovery. Not only do we confirm results obtained by others, but we also propose a molecular mechanism that explains how multi-seams form in microtubules assembled in vitro and how they change in location in a cytoplasmic environment. By doing so, we propose a novel molecular event - formation of unique lateral interactions without longitudinal ones - that was not envisioned before, and which to our opinion, must be incorporated in further modelling studies concerning microtubule nucleation and assembly, including the mechanism of dynamic instability (see the Ideas and speculation section).

      4 - Dilution: A 50X dilution was used only for Xenopus egg cytoplasmic extracts to decrease their density on the EM grid just before freezing. These conditions were settled by cryo-fluorescence microscopy to ensure that we had the adequate density of microtubules onto the EM-grid (Figure 7 and Figure 2—figure supplement 1D). Of note, the microtubules analyzed by SSTA were assembled in extracts that were not supplemented with fluorescent tubulin. While we could imagine that dilution may induce the removal of dimers from the microtubule lattice, we cannot foresee how this could change the register between tubulin subunits within the microtubule lattice.

      5 - Kinesin decoration: Like many other laboratories (see the Table in Figure 3 of (Manka & Moores, 2018)), we use the non-processive motor domain of kinesin 1 to decorate microtubules, with the aim to differentiate the - and -tubulin monomers within the microtubule lattice. In particular, it has been shown that lattice parameters such as the protofilament skew and lattice spacing are unmodified when kinesin motor domains are added to GMPCPP- or GDP-microtubules (Zhang et al., 2015, 2018). In addition, we cannot envisage how this non processive motor added to preformed microtubules could change the registry of the -tubulin heterodimers within the microtubule lattice.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Overall comments

      We are pleased by the reviewers’ comments and appreciate their suggestions for improvements. In addition to correcting small typos throughout the manuscript, the major changes we did in response to their comments are as follows:

      • Changed the title of our paper to reflect the strong evolutionary correlation more accurately between sex chromosomal meiotic drive and gains/losses of SNBP genes in
      • New experiments to test the role of the well-conserved, universally retained SNBP, CG30056, in male fertility in * melanogaster*. Although reviewers had suggested we could eliminate this section, we felt that this would add a lot of weight to the unexpectedly inverse relationship between age/retention and fertility functions of SNBP genes. Thus, over the past few months, we have carried out new experiments with increased sample sizes, better controls, and sperm exhaustion. These new results strengthen our earlier analyses.
      • Better clarification of the X-Y chromosome fusion, which is a new observation, in the montium group via careful rewriting as partly suggested by Reviewer #2.
      • Highlighting that the genetic conflicts hypothesis does not rule out a role for sperm competition or other conflicts in shaping SNBP evolution in a revised Discussion. All changes in response to the reviewer’s comments have been detailed in our point-by-point response (below). You will see that we have addressed almost all the suggestions made, including with new experiments. The only reviewer suggestions (all optional from Reviewer 3), which we did not directly address in our revision are:

      • __Branch specific protamine evolution analyses for sex chromosome amplified SNBP genes: __given the state of SNBP gene annotation and the difficulties of assembling these genes in large tandem arrays, this will require considerable work and is beyond the scope of the paper.

      • Covariation between SNBP evolution and sperm morphology: We cannot perform these experiments as there is a paucity of sperm morphology data currently. Obtaining this data reliably is a significant undertaking.
      • Are SNBP genes more prone to be lost than average in the montium group: We have not comprehensively examined all loss events in the montium group or any other Drosophila This is also a non-trivial analysis, albeit it would be very interesting. However, we believe the more relevant comparison is whether these lost SNBP genes are more likely to be retained in non-montium species, which they are, as we now highlight. We hope you will favorably judge our good faith efforts to address all other reviewers’ comments, and their laudatory comments during the previous round of reviews.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      Chang and Malik present a comprehensive evolutionary analysis of sperm nuclear basic proteins (SNBPs) in Drosophila. In addition, they provide a preliminary functional characterization of one such protein (CG30056) and describe a newly discovered X-Y chromosomal fusion in the Drosophila montium species group. All of these findings are interesting and important, but the headline from this study is the well-supported possibility that SNBPs, or at least a large fraction of them, function in suppressing X vs. Y chromosome meiotic drive. While this hypothesis is challenging to test experimentally, the authors provide strong correlational evidence that SNBPs are associated with drive by documenting these proteins' rapid evolution. This rapid evolution takes the form of sequence changes (as predicted by coevolution between drivers and suppressors of drive), gene amplification in cases when SNBPs move to sex chromosomes (consistent with the SNBP becoming a potential agent of drive for its new "home chromosome"), and gene loss in species with X-Y chromosome fusions (in which drive is not predicted to occur).

      Overall, this is an excellent, comprehensive study. The phylogenetic and genomic analyses are first-rate (and one of the first to make use of the new 101 Drosophila genomes); the logic is very well explained; conclusions are supported by multiple lines of evidence; the writing and figures are clear and accessible; and, the findings are fascinating. It's a good sign that it is easy to imagine several experiments one could do to follow up on this study, but I do not feel any are required in revision, as the manuscript is comprehensive as is. Thus, I have just a few minor points the authors may wish to consider in making revisions and a few suggestions for clarity/typos.

      __

      We thank the reviewer for their positive comments on our work.

      1. I would be interested in whether the authors think that all SNBPs in a given Drosophila species function(ed) in meiotic drive, or whether some fraction may play other roles, such as sexual selection or chromatin compaction, which have been the traditional hypotheses for SNBP function. Relatedly, given the high turnover of SNBPs the authors observe and the fact that some melanogaster-essential SNBPs are younger genes, would they like to comment on whether the subsets of SNBPs involved in drive/suppression vs. chromatin packaging/sperm traits/Wolbachia defense are likely to differ from across fly species? The reviewer raises an excellent point. In our revised discussion, we now speculate that different SNBPs might have distinct functions. For example, the same subset of SNBPs is subject to gene amplification and loss whereas other SNBPs are subject to less turnover. Moreover, even this stable set of SNBPs evolves rapidly, including in the montium group of species that have undergone dramatic SNBP loss. As the reviewer suggests, sperm competition or pressures from Wolbachia toxins might be is a driving force for sperm evolution. We discuss these possibilities and conclude in our discussion: “Our findings do not rule out the possibility that forces other than meiotic drive are also important for driving the rapid evolution and turnover of SNBP genes in Drosophila species.

      What do the authors make of the lower isoelectric points for a few of the SNBPs (e.g., CG31010 with pI = 4.77 in Table 1)? These proteins have identifiable HMG box domains, so is the pI driven lower by other parts of the protein sequence?

      We thank the reviewer for raising this point. We found that the pI of HMG domains can range from 6 to 12. Thus, the pI is driven by both HMG domains and other parts of proteins. We now include the pI of the whole SNBP protein and the HMG domain alone in Table 1. We do not have enough biochemical information to speculate on how these differences could alter SNBP function.

      __3. For readers less familiar with the field, it may help to spell out (e.g., on p. 6) why the authors consider ProtA/B to be important for fertility. Some of the previous papers on these genes describe them as dispensable - though the present authors are correct that these previous studies do detect fertility defects of various magnitudes under some conditions.

      __

      We agree with the reviewer. Previous studies are in disagreement about the importance of ProtA/ProtB for male fertility- while no significant effects were seen under standard fertility assays, sperm exhaustion conditions (mating with excess females) did reveal fertility effects. We have now added these references and discussed ProtA/ProtB more fully in our revision.

      On p. 9, paragraph 2, the data showing that "six different SNBP genes underwent 11 independent degeneration events in the montium group" are shown in Fig. 6A, not 5A.

      Thank you. This has been fixed in our revision.

      5. The summary Table 2 is useful, but I wonder whether including relative levels of expression and dN/dS in addition to ordinal rankings might help clarify. For instance, if there were a drop off in mean expression level between the 5th and 6th most highly expressed SNBP, this wouldn't be evident from the table.

      We agree with this suggestion and have now added this information.

      In Fig. 3, I like the use of the clean CG31010 figure in panel A to illustrate the circular representations. In addition, though, it might be useful to show Prot's graph at this same, larger size, since it's the most complicated and will likely be most closely examined.

      We agree with this suggestion and have now amended this figure in line with the reviewer’s suggestion.

      In Fig. 4, the end of the legend says that the species tree is shown "on the right," but it's on the left in the figure.

      Thank you. This has been fixed in our revision.

      __CROSS-CONSULTATION COMMENTS • I agree with both Reviewers 2 and 3 that the title could be changed to be a bit more tentative. I'd had this thought as well.

      __

      We agree with this suggestion. We have now amended this title to “Expansion and loss of sperm nuclear basic protein genes in Drosophila correspond with genetic conflicts between sex chromosomes.”

      • I agree with Reviewer 2 that the fertility assay could be conducted with a larger sample size and a better control in order to be better compared with how the authors described other published fertility phenotypes for SNBPs. For the control, crossing the deletion line to y w (or w1118) and using the resulting heterozygotes (KO/+) would be better than using the mutation over the balancer chromosome (KO/CyO). We agree with both suggestions. We now compare fertility between KO/KO and KO/+ males in sperm exhaustion assays. Our more stringent fertility assays find no evidence of CG30056 role in male fertility, strengthening our previous findings. We have now added the motivation for the new assays and the new results to our Revision.

      • I agree with Reviewer 3's third bullet point about spending a bit more time on the different possible roles that SNBPs could play in spermatids. (This is a more eloquent version of my review point #1.)

      We have now expanded our discussion of other possibilities in our revision.

      • I agree in principle with Reviewer 3's first bullet point about examining whether SNBP evolution correlates with changes in sperm morphology, but this feels like it could be a whole, fascinating study on its own, while this manuscript is already packed with data. I'd welcome the authors' thoughts about this in discussion, but wouldn't personally require a formal analysis of this to be added prior to publication.

      We also agree that this would be an interesting test. However, we are not able to do the test due to the scarcity of sperm phenotype data in Drosophila. We also think that our original version unintentionally downplayed this possibility. Our revised discussion makes clear that the rapid evolution of some Drosophila SNBP genes may be driven by sperm competition, just as in mammals, and influence the evolution of sperm morphologies.

      __Reviewer #1 (Significance (Required)):

      This study describes an important conceptual advancing in our understanding of the evolution and potential functions of sperm nuclear basic proteins (SNBPs) in Drosophila, which stands in interesting contrast to the functional roles of equivalent proteins in primates. It should be of broad interest to biologists studying spermatogenesis, meiotic drive, and genome evolution, both in and out of Drosophila. __

      We thank the reviewer for their positive appraisal.

      __ To contextualize the work, paternal DNA is typically compacted during spermatogenesis. This process involves the replacement of histones with other small, positively charged proteins in a sequential order, ending with protamines that bind DNA in mature sperm. In Drosophila, work over the last two decades (largely from the labs of R. Renkawitz-Pohl, B. Loppin and B. Wakimoto) has identified more than a dozen sperm nuclear basic proteins that localize to condensing/condensed spermatid nuclei. Two interesting observations have been that many of these proteins are dispensable for male fertility, and the proteins vary in their degree of evolutionary conservation. Recent work from Eric Lai's lab (J Vedanayagam et al. 2021, Nat Ecol Evol) showed that in D. simulans and sister species, at least one of these SNBP genes (Prot) underwent gene amplification and now acts in those species as a meiotic driver. This finding suggested the hypothesis, tested thoroughly in the present study, that the rapidly evolving SNBP gene family could be involved in causing or suppressing meiotic drive. Consistent with this idea, the authors here find that SNBP genes expand in copy number more frequently when they move from autosomes to sex chromosomes (consistent with the idea that they may cause or contribute to drive), and that otherwise well-conserved SNBP genes are lost in a group of species in which sex chromosome meiotic drive is not expected to occur. These findings are based on a thorough and well conducted phylogenomic and molecular evolutionary analysis of SNBPs across dozens of Drosophila species. Overall, this work generates exciting new hypotheses about the function of SNBPs and should be widely read both within and outside of the field.

      __

      We are grateful for the reviewer’s accurate summary of our work and its significance. We share the reviewer’s excitement and expect that more studies will explore the new function of SNBPs in multiple taxa soon.

      Keyword describing my field of expertise: Drosophila, molecular evolution, reproduction, genetics, genome evolution.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The paper describes interesting patterns on the evolution of Drosophila SNBP genes, and proposes a very interesting explanation, namely, that meiotic drive is the main evolutionary force behind these patterns. Some of these observations have recently been made by other authors in a single case (the Dox genes in D. simulans), but not in the scale and breadth of the present ms. The ms combines an extensive investigation of available genomes with expert analysis, and new experimental data. In particular, the finding that the ancestral Y became incorporated into de X in montium species is very exciting, and may provide a smoking gun for the explanation proposed by the authors. Overall, I think it is a very good paper. I do have several criticisms and suggestions that may help to improve it.

      __

      We are grateful for the positive comments of the reviewer and for their constructive criticism and suggestions, which we have incorporated into our revision.

      __The paper has a speculative side that it almost unavoidable given its novelty and breadth. I do not see this as a problem per se, but I think the uncertain/unsupported/problematic points should be more openly presented to the readers. The main cases I noted are:

      1. The title of the ms states that "Genetic conflicts between sex chromosomes drive expansion and loss of sperm nuclear basic protein genes in Drosophila", but the evidence is somewhat circumstantial, and the patterns may be explained also by other known phenomena (e.g., demasculinization of the sex chromosomes; below). I think the tone of the end of the Introduction reflects more faithfully the strength of the evidence ("Thus, we conclude that rapid diversification of SNBP genes might be largely driven by genetic conflicts between sex chromosomes in Drosophila."). I understand the temptation of writing a bold title, but I think it is a bit misleading in the present case. I.e., it would be desirable that the title conveys the uncertainties of the data and their interpretation. __

      We agree with this suggestion. We have now amended this title to “Expansion and loss of sperm nuclear basic protein genes in Drosophila correspond with genetic conflicts between sex chromosomes.”

      However, we also want to highlight that de-masculinization of the X chromosome cannot explain the observed amplification and loss patterns of SNBP genes, except in cases of sex chromosome fusions. We now highlight the de-masculinization hypothesis for the latter case, but still strongly favor the genetic conflicts hypothesis.

      "In contrast, we found no instances of pseudogenization or subsequent translocation to the X chromosome of SNBP genes that are still preserved on their original autosomal locations or involved in chromosome fusions between autosomes (0/16). This difference is highly significant (Fig 5 and Table S11; 3:5 versus 0:16, Fisher's exact test, P=0.03). " Readers should be warned that this pattern can also be explained by the well-known demasculinization of X chromosomes (e.g., Sturgill et al. Nature 2007, 450, 238-241)

      We agree with this point and thank the reviewer for pointing this out. We now expressly raise the ‘de-masculinization of X chromosomes’ as one potential explanation of the pattern we observe here.

      "Indeed, no meiotic drive has been documented in the montium species even though it is rampant in many other Drosophila lineages [38]." Two remarks here: a) the authors should make clear that they are referring to sex-chromosome meiotic drive. b) I think the evidence is much weaker than the sentence implies. Sex-chromosome meiotic drive is known in less than 20 Drosophila species, scattered throughout the phylogeny. As far as I know all cases were discovered by accident, so the sampling is biased towards model species (e.g., the obscura group, which was very popular around 1930-1960). So we do not know the true frequency of sex-ratio meiotic drive among Drosophila species, nor, say, if it is more common in the Drosophila or Sophophora species, if it is suspiciously absent in the montium group (as suggested by the authors), etc. I think these uncertainties should be acknowledged or, perhaps, given the weakness of the argument, the sentence should be deleted or attenuated.

      We agree with this comment and have now removed this argument in our revision.

      __ "X-Y chromosome fusions eliminate the extent of meiotic drive and may lead to the degeneration of otherwise conserved SNBP genes, whose functions as drive suppressors are no longer required. Thus, unlike in mammals, sex chromosome-associated meiotic drive appears to be the primary cause of SNBP evolutionary turnover in Drosophila species." The authors found that in the montium species the ancestral Y became incorporated into de X chromosome, and that montium species seem to have an inordinate amount of SNBP gene losses. They combine these two observations by suggesting that these SNBP became dispensable or deleterious because they originally were involved in XY meiotic drive. I think many readers will think that males in montium species are X/0, whereas in fact in all of them carry a Y chromosome (just, in most cases, more gene poor than "normal" Y-chromosomes). I do not think this is a fatal flaw for the explanation proposed by the authors, but certainly is a difficulty that should be acknowledged.

      __

      We agree with this point. It was not our intention to suggest that montium group males are X/O, but this could be misinterpreted as we originally stated. We now add a clarification that montium group males still harbor a Y chromosome, which is missing most ancestrally Y-linked genes.

      __Problems/suggestions with experiments and data analysis

      1. There is a section titled "CG30056 is universally retained in Drosophila but dispensable for male fertility in D. melanogaster". In this section and in the figures, it is stated, "Although CG30056 is the most conserved SNBP we surveyed, we found no clear difference in offspring number between heterozygous controls and homozygous knockout males (Fig 2B). (...) We found either no or weak evidence of fertility impairments in two different crosses with homozygous CG30056 knockout males.". I think the fertility data are weak for the purpose of the authors, and I strongly suspect that this conclusion is wrong. Let me explain why. At other passages of the ms, the authors classify the SNPB genes in three groups. (i) essential/important for male fertility: "Three genes (Mst77F, Prtl99C and ddbt) are essential for male fertility while knockdown or knockout of two other SNBP genes (ProtA, and ProtB) leads to significant reduction in male fertility [27-30, 32]." (ii) genes that do not appear to impair male fertility at all. (iii) untested. CG30056 was in the last group, and hence the authors produced knockouts, tested their effect in male fertility, and concluded that it belongs to the second group. Now, look at Fig. 3B. The numbers of tested males are too small (it seems to range from 3 to 10), and male fertility is known to be a very noisy phenotype (as shown by the huge scatter in the authors' data). Furthermore, two different knockouts were tested, and both were nominally less fertile than the controls, and in one of them the difference is statistically significant. Taken at the face value, the knockouts seem to be perhaps ~25% less fertile than the controls. Another potentially big problem is that the "control males" actually carry visible dominant mutants (the balancers CyO or SM6) which certainly reduce their fitness, whereas the experimental males are wild-type for these mutants. Without the detrimental effect of these visible mutants in the controls, the difference to the CG30056 knockouts will probably be even larger. Note that the fertility effects of the genes ProtA, and ProtB (a.k.a. "Mst35B") , which the authors put in group "essential/important for male fertility" would not had been detected if assayed as the CG30056 gene: Tirmarche et al (2014; the reference cited by the authors) stated that: "In fact, the impact of Mst35B on male fertility was only revealed when mutant males were allowed to mate with a large excess of virgin females (1 for 10; Figure 3F) but not with a 1:1 sex ratio (not shown). " The authors' fertility test did not used this type of challenge. My general impression is that the fertility effects of CG30056 may actually be similar to ProtA and ProtB. I think the authors should do a proper fertility test of CG30056, or remove this section. Another possibly useful approach would be to classify the SNPB genes in those essential for male fertility and those that are not essential, because "experimentally speaking" this is a safer distinction (e.g., the fertility testes reported by other authors may also had been quick tests). Since these genes only function in sperm and are under purifying selection (otherwise they would have been lost; also, all have dN/dS We are very appreciative of the many important points raised by the reviewer. Rather than removing this conclusion, which is not central to our paper, we have now performed additional, well-controlled experiments to address the reviewer’s concerns, which we summarize below:

      2. We agree with the reviewer that it is easier classification to identify SNBP genes that are essential for male fertility versus those that are not.

      3. We also agree with the reviewer and now include more details about earlier studies to highlight that ProtA/ProtB fertility effects were only revealed in a sperm exhaustion setting.
      4. We agree with the reviewer’s suggestion and have now included sample sizes for all our experiments in a new supplementary Table (Supplementary file 8).
      5. We agree with the reviewer that a comparison between KO/KO and KO/Bal males is non-ideal given that Balancer chromosomes carry many deleterious mutations. We now include new experiments in our revision that compare KO/KO and KO/wt chromosomes.
      6. We agree with the reviewer that standard fertility assays may be too noisy to detect subtle fertility effects. We therefore now carry out much more stringent fertility assays under sperm exhaustion conditions with a male: female ratio of 1:10 and at least 10 males tested per genotype Despite this higher stringency, we detect no difference in fertility between KO/KO males and KO/wt controls for CG30056 (>10 males were tested for each). Thus, our original conclusion is even stronger that CG30056 has no detectable effect on male fertility. We have not tested the possibility of sperm storage or precedence being affected in our assays. However, we do believe that the finding that one of the best conserved and retained SNBP genes has no detectable effect on male fertility is an important conclusion which greatly increases the impact of our study, especially since most fertility-essential genes are either young or not universally conserved. We hope these changes will satisfy the reviewer's concerns about this section of our paper.

      "Our phylogenomic analyses also highlighted one Drosophila clade- the montium group of species (including D. kikkawai)- which suffered a precipitous loss of at least five SNBP genes that are otherwise conserved in sister and outgroup species (Fig 3). (...) Given our hypothesis that autosomal SNBP genes might be linked to the suppression of meiotic drive (above), we speculated that the loss of these genes in the montium group of Drosophila species may have coincided with reduced genetic conflicts between sex chromosomes in this clade." The montium data is an important part of the paper. I think the authors should test the statistical significance of this pattern.

      We appreciate the reviewer’s suggestion. However, we are unable to perform the statistical tests suggested for technical reasons. We note that three loss events occurred in the ancestor of D. montium species, while two happened in the ancestors of most D. montium species. Since it’s hard to estimate the evolutionary rates using these internal branches, we can’t directly compare them to other branches using statistics. However, in response to the reviewer’s comments, we now more clearly contrast the fate of SNBPs between D. montium species and other melanogaster group species, noting that three of five genes lost in the montium group are retained in all other melanogaster group species.

      __Other points:

      1. "The five remaining SNBP genes (Mst33A, CG30056, CG31010, CG34269, and CG42355) remain cytologically uncharacterized [30]." I think it will be interesting if the authors look at other potentially useful resources: Vibranovski et al papers which looked at gene expression in mitotic, meiotic and post-meiotic cells (_https://mnlab.uchicago.edu/sppress/index.php), and the papers by several labs on testis single-cell transcriptomic data (Witt et al 2021 PLOS Genetics. 17(8):e1009728 ; Nat Commun. 2021;12: 892). These may provide additional clues on the function of SNBP genes. There is also a recent report on sperm proteome (doi: _https://doi.org/10.1101/2022.02.14.480191) __

      We are grateful to the reviewer for this suggestion. We now add the data from single-cell expression analyses from Witt et al. in Table 1-figure supplement 1. We found most SNBPs are expressed at late spermatocytes and early spermatids, although CG30056 is primarily expressed in late spermatids, whereas CG34269 is expressed earlier in late spermagonia. The data from Vibrranovski et al. also show similar patterns but don’t have four of these genes, including CG34269. The data from Mahadevaraju et al. are from larva testes, and lack some critical stages during spermatogenesis. Thus, we only report the data from Witt et al.

      We also surveyed the proteome data as the reviewer suggested, but we only found 3 SNBPs (ProtA, ProtB, and Prtl99C) in the data. This did not include, Mst77F, which is the most highly expressed (see Table 2) and well-studied SNBP, so we suspect the proteomic study might be biased toward proteins from sperm tails. Therefore, we decide not to include this analysis.

      ____ "Our inability to detect homologs beyond the reported species does not appear to result from their rapid sequence evolution. Indeed, abSENSE analyses [45] support the finding that Prtl99C, Mst77F, Mst33A, Tpl94 and CG42355 were recently acquired in Sophophora within 40 MYA. For example, the probability of a true homolog being undetected for Prtl99C and Mst77F is 0.07 and 0.18 (using E-value=1), respectively (Table S1, Methods)." This should be complemented by synteny analysis.

      It may not have been clear from our original version that we did perform synteny analyses for all SNBP genes. We have now restated this more clearly in our revision.

      I found the following sentence unclear: "However, we could only ascribe a sex chromosomal linked location for species if no data was available from either BUSCO genes or females (only males and mixed-sex flies)."

      We modified the sentence to make it clearer: “However, we could not ascribe a sex-chromosomal linked location of a contig to either the X or Y chromosome in cases where there was no linkage information from BUSCO genes and no read data available from females, only from males and mixed-sex flies.”

      "Using the available assemblies with Illumina-based chromosome assignment, we surprisingly found that most ancestrally Y-linked genes are not linked to autosomes as was previously suggested [by Dupim et al 2018] (Fig 6A)."

      The new result of X-linkage is exciting, but the sentence is not exact: Dupim et al 2018 made clear that they could only separate X/A from Y-linkage. E.g., the legend of their Fig 3: "Phylogeny and gene content of the Y chromosome in the montium subgroup. "M" means amplification only in males (i.e., Y-linkage), whereas "MF" means amplification in both sexes (autosomal or X-linkage)."

      We are grateful to the reviewer for this correction. We now modified the sentence to make clear that Dupim et al had “showed that many ancestrally Y-linked genes are present in females because of possible relocation to other chromosomes in the montium group.”

      "The most parsimonious explanation for these findings is a single translocation of most of the Y chromosome to the X chromosome via a chromosome fusion in the ancestor of the montium group of species. Afterward, some of these genes relocated back to the Y chromosome in some species (Fig S6; Supplementary text)." Explanations for this pattern of "return to the Y" have been extensively discussed and tested in Dupim et al 2008 (see their section "Why genes seem to return to the Y chromosome after Y incorporations?" ) The available evidence strongly suggests that it is not a case of relocation to the Y.

      We thank the reviewer for raising this point. However, our conclusions disagree slightly with those from Dupim et al. 2018, in part because of additional sequencing in this clade. Dupim et al. suggested the possibility that most Y chromosomal loci duplicated to other chromosomes in the ancestor of the D. montium clade, following which each species degenerated either Y-linked or autosomal copies of genes. If this was the case, Y-linked copies should have diverged from X-linked copies since the ancestor of the D. montium clade. In contrast to this expectation, our phylogenetic analyses found that D. kikkawai Y-linked PRY is more closely related to X-linked PRY in all other related species (Figure 6- figure supplement 1). This result is much less parsimoniously explained by the ancient duplication event proposed by Dupim et al. and is more consistent with a ‘return-to-Y’ that we propose. We also make clear that, unlike PRY, we can’t differentiate the two hypotheses in the case of kl-2.

      Fig 6B suggests that the authors assembled the "translocated Y" in D. triauraria. However, no direct data or account for this assembly is provided. Please clarify.

      This was not our assembly. We searched all publicly available assemblies in the montium group and found one assembly (NCBI accession GCA_014170315.2) that assembled all ancestral Y-linked regions. We now clarify this in our revision.

      __ "Why would meiotic drive only influence Drosophila, but not mammalian, SNBP evolution? One important distinction may arise from the timing of SNBP transcription. In D. melanogaster, SNBP genes are transcribed before meiosis but translated after meiosis [29, 43, 57]. Thus, SNBP transcripts from a single allele, e.g., Xlinked allele, are inherited and translated by all sperm, regardless of which chromosomes they carry. Consequently, they can act as meiotic drivers by causing chromatin dysfunction in sperm without the allele, e.g., Y-bearing sperm." During spermatogenesis Drosophila haploid cells actually are syncytial, which has interesting consequences for the evolution of male genes (Raices et al, Genome Res. 1115-1122, 2019). This may be relevant for the present paper.

      __

      We thank the reviewer for this suggestion. We now gratefully include this citation in our revision.

      __Reviewer #2 (Significance (Required)):

      see above __ __Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript by Chang & Malik consider the evolution of HMG-box-containing sperm nuclear basic proteins (SNBPs) across Drosophila species in phylogenetic context.

      Previous work in mammals had highlighted fast evolution of proteins involved in chromatin remodeling during spermatogenesis. Here, the authors provide evidence for widespread positive selection and likely involvement in genetic conflict in a set of proteins with analogous functions in Drosophila. Amongst other findings, the authors highlight biased amplification of SNBP paralogs on sex chromosomes along several Drosophila lineages, a tendency towards loss/pseudogenization following translocation onto a sex chromosome, and an intriguing concerted SNBP loss event in the montium group where parts of the Y chromosome have become fused to the X, thus nullifying the chance that genetic conflicts can play out via distorted segregation of sex chromosomes. The authors suggest that, taken together, their findings support widespread of SNBPs involvement (as instigators and repressors) in meiotic drive. Overall, I found the manuscript to be well written and thorough in its exploration of the evolutionary dynamics of SNBPs in this clade.

      __

      We thank the reviewer for the accurate summary and the kind comments.

      __Below, I have highlighted some aspects that I think would benefit from further attention, none of them major.

      • Following their exploration of patterns of SNBP evolution in Drosophila, the authors highlight support of their data for genetic conflict between sex chromosomes. They also rightly acknowledge that other evolutionary drivers such as sperm competition might also play a role in, for example, fast evolution of certain SNBPs. Yet those (not mutually exclusive) alternatives are never pitted directly against each other. The focus is firmly on exploring the support for the sex chromosome genetic conflict model. Given that the authors highlight Drosophila as a great model in part because of its well characterized sperm biology (including comparative morphology), I wondered why the authors had not made an explicit attempt to see if SNBP evolution covaries with aspects of sperm morphology across Drosophila. __

      We do agree with the reviewer that it will be very interesting to test whether SNBP evolution covaries with sperm morphology in Drosophila. However, data on sperm morphology is scant in most Drosophila species. Indeed, this trait has only been well studied in clades with heteromorphic (different-sized) sperm but we agree this will be an exciting topic to consider in the future.

      We also clarify better in our revised discussion that our analyses do not rule out a role for sperm competition or sperm morphology in driving the evolution of at least some SNBP genes. We note that a subset of SNBP genes undergo gene amplifications and loss, but most SNBP genes evolve rapidly including in species with gene loss. Thus, the meiotic drive hypothesis is not to the exclusion of other hypotheses.

      • The most intriguing part of the manuscript for me was the exploration of SNBP fate in the montium group, where the authors find evidence for an ancestral fusion event between the X and parts of the Y chromosome. The loss of SNBPs is certainly consistent with the conflict model but I was wondering to what extent this lineage is characterized more broadly by unusual evolution at the chromosomal level. Is there simply a lot of upheaval in montium, with more frequent gain/loss across the board? How specific is SNBP loss in the context of other orthologous groups? This could be investigated by looking at retention of other genes in other orthologous groups (in montium and some other control group) or perhaps by looking at synteny conservation. This is a good suggestion. Using the same methodology as used in this paper, we found that very few D. melanogaster essential genes (2000) are lost in any single species we surveyed here (unpublished data). However, we have not carried out similar analyses for all genes; given vastly different rates of evolution, this would be a significant undertaking. Thus, we are not able to make a direct comparison between SNBP genes and a control group, that would include other testes-specific or fertility-essential genes. Instead, we highlight the fact that since we identify SNBPs using syntenic analyses, we have known that the neighboring genes of SNBPs are much better conserved than the SNBP genes themselves in the montium group species.

      • In introducing SNBPs, the authors focus on their role as packaging agents. Clearly, SNBPs do package the genome in the sense that they bind to DNA and lead to reduced chromosome volume. But is this all packaging for packaging's sake (as portrayed by the sperm shape hypothesis)? Or is the situation a bit more nuanced, where condensation leads to a reduction of volume but also to a shutdown of transcription, protection from DNA damage, etc.? I think the focus on packaging alone is somewhat limiting when it comes to imagining how these proteins might act in the context of genomic conflicts. The authors may want to broaden their description of SNBPs in the Introduction accordingly. We completely agree with the reviewer and are currently exploring these possibilities in follow-up studies on SNBP function. However, it is fair to add that this hypothesis has not been well-recognized, and we, therefore, prefer to include it in our revised Discussion rather than Introduction. However, we also think that SNBP packaging function might be targeted by Wolbachia-encoded toxins, speeding up their evolution (revised Discussion). We think there are many molecular possibilities for SNBPs.

      • The authors highlight that some SNBPs are expressed in mature sperm whereas others are transition proteins. The evidence for positive selection chiefly comes from the latter group (and "undefined" proteins that could also be transition proteins). Can the authors comment on whether this is expected/unexpected? Along the same lines, the authors highlight differences between Drosophila and mammals when it comes to the timing of transcription/translation during meiosis, suggesting that meiotic drive can happen in Drosophila because alleles are expressed early and can exert an effect after meiosis regardless of whether the associated locus is present in the gamete. I wonder how this relates, if at all, to the author's finding that transition SNBPs are more likely to be part of conflicts (as indicated by positive selection signals) compared to SNBPs in mature sperm. We thank the reviewer for this comment. We expect that many genes expressed explicitly in spermatogenesis, including SNBP genes, would be under position selection, regardless of whether they are associated with X-Y conflicts. The positive selection signals could come from either X-Y conflicts, sperm competition, or conflicts with Wolbachia; we now discuss all of these in the Discussion.

      In contrast, the amplification and loss of a subset of Drosophila SNBPs are more likely associated with X-Y conflicts. We note that known SNBPs retained in mature sperm are more likely to be subject to amplification than known transition proteins.

      Regarding the timing of expression, it is true that transition SNBPs act earlier in spermatogenesis than SNBPs retained in mature sperm. However, for the meiotic drive hypothesis to apply, all it requires is for SNBP expression to precede sperm individualization, which it does for most SNBPs, including transition proteins.

      • ____ It is not entirely clear from the text (and also e.g. Table S4) how dN and dS (and subsequently dN/dS) where calculated. I presume as a single estimate across the whole phylogeny? If so, how heterogeneous is dN/dS across the phylogeny and can the authors identify specific branches on which selective regimes are different? A branch-level analysis should be better powered than the site-level analysis the authors present, which requires repeated selection on the same set of sites to get a strong enough signal. A branch-specific assessment of evolution would be particularly valuable in combination when combined with the assessment of amplifications/losses. We thank the reviewer for this question. The reviewer is correct. We estimated dN and dS in Supplementary file 4 across the whole phylogeny. We conducted branch tests for the amplification of tHMG only in the Dsim clade (Supplementary file 11).

      We are interested in how SNBP amplification happened across species, but we need better gene annotation for their structure in many of these 19 independent cases. Moreover, we hope to combine these with transcriptomic analyses with detailed sequence analyses to reveal how the event happened and how gene conversion, gene duplication, and mutations affect their evolution. Each of these analyses requires extensive additional resources and analyses, and we feel are beyond the scope of this current paper.

      • The authors suggest that young SNBPs are more likely to encode essential, non-redundant male fertility functions (p7, third paragraph). I'm not sure whether this generalization is appropriate given the small sample. Tpl94D is as young as Mst77F/Prtl99C, tHMG and CG14835 homologs have been lost along different lineages and most of the events are in a single lineage leading up to D. kikkawai. Do the authors really feel that this generalization is warranted? We agree with the reviewer. However, it is striking that the known fertility essential genes are either young or not universally conserved. We have therefore reworded our conclusion to make this contrast more accurate.__

      • How do the sex-chromosomal amplifications differ in sequence from the ancestral autosomal copies? The authors suggest that the sex chromosomal copies might be involved in meiotic drive? Does the sequence offer a function as to how? (e.g. loss of charged residues/DNA-binding capacity?__

      These are good questions. We do not know mechanistically how the sex-chromosome amplifications may cause meiotic drive. We did not observe the loss of positive charge or HMG domain in most sex-chromosomal amplified copies (Supplementary file 3). Our current working hypothesis is that they compete for the DNA binding with autosomal SNBP, and might interact with other proteins, e.g., heterochromatin proteins, to disturb sperm function. How they might function to cause meiotic drive is an active area of investigation in our and other labs.

      • I think it would be nice to have a final table/figure to summarizing the different lines of evidence for all the genes in Table 1 (i.e. positive selection yes/no, amplification in some lineages yes/no, sex chromosome translocations yes/no), for different lineages, including whether any of the HMG-box genes are unlikely to act as SNBPs. We agree with this suggestion. We have now significantly revised and added to Table 2 to include this added information.

      • The evidence the authors present is often consistent with genetic conflicts between sex chromosomes. Is it cogent? Arguably not (since direct tests of the mechanism are provided. I would therefore suggest a more cautious title than one stating that conflicts drive expansion and loss of SNBPs. We agree with all three reviewers and have amended our title to highlight the correlation. We also discuss other possibilities that can drive SNBP evolution in our revised Discussion.

      __Typographical errors etc.:

      • P3. First paragraph: "One of the driving forces ... " I found this sentence a bit odd in terms of causality (changes in composition being portrayed as a force that leads to selection) __

      We thank the reviewer for pointing out the confusing construction. We modified the sentence to “The positive selection of SNBPs results in changes to their amino acid composition.”

      - P3. Second paragraph: should be "HMG-box" rather than "HMB-box"

      Fixed.

      - P3. Fourth paragraph "..., consistent with the observation in mammals". I think "consistent" should be reserved for two observations that speak to the same phenomenon. SNBPs could evolve with no evidence for positive selection in Drosophila and that wouldn't exactly be "inconsistent" with mammals. It would just be different.

      Fixed. We changed “consistent with” to “similar to”.

      ____- P5. Fifth paragraph: should be "in the PAML package" rather than "in PAML package"

      Fixed.

      - P9. Second paragraph: "... montium group (Fig 5A)...)" should be Fig 6A.

      Fixed.

      __CROSS-CONSULTATION COMMENTS I have not much to add. The other reviews seem fair and well-informed from my somewhat-outside perspective. I don't know how tricky/time-consuming the suggested additional fly mating experiments are but want to note that, in general, I'm loath to "punish" authors of principally bioinformatic work for including some experiments. If experimental shortcomings can be addressed with appropriate caveats, that should be an option, as should removal of experimental data that - by the experts - would be considered too preliminary.

      __

      We thank the reviewer for their support. However, we felt that improved experiments on CG30056 role in fertility could broaden the scope of this paper, despite the additional time and labor commitment. We have now finished these experiments and they do reinforce our original conclusions with much greater support.

      __It is my policy to sign my reviews.

      Tobias Warnecke

      Reviewer #3 (Significance (Required)):

      I'm not enough of an expert in the field of SNBPs to assess the level of advance provided by this study. __

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper describes interesting patterns on the evolution of Drosophila SNBP genes, and proposes a very interesting explanation, namely, that meiotic drive is the main evolutionary force behind these patterns. Some of these observations have recently been made by other authors in a single case (the Dox genes in D. simulans), but not in the scale and breadth of the present ms. The ms combines an extensive investigation of available genomes with expert analysis, and new experimental data. In particular, the finding that the ancestral Y became incorporated into de X in montium species is very exciting, and may provide a smoking gun for the explanation proposed by the authors. Overall, I think it is a very good paper. I do have several criticisms and suggestions that may help to improve it.

      The paper has a speculative side that it almost unavoidable given its novelty and breadth. I do not see this as a problem per se, but I think the uncertain/unsupported/problematic points should be more openly presented to the readers. The main cases I noted are:

      1. The title of the ms states that "Genetic conflicts between sex chromosomes drive expansion and loss of sperm nuclear basic protein genes in Drosophila", but the evidence is somewhat circumstantial, and the patterns may be explained also by other known phenomena (e.g., demasculinization of the sex chromosomes; below). I think the tone of the end of the Introduction reflects more faithfully the strength of the evidence ("Thus, we conclude that rapid diversification of SNBP genes might be largely driven by genetic conflicts between sex chromosomes in Drosophila."). I understand the temptation of writing a bold title, but I think it is a bit misleading in the present case. I.e., it would be desirable that the title conveys the uncertainties of the data and their interpretation.
      2. "In contrast, we found no instances of pseudogenization or subsequent translocation to the X chromosome of SNBP genes that are still preserved on their original autosomal locations or involved in chromosome fusions between autosomes (0/16). This difference is highly significant (Fig 5 and Table S11; 3:5 versus 0:16, Fisher's exact test, P=0.03). " Readers should be warned that this pattern can also be explained by the well-known demasculinization of X chromosomes (e.g., Sturgill et al. Nature 2007, 450, 238-241)
      3. "Indeed, no meiotic drive has been documented in the montium species even though it is rampant in many other Drosophila lineages [38]." Two remarks here: a) the authors should make clear that they are referring to sex-chromosome meiotic drive. b) I think the evidence is much weaker than the sentence implies. Sex-chromosome meiotic drive is known in less than 20 Drosophila species, scattered throughout the phylogeny. As far as I know all cases were discovered by accident, so the sampling is biased towards model species (e.g., the obscura group, which was very popular around 1930-1960). So we do not know the true frequency of sex-ratio meiotic drive among Drosophila species, nor, say, if it is more common in the Drosophila or Sophophora species, if it is suspiciously absent in the montium group (as suggested by the authors), etc. I think these uncertainties should be acknowledged or, perhaps, given the weakness of the argument, the sentence should be deleted or attenuated.
      4. "X-Y chromosome fusions eliminate the extent of meiotic drive and may lead to the degeneration of otherwise conserved SNBP genes, whose functions as drive suppressors are no longer required. Thus, unlike in mammals, sex chromosome-associated meiotic drive appears to be the primary cause of SNBP evolutionary turnover in Drosophila species." The authors found that in the montium species the ancestral Y became incorporated into de X chromosome, and that montium species seem to have an inordinate amount of SNBP gene losses. They combine these two observations by suggesting that these SNBP became dispensable or deleterious because they originally wee involved in XY meiotic drive. I think many readers will think that males in montium species are X/0, whereas in fact in all of them carry a Y chromosome (just, in most cases, more gene poor than "normal" Y-chromosomes). I do not think this is a fatal flaw for the explanation proposed by the authors, but certainly is a difficulty that should be acknowledged.

      Problems/suggestions with experiments and data analysis

      1. There is a section titled "CG30056 is universally retained in Drosophila but dispensable for male fertility in D. melanogaster". In this section and in the figures, it is stated, "Although CG30056 is the most conserved SNBP we surveyed, we found no clear difference in offspring number between heterozygous controls and homozygous knockout males (Fig 2B). (...) We found either no or weak evidence of fertility impairments in two different crosses with homozygous CG30056 knockout males.". I think the fertility data are weak for the purpose of the authors, and I strongly suspect that this conclusion is wrong. Let me explain why. At other passages of the ms, the authors classify the SNPB genes in three groups.
        • (i) essential/important for male fertility: "Three genes (Mst77F, Prtl99C and ddbt) are essential for male fertility while knockdown or knockout of two other SNBP genes (ProtA, and ProtB) leads to significant reduction in male fertility [27-30, 32]."
        • (ii) genes that do not appear to impair male fertility at all.
        • (iii) untested. CG30056 was in the last group, and hence the authors produced knockouts, tested their effect in male fertility, and concluded that it belongs to the second group. Now, look at Fig. 3B. The numbers of tested males are too small (it seems to range from 3 to 10), and male fertility is known to be a very noisy phenotype (as shown by the huge scatter in the authors' data). Furthermore, two different knockouts were tested, and both were nominally less fertile than the controls, and in one of them the difference is statistically significant. Taken at the face value, the knockouts seem to be perhaps ~25% less fertile than the controls. Another potentially big problem is that the "control males" actually carry visible dominant mutants (the balancers CyO or SM6) which certainly reduce their fitness, whereas the experimental males are wild-type for these mutants. Without the detrimental effect of these visible mutants in the controls, the difference to the CG30056 knockouts will probably be even larger. Note that the fertility effects of the genes ProtA, and ProtB (a.k.a. "Mst35B") , which the authors put in group "essential/important for male fertility" would not had been detected if assayed as the CG30056 gene: Tirmarche et al (2014; the reference cited by the authors) stated that: "In fact, the impact of Mst35B on male fertility was only revealed when mutant males were allowed to mate with a large excess of virgin females (1 for 10; Figure 3F) but not with a 1:1 sex ratio (not shown). " The authors' fertility test did not used this type of challenge. My general impression is that the fertility effects of CG30056 may actually be similar to ProtA and ProtB. I think the authors should do a proper fertility test of CG30056, or remove this section. Another possibly useful approach would be to classify the SNPB genes in those essential for male fertility and those that are not essential, because "experimentally speaking" this is a safer distinction (e.g., the fertility testes reported by other authors may also had been quick tests). Since these genes only function in sperm and are under purifying selection (otherwise they would have been lost; also, all have dN/dS < 1 ), they all most likely affect male fertility to some extent. In case the section on male fertility stays, it will be necessary to provide more details. How many males were crossed for each genotype? In some cases in Fig 2B, it seems that as low as 3, but it may be data superposition in the graph. Please provide the raw data in the supplementary material.
      2. "Our phylogenomic analyses also highlighted one Drosophila clade- the montium group of species (including D. kikkawai)- which suffered a precipitous loss of at least five SNBP genes that are otherwise conserved in sister and outgroup species (Fig 3). (...) Given our hypothesis that autosomal SNBP genes might be linked to the suppression of meiotic drive (above), we speculated that the loss of these genes in the montium group of Drosophila species may have coincided with reduced genetic conflicts between sex chromosomes in this clade." The montium data is an important part of the paper. I think the authors should test the statistical significance of this pattern.

      Other points:

      1. "The five remaining SNBP genes (Mst33A, CG30056, CG31010, CG34269, and CG42355) remain cytologically uncharacterized [30]." I think it will be interesting if the authors look at other potentially useful resources: Vibranovski et al papers which looked at gene expression in mitotic, meiotic and post-meiotic cells (https://mnlab.uchicago.edu/sppress/index.php), and the papers by several labs on testis single-cell transcriptomic data (Witt et al 2021 PLOS Genetics. 17(8):e1009728 ; Nat Commun. 2021;12: 892). These may provide additional clues on the function of SNBP genes. There is also a recent report on sperm proteome (doi: https://doi.org/10.1101/2022.02.14.480191)
      2. "Our inability to detect homologs beyond the reported species does not appear to result from their rapid sequence evolution. Indeed, abSENSE analyses [45] support the finding that Prtl99C, Mst77F, Mst33A, Tpl94 and CG42355 were recently acquired in Sophophora within 40 MYA. For example, the probability of a true homolog being undetected for Prtl99C and Mst77F is 0.07 and 0.18 (using E-value=1), respectively (Table S1, Methods)." This should be complemented by synteny analysis.
      3. I found the following sentence unclear: "However, we could only ascribe a sex chromosomal linked location for species if no data was available from either BUSCO genes or females (only males and mixed-sex flies)."
      4. "Using the available assemblies with Illumina-based chromosome assignment, we surprisingly found that most ancestrally Y-linked genes are not linked to autosomes as was previously suggested [by Dupim et al 2018] (Fig 6A)." The new result of X-linkage is exciting, but the sentence is not exact: Dupim et al 2018 made clear that they could only separate X/A from Y-linkage. E.g., the legend of their Fig 3: "Phylogeny and gene content of the Y chromosome in the montium subgroup. "M" means amplification only in males (i.e., Y-linkage), whereas "MF" means amplification in both sexes (autosomal or X-linkage)."
      5. "The most parsimonious explanation for these findings is a single translocation of most of the Y chromosome to the X chromosome via a chromosome fusion in the ancestor of the montium group of species. Afterward, some of these genes relocated back to the Y chromosome in some species (Fig S6; Supplementary text)." Explanations for this pattern of "return to the Y" have been extensively discussed and tested in Dupim et al 2008 (see their section "Why genes seem to return to the Y chromosome after Y incorporations?" ) The available evidence strongly suggests that it is not a case of relocation to the Y.
      6. Fig 6B suggests that the authors assembled the "translocated Y" in D. triauraria. However, no direct data or account for this assembly is provided. Please clarify.
      7. "Why would meiotic drive only influence Drosophila, but not mammalian, SNBP evolution? One important distinction may arise from the timing of SNBP transcription. In D. melanogaster, SNBP genes are transcribed before meiosis but translated after meiosis [29, 43, 57]. Thus, SNBP transcripts from a single allele, e.g., Xlinked allele, are inherited and translated by all sperm, regardless of which chromosomes they carry. Consequently, they can act as meiotic drivers by causing chromatin dysfunction in sperm without the allele, e.g., Y-bearing sperm." During spermatogenesis Drosophila haploid cells actually are syncytial, which has interesting consequences for the evolution of male genes (Raices et al, Genome Res. 1115-1122, 2019). This may be relevant for the present paper.

      Significance

      see above

    1. And on this logic—the same logic, by the way, that rightly grounds the conclusion that we should, for example, prefer the term “person living with depression” to “depressed person”—we should not refer to persons in ways that may imply that they are essentially defined by something that they are, in fact, managing.

      I disagree with this. I don't think it is a particularly relevant argument to make, as something like a medical condition or diagnosis does not correlate ot something like being gay. Being gay is not something people "suffer from"

    1. A lot of [my students]think people are obese because people can’t put down afork . . . [In this unit] we do research about things likegenetics . . . [to counter that notion].” In addition to theinformation about the availability of healthy food in theircommunities, this challenged the idea that obese and/oroverweight people are just lazy: they may be respondingto larger forces outside their control.

      It's always interesting for a student to challenge their own thinking and questions the things that may be thought of as "absolute truths". Growth and understanding, empathy even, comes from these realizations.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: #RC-2022-01697

      Corresponding author(s): William Roman; Edgar R. Gomes

      [The “revision plan” should delineate the revisions that authors intend to carry out in response to the points raised by the referees. It also provides the authors with the opportunity to explain their view of the paper and of the referee reports.

      The document is important for the editors of affiliate journals when they make a first decision on the transferred manuscript. It will also be useful to readers of the reprint and help them to obtain a balanced view of the paper.

      If you wish to submit a full revision, please use our "Full Revision" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We would like to thank the reviewers for their careful evaluation of our study. The goal of this work is to demonstrate that fiber type composition can be altered with exercise of in vitro muscle cultures. These findings provide an additional strategy to better mimic muscle in vitro for biological investigation and disease modelling. The reviewers’ comments will strengthen the conclusions of our study.

      In this point-by-point answer, we also include a statement on the feasibility of each comment based on preliminary work we have performed since receiving the reviews. We expect experiments can be achieved within 2 – 3 months.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Henning et al describes a method to induce myofiber subtype specification in vitro based on optogenetics and particle image velocimetry. The work is well performed and the manuscript is clear. The findings might be useful to the muscle community, but there are some issues which should be addressed in order to improve the quality and impact of the manuscript.

      My main concern is that the whole work is performed in murine cells. Although I appreciate that the authors have used primary myoblasts rather than cell lines, I also think that the key advantage of such in vitro platforms is the possibility to "humanise" the experiments as much as possible. In this context, the key findings of this work should be reproduced using human myoblasts. This will significantly enhance the relevance of the work. *

      Point 1.1) We thank the reviewer for his suggestion and have already performed some pilot experiments to “humanize” experiments. We infected hiPSC-derived myotubes (van der Wal et al., 2018) and human immortalized myotubes (Mamchaoui et al., 2011) with AAV9-pACAGW-ChR2-Venus-AAV. After infection, human immortalized myotubes did not express ChR2, not permitting optogenetic training on these cultures. For hiPSC-derived myotubes, the infection rate was very low and insufficient to perform a bulk analysis to evaluate the effect of long term intermittent light stimulation. Moreover, the contractile behavior of hiPSC-derived myotubes expressing ChR2 significantly differed from primary mouse myotubes. They underwent a single and slow contraction when compared to the cyclic contractions observed in mouse myotubes. This suggests that the maturation of the contractile apparatus of 2D hiPSC-derived myotubes is insufficient to perform consistent in vitro training studies.

      As such, we agree with the reviewer that reproducing our key findings with human cells would improve the relevance of this work. However, due to the experimental limitations described above, significant improvements in human myotube maturation in vitro are required to perform such experiments. We will attempt to increase infection efficiency by using another AAV serotype in hiPSC-derived myotubes but this has a low probability of solving all the technical limitations. Our work is a proof of principal that fiber type composition can be influenced in vitro through contraction stimulation. We expect these findings to be the translated to human cultures when the field has discovered the necessary protocols to push human myotube maturation.

      Feasibility: run additional tests but probability of success is low due to technical limitations.

      *Other issues: *

      1) From a methodological perspective, I think some clarifications are needed on the western blots shown in Fig 4K-L, as the pattern of Myh3 and Myh8 in both panels appear very similar. This could easily be ruled out by providing raw data/images. Please accept my apologies if this is simply caused by similar migration patterns in the gels (worth checking).

      Point 1.2) The very similar appearance of both patterns is due to the same molecular weight (220 kDA) of distinct myh isoforms. After an initial staining of western blot membranes, primary and secondary antibodies were stripped off and the membrane was subsequently re-probed using a primary and secondary antibody. We incubated stripped membranes with secondary antibodies only and observed no signal, confirming the stripping was efficient. We have updated the representative images of the Western Blot membranes in Figure 4 and included the α-actinin loading controls on which the bands are normalized to account for sarcomerogenesis (Figure 4 K-M).

      Feasibility: Accomplished

      *2) Figure 3K-L (BTX): better imaging should be performed to assess morphology of NMJ (eg. pretzel-shaped as in mature/adult NMJ?) *

      Point 1.3) We agree with the point raised by the reviewer. However, a morphological assessment of the NMJ is difficult in this in vitro system due to our inability to generate mature muscle end plates as seen in in vivo adult NMJs. We will nevertheless perform a more quantitative evaluation of BTX stainings imaged with high spatial resolution by measuring the size and shape of the AChR clusters. The technical pipeline to do this quantitative approach is already established.

      Feasibility: will be accomplished

      *3) Figure 3 N-P: Why did the authors used a relatively complex techniques such as smFISH to answer a question more simply addressable with more conventional (and perhaps less operator dependent) techniques such quantitative PCR?

      *

      Point 1.4) We agree with the reviewer that the more conventional qPCR technique would highlight similar results to the smFISH quantifications. Due to the heterogeneity of our primary myotube cultures (presence of non-muscle cell types and varying degrees of muscle cell maturation), we opted to monitor AChR expression by conserving a spatial dimension. This allows us to observe ChrnE and ChrnG expression in mature muscle cells selected to perform the contraction analysis. Nevertheless, performing a bulk RNA expression analysis would be informative to show a significant increase in AChR expression across the culture. This point will be fully addressed by qPCR assays of ChrnE and ChrnG.

      Feasibility: will be accomplished

      *Reviewer #1 (Significance (Required)):

      Nature and significance: as mentioned in the previous section, the work can be very significant if expanded to human myoblasts/myotubes, which can have different slow/fast myosin expression pattern. The work is clearly methodological/descriptive, so showing an application of this technique using diseased/mutant cells may increase its relevance even more (but I do not believe it is a key barrier to publication). *

      We thank the reviewer for his comments as the “other issues” raised will significantly improve the manuscript and will all be tackled. With regards to using human myotubes, we will attempt a few more strategies to translate our findings to human cultures, but our preliminary data suggests that many technical barriers need to be overcome to perform such experiments. Nevertheless, it is our opinion that the main contribution of this manuscript is to show that fiber switching can be achieved in vitro and that this will be routinely used in the next generation of human in vitro muscle systems.

      *

      *

      *Comparison with other methods: Similar methods have been published but not with this level of resolution.

      Expertise: muscle disease and regeneration, in vitro and in vivo models.*

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)): *

      * The work presented shows that muscle stem cells isolated from 5-day-old mice can be transduced with a DNA coding for a Channelrhodopsin2-Venus which will allow the muscle cell to be excited by a light beam (475nm) and to induce the contraction of myotubes. The authors measure the speed of contraction, relaxation and fatigability of such cells as a function of a more or less long excitation time. In particular, they show that myotubes in culture, excited at a frequency of 5 Hz, 8 hours per day for 7 days are larger than unstimulated myotubes and are more resistant to fatigue. Surprisingly, they show that myotubes stimulated at the low frequency of 5Hz express the neonatal Myosin heavy chain more than the slow Myh whose expression is known in adult muscle to be specifically strong in muscle fibers stimulated at low frequency. As the authors do not apply a high stimulation frequency (100Hz) to their culture, it is difficult to conclude whether the stimulation frequency applied in the study induces a specific phenotypic specialization of the myofiber, or a more general role. In this respect, the size of the myotubes obtained after training seems to be increased, showing a hypertrophic effect on the cultured myotubes. This study does not allow us to conclude, beyond the expression of the Myh8 gene, on the “gain” of the fast-twitch specialization of the myofiber by repeated stimulation over several days. A complementary study would certainly provide elements to better understand the role of muscle fiber stimulation, apart from the trophic contribution provided in vivo by the motoneuron. If the study is well conducted, some points are nevertheless important to address before publication.*

      *Reviewer #2 (Significance (Required)): *

      * - Figures 4F/G are difficult to understand: the Myh7 signal seems much higher in trained myonuclei (F), but the histogram shows the opposite (G).*

      __Point 2.1) __We apologize for the confusion. The apparent higher Myh7 signal in trained cells in Figure 4F is due to background noise in the image. When mRNA is expressed, the smFISH probes are visible as small round dots. For clarity, we updated the representative images for the smFISH probes and highlighted the smFISH dots with arrows. We also adapted the y-axis of each graph to better represent the analysis of mRNA counts per myonuclei.

      Feasibility: Accomplished

      *- Figures 4L, the western blot shows the same increase in Myh3 and Myh8 at day 4, while the graph shows an increase at d4 only in Myh8, why? *

      Point 2.2) We have chosen another western blot to better reflect the quantification. It is important to note that we have normalized the band intensity to a-actinin instead of a house keeping gene to account for changes in sarcomerogenesis over the lifetime of the cultures. As such, although we observe an increase in Myh3 intensity, it is counter balanced by an increase in a-actinin expression. We have now added the a-actinin bands.

      - For immunocytochemistry against fMyh (Fig4 H, I) as well as for Western blots (Fig 4M, N), the authors have to provide arguments regarding the specificity of the antibodies used: some fMyh-specific antibodies recognize, Myh 3, 8, 1, 2, and 4, some only Myh 8, 1, 2, and 4, so it is quite difficult to conclude on the experiments using sc-32732 antibodies, (clone F59) which Myh are actually recognized in Western blot or immunocytochemistry.

      Point 2.3) According to the manufacturer, the sc-32732 antibody is specific for fast Myh (Myh1, 2, 4 and 6). Nevertheless, we will ensure the specificity of the sc-32732 antibody against fast Myosins by staining neonatal and adult TA/EDL muscle sections with anti-Myh3 (embryonic), anti-Myh8 (neonatal) and anti-fMyh antibodies.

      Feasibility: will be accomplished

      While 10Hz stimulation is known in vivo to increase the slow program, and Myh7 expression in adult muscles, the authors show that ex vivo this is not the case with primary myotubes, with Myh7 protein level not being upregulated in the 7 day stimulation paradigm, while on the contrary Myh8 expression is upregulated. I think it would be important to quantify the mRNA of each of the Myh genes to be sure that there is no problem with the antibodies, which could recognize several Myh proteins, in the absence of a resolving acrylamide gel allowing visualization and relative level of each isoform according to its migration. Nevertheless, this is an interesting observation that could be related to the early phases of muscle contraction in vivo. Indeed, it has been shown in rats that early postnatal development animals are essentially sedentary and whose muscles (Sol and EDL) are stimulated by short intermittent bursts similar to 10Hz (doi: 10.1111/j.0953-816X.2004.03418.x) during the first 2-3 weeks of life. This should be compatible with Myh8 expression. It would be relevant in this idea to verify that the paradigm presented leads to myotubes with a "neonatal" phenotype. Quantification of the expression level of *genes specifically expressed during the neonatal period, compared with those expressed in adult slow or fast myofibers, would enhance the conclusions drawn by the authors. *

      Point 2.4) The reviewer raises an important technical limitation of observing Myh proteins to identify fiber types due to the cross-reactivity of antibodies. Despite our best efforts to select the appropriate antibodies, we agree that investigating mRNA expression of individual Myh isoforms would strengthen the conclusion of our study. We will design specific primers and perform qPCR for distinct Myh isoforms on untrained and trained cultures.

      With regards to the “neonatal” phenotype of these in vitro cultures, this does indeed seem to be the case as the cultures transition from embryonic and neonatal myosins to adult myosins during the lifetime of the cultures.

      Feasibility: will be accomplished

      *Should we also be cautious about bulk analysis since, as shown in Figure S1, not all myotubes express ChR2? *

      Point 2.5) Although 10% of myotubes do not express ChR2, we believe that 90% of infected myotubes is sufficient for bulk analysis. We nevertheless combine in our study bulk analysis with single cell assays such as smFISH and immunofluorescence, which are in line with the bulk analyses.

      Feasibility: Accomplished

      May the authors correlate the ex vivo neonatal phenotype observed with the neonatal muscles they used to prepare myogenic stem cells?

      Point 2.6) We understand from this that the reviewer would like us to check the expression of distinct Myh isoforms in our in vitro system and compare it to neonatal muscle. We will perform Myh staining of muscle sections from 6-day old mouse pups (time of myogenic stem cell isolation) and compare the expression of Myosin heavy chains with what we observe in our in vitro cultures.

      Feasibility: will be accomplished

      Overall, we will address all the points of the reviewer. Those ensuring the specificity of antibodies used are particularly relevant. With regards to the comparison between our in vitro cultures with neonatal muscle, we believe this will help contextualize our findings with the literature.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      *Summary: *

      *In this work, the authors propose an in vitro model describing a strategy to alter fiber type composition of myotubes with a long-term, intermittent mechanical training. The authors present a model of myotubes transfected with an adenovirus, which makes them photosensitive; in this way, fibers contraction can be induced upon stimulation with blue LEDs. *

      *Even though ChR2 expressing myotubes have previously been used by other groups (Asano T, Ishizua T, Yawo H. Optically controlled contraction of photosensitive skeletal muscle cells. Biotechnol Bioeng. 2012 Jan;109(1):199-204), no one has ever used it in the way proposed by the authors. For this reason, this work opens new perspectives on the possible use for clinical and therapeutic purposes for this in vitro muscle system. *

      *Major comments: *

      *I believe that the authors have presented their results, conclusion and methods in a fair and clear way, so that the experiment could also be reproduced. *

      *However, I think there are some adjustments that could be done in order to improve and strengthen the quality of this work: *

      *- The authors have analysed the expression of different myosin heavy chain isoforms, both regarding the slow and fast twitch fibers. Though, I think it would be interesting to investigate also the expression of Myh4, which is mainly expressed in type IIB fast twitch fibers; *

      Point 3.1) We agree with the reviewer’s comment. We will add the analysis for Myh 4 (western blots and qPCR) to our manuscript.

      Feasibility: will be accomplished

      The authors have observed a switch in the fiber type upon prolonged intermittent stimulation with blue LEDs, which translates into a higher number of type II fibers. It is known that exercise helps rescuing the loss of type II fibers, which is typical of age-related physiological processes, such as sarcopenia (Brunner F, Schmid A, Sheikhzadeh A, Nordin M, Yoon J, Frankel V. Effects of aging on Type II muscle fibers: a systematic review of the literature. J Aging Phys Act. 2007 Jul;15(3):336-48). However, I believe that providing a deeper analysis of the metabolism of the type II fibers (i.e. oxidative or glycolytic) could be helpful in order to have a clearer view on the specific subset of fibers that are generated with the given experimental conditions;

      Point 3.2) We agree with the reviewer's suggestion that an additional metabolic analysis would strengthen our observation. We propose to perform lactate measurements in cell lysate and supernatant to monitor a switch from oxidative to glycolytic metabolism. Specific inhibitors of the glycolytic pathway (2-DG, UK5099, Rotenone and AntimycinA) will be used as a control to prevent trained cells to shift towards a fast fiber type.

      Alternatively, we will assess the protein expression levels of key metabolic proteins involved in oxidative phosphorylation and in pyruvate and lactate production (e.g. OxPhos, …). All these techniques are routinely performed in an adjacent laboratory and we foresee no technical limitations.

      Feasibility: will be accomplished

      *Minor comments: *

      *The text and the figures are clear and well written, and help to explain better the experimental setup and procedures. Still, I would suggest some minor adjustments: *

      - I would suggest providing more information on the pH used for the experiments, since it plays a pivotal role in regulating myosin ATPase activity and, thus, muscular contractility. This would improve the replicability of your experiment.

      We thank the reviewer for this comment. We will provide information regarding the pH and add it in the method and materials section.

      Feasibility: will be accomplished

      The caption of Figure 1 is missing a description of panel E, even if it has been addressed in the text.

      Point 3.3.) We apologize for this mistake. We added the missing description of Fig. 1E.

      Feasibility: Accomplished

      *Reviewer #3 (Significance (Required)): *

      *This model opens new perspectives on in vitro muscle systems for the study of pathologies. The authors have been able to assess that myofibers contraction is able to induce a shift towards type II fibers, reproducing in vitro what is also known in vivo. For this reason, I believe that this model could be useful for further clinical approaches. It is important, though, to keep in mind that muscular disorders are not all characterized by a loss of type II fibers; for instance, myotonic dystrophies type I and type 2 exhibit similar phenotypes, even if different types of muscle fibers are affected. *

      *For this reason, it would be interesting to investigate the versatility of this model in terms of giving rise to different fiber types. *

      Point 3.4.) We added a sentence in the introduction that highlights an example of muscle disorders in which slow muscle fibers are predominately affected. Concerning the versatility of the model, we will add a paragraph to the discussion elaborating on how different stimulus frequency and durations could influence the specialization of fiber types.

      Feasibility: Accomplished

      Overall, we will address all major and minor comments from the reviewer. We have identified the experiments required for the metabolic analysis and agree that it will bolster our findings.

      Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      We have already carried out the following changes in the manuscript, which were proposed by the reviewers:

      Point 1.2: pattern of Myh3 and Myh8 in both panels appear very similar - We updated the representative images of Myh 3 and Myh8 in Figure 4 K-N __and included the loading controls Myh 8 and fMyh images in __Figure 4K-N __and to __supplementary Figure 4 A, B.

      Point 2.1: Figures 4F/G: representative images of Myh7 smFISH probe and the graph showing opposite trends – We have updated the representative images of Figure 4F and we have changed the x-axis of the graph in Figure 4E and G.

      __Point 2.5: __caution around bulk analysis we consider that based on the high percentage of contracting cells in response to blue light (~90%), this concern is not warranted.

      Point 3.3: caption of Figure 1 is missing a description of panel E – We have added the missing description to the manuscript (Figure 1E).

      Point 3.4: muscular disorders are not all characterized by a loss of type II fibers – we have added an example of a muscle disorder, in which slow fibers are predominantly affected, to the introduction (line 42-44) of the manuscript.

      investigate the versatility of this model in terms of giving rise to different fiber types – we added a paragraph to the discussion elaborating on how different stimulus frequency can lead to different fiber types (line 264-275).

      3. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      Point 1.1: Reproducing our key findings with human cells – we ran pilot experiments on immortalized human cell lines and human iPSC-derived myotubes but were not able to mature these cells sufficiently nor infect them to allow long-term in vitro training. Increased maturation of myotubes derived from hiPSCs is an endeavor currently undertaken by many laboratories. Although we will attempt a few more trials, we believe the technical limitations are too important to address this point.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The work presented shows that muscle stem cells isolated from 5-day-old mice can be transduced with a DNA coding for a Channelrhodopsin2-Venus which will allow the muscle cell to be excited by a light beam (475nm) and to induce the contraction of myotubes. The authors measure the speed of contraction, relaxation and fatigability of such cells as a function of a more or less long excitation time. In particular, they show that myotubes in culture, excited at a frequency of 5 Hz, 8 hours per day for 7 days are larger than unstimulated myotubes and are more resistant to fatigue. Surprisingly, they show that myotubes stimulated at the low frequency of 5Hz express the neonatal Myosin heavy chain more than the slow Myh whose expression is known in adult muscle to be specifically strong in muscle fibers stimulated at low frequency. As the authors do not apply a high stimulation frequency (100Hz) to their culture, it is difficult to conclude whether the stimulation frequency applied in the study induces a specific phenotypic specialization of the myofiber, or a more general role. In this respect, the size of the myotubes obtained after training seems to be increased, showing a hypertrophic effect on the cultured myotubes. This study does not allow us to conclude, beyond the expression of the Myh8 gene, on the "gain" of the fast-twitch specialization of the myofiber by repeated stimulation over several days. A complementary study would certainly provide elements to better understand the role of muscle fiber stimulation, apart from the trophic contribution provided in vivo by the motoneuron.

      If the study is well conducted, some points are nevertheless important to address before publication.

      Significance

      • Figures 4F/G are difficult to understand: the Myh7 signal seems much higher in trained myonuclei (F), but the histogram shows the opposite (G).
      • Figures 4L, the western blot shows the same increase in Myh3 and Myh8 at day 4, while the graph shows an increase at d4 only in Myh8, why?
      • For immunocytochemistry against fMyh (Fig4 H, I) as well as for Western blots (Fig 4M, N), the authors have to provide arguments regarding the specificity of the antibodies used: some fMyh-specific antibodies recognize, Myh 3, 8, 1, 2, and 4, some only Myh 8, 1, 2, and 4, so it is quite difficult to conclude on the experiments using sc-32732 antibodies, (clone F59) which Myh are actually recognized in Western blot or immunocytochemistry.
      • While 10Hz stimulation is known in vivo to increase the slow program, and Myh7 expression in adult muscles, the authors show that ex vivo this is not the case with primary myotubes, with Myh7 protein level not being upregulated in the 7 day stimulation paradigm, while on the contrary Myh8 expression is upregulated. I think it would be important to quantify the mRNA of each of the Myh genes to be sure that there is no problem with the antibodies, which could recognize several Myh proteins, in the absence of a resolving acrylamide gel allowing visualization and relative level of each isoform according to its migration. Nevertheless, this is an interesting observation that could be related to the early phases of muscle contraction in vivo. Indeed, it has been shown in rats that early postnatal development animals are essentially sedentary and whose muscles (Sol and EDL) are stimulated by short intermittent bursts similar to 10Hz (doi: 10.1111/j.0953-816X.2004.03418.x) during the first 2-3 weeks of life. This should be compatible with Myh8 expression. It would be relevant in this idea to verify that the paradigm presented leads to myotubes with a "neonatal" phenotype. Quantification of the expression level of genes specifically expressed during the neonatal period, compared with those expressed in adult slow or fast myofibers, would enhance the conclusions drawn by the authors.
      • Should we also be cautious about bulk analysis since, as shown in Figure S1, not all myotubes express ChR2?
      • May the authors correlate the ex vivo neonatal phenotype observed with the neonatal muscles they used to prepare myogenic stem cells?
    1. Author Response

      Reviewer #1 (Public Review):

      The authors are trying to determine how time is valued by humans relative to energy expenditure during non-steady-state walking - this paper proposes a new cost function in an optimal control framework to predict features of walking bouts that start and stop at rest. This paper's innovation is the addition of a term proportional to the duration of the walking bout in addition to the conventional energetic term. Simulations are used to predict how this additional term affects optimal trajectories, and human subjects experiments are conducted to compare with simulation predictions.

      I think the paper's key strengths are its simulation and experimental studies, which I regard as cleverly-conceived and well-executed. I think the paper's key weakness is the connection between these two studies, which I regard as tenuous for reasons I will now discuss in detail.

      The Title asserts that "humans dynamically optimize walking speed to save energy and time". Directly substantiating this claim would require independently manipulating the (purported) energy and time cost of walking for human subjects, but these manipulations are not undertaken in the present study. What the Results actually report are two findings:

      1. (simulation) minimizing a linear combination of energy and time in an optimal control problem involving an inverted-pendulum model of walking bouts that (i) start and stop at rest and (ii) walk at constant speed yields a gently-rounded speed-vs-time profile (Fig 2A);

      2. (experiment) human subject walking bouts that started and stopped at rest had self-similar speed-vs-time profiles at several bout lengths after normalizing by the average duration and peak speed of each subject's bouts (Fig 4B).

      If the paper established a strong connection between (1.) and (2.), e.g. if speed-vs-time trajectories from the simulation predicted experimental results significantly better than other plausible models (such as the 'steady min-COT' and 'steady accel' models whose trajectories are shown in Fig 2A), this finding could be regarded as providing indirect evidence in support of the claim in the paper's Title. Personally, I would regard this reasoning as rather weak evidence - it would be more accurate to assert 'brief human walking bouts look like trajectories of an inverted-pendulum model that minimize a linear combination of energy and time' (of course this phrasing is too wordy to serve as a replacement Title -- I am just trying to convey what assertion I think can be directly substantiated by the evidence in the paper). But unfortunately, the connection between (1.) and (2.) is only discussed qualitatively, and the other plausible models introduced in the Results are not revisited in the Discussion. To my naive eye, the representative 'steady min-COT' trace in Fig 2A seems like a real contender with the 'Energy-Time' trace for explaining the experimental results in Fig 4, but this candidate is rejected at the end of the third-to-last paragraph in the 'Model Predictions' subsection of Results based on the vague rationale that is never revisited.

      We have addressed most of this comment above, but respond here regarding Fig. 4. The argument against steady min-COT should also point out the peak speed. The Results have been revised thus: “In contrast to the min-COT hypothesis, the human peak speeds increased with distance, many well below the min-COT speed of about 1.25 m/s. The human speed trajectories did not resemble the trapezoidal profiles of the steady min-COT hypothesis for all distances, nor the triangular profiles of steady acceleration.”

      An additional limitation of the approach not discussed in the manuscript is that a fixed step length was prescribed in the simulations. The 'Optimal control formulation' subsection in the Methods summarizes the results of a sensitivity analysis conducted by varying the fixed step length, but all results reported here impose a constant-step-length constraint on the optimal control problem. Although this is a reasonable modeling simplification for steady-state walking, it is less well-motivated for the walking bouts considered here that start and stop at rest. For instance, the representative trial from a human subject in Figure 8 clearly shows initiation and termination steps that differ in length from the intermediate steps (visually discernable via the slope of the dashed line interpolating the black dots). Presumably different trajectories would be produced by the model if the constant-step-length constraint were removed. It is unclear whether this change would significantly alter predictions from either the 'Energy-Time' or 'steady min-COT' model candidates, and I imagine that this change would entail substantial work that may be out of scope for the present paper, but I think it is important to discuss this limitation.

      This is addressed elsewhere (Essential Revisions 2), but we explain more here. One of the parameter studies included step length increasing with speed according to the human preferred relationship. This is included in Fig. 3, and so we concluded that variable step lengths are not critical to the speed trajectories. A related assumption is that the energetic cost of modulating step length/frequency is small compared to the step-to-step transition cost. We believe that humans expend substantial energy for both costs, but that the overall cost of walking is still dominated by step-to-step transitions.

      With my concerns about the paper's framing and through-line noted as above, I want to emphasize that I regard the computational and empirical work reported here to be top-notch and potentially influential. In particular, the experimental study's use of inexpensive wearable sensors (as opposed to more conventional camera-based motion capture) is an excellent demonstration of efficient study design that other researchers may find instructive. To maximize potential impact, I encourage the authors to release their data, simulations, and details about their experimental apparatus (the first two I regard as essential for reproducibility - the third a selfless act of service to the scientific community).

      I think the most important point to emphasize is that the bulk of prior work on human walking has focused on steady-state movement - not because of the real-world relevance (since one study reports 50% of walking bouts in daily life are < 16 steps as summarized in Fig 1B), but rather because steady walking is a convenient behavior to study in the laboratory. Significantly, this paper advances both our theoretical and empirical understanding of the characteristics of non-steady-state walking.

      It is also significant to note the relationship between this study, where time was incorporated as an additive term in the cost of walking, with previous studies that incorporated time in a multiplicative discount in the cost of eye and arm movements. There is an emerging consensus that time plays a key role in the generation of movement across the body - future studies will discern whether and when additive or multiplicative effects dominate.

      We have acknowledged this in a brief sentence: “Indeed, we have found a similar valuation of time to explain how reaching durations and speed trajectories vary with reaching distance (Wong et al., 2021).” As an aside, in that reference we measured metabolic cost of cyclic arm reaching, combined it with a linear time cost, and predicted reaching durations vs. distance and bell-shaped hand speed trajectories. Others (Shadmehr et al. Curr Biol. 2016) have proposed multiplicative (hyperbolic) temporal discounting to explain durations, but the cost formulas are not dynamical, and cannot produce trajectories. We agree with reviewer’s point, but we think the evidence for hyperbolic discounting is not strong. Linear time costs are simpler and work at least as well. This is of great interest to us, but we didn’t discuss beyond the brief mention above, because we fear it is too far afield.

      Reviewer #2 (Public Review):

      This paper provides a novel approach to quantifying the tradeoff between energetic optimality during walking and the valuation of time to travel a given distance. Specifically, the authors investigated the relationships between walking speed trajectories, distance traveled, and the valuation of (completion) time. Time has been proposed as a potential factor influencing movement speed, but less is understood about how individuals balance energetic optimality and time constraints during walking. The authors used a simple, sagittal-plane walking model to test competing hypotheses about how individuals optimize gait speed from gait initiation to gait termination. Their approach extends literature in the space by identifying optimal gaits for shorter, partially non-steady speed walking bouts.

      The authors successfully evaluated three competing walking objectives (constant acceleration, minimum cost of transport at steady speed, and the energy-time objective), showing that the energy-time objective best matched experimental data in able-bodied adults. Although other candidate objectives may exist, the paper's findings provide a likely-generalizable explanation of how able-bodied humans select movement strategies that encompass studies of steady-speed walking.

      Overall, this paper provides a foundation for future studies testing the validity of the energy-time hypothesis for human gait speed selection in able-bodied and patient populations. Extensions of this work to patient populations may explain differences in walking speed during clinical assessments and provide insight into how individual differences in time valuation impact performance on assessments. For example, understanding whether physical capacity or time valuation (or something comparable) better explains individual differences in walking speed may suggest distinct approaches for improving walking speed.

      Strengths:

      The authors presented a compelling rationale for the tradeoffs between energetic optimality and time and their results provide strong support for a majority of their conclusions. In particular, significant reductions in the variance of experimental speed trajectories provides good support for the scaling of speeds across individuals and the plausibility of the energy-time hypothesis. Comparison to theoretical (model-based) reductions across difference time valuation (cT) parameters would further enhance confidence in the practical significance of the variance reductions. Further, while additional work is needed to determine the range of "normal" valuations of time, the authors present experimental ranges that appear reasonable and are well explained. The computational and analytical methods are rigorous and are supported by the literature. Overall, the paper's conclusions are consistent with experimental and computational results.

      The introduction of a model-based analytical approach to quantify the effects of time valuation of walking could generalize to test other cost functions, populations, or locomotion modes. Further, models of varying complexity could be implemented to test more individualized estimates of metabolic cost, ranging from 3D dynamic walking models (Faraji et al., Scientific Reports, 2018) or physiologically-detailed models (Falisse et al., Journal of The Royal Society Interface. 2019). The relatively simple set of analyses used in this paper is consistent with prior literature and should generalize across applications and populations.

      The authors justified simplifications in the analysis and addressed major limitations of the paper, such as using a fixed step length in model predictions, using a 2D model, and basing energy estimates on the mechanical work of a simple model. It is unlikely that the paper's conclusions would change given additional model complexity. For example, a 3D walking model would need to control frontal plane stability. However, in able-bodied adults, valuation of frontal-plane stability during normal walking would not likely alter the overall shape of the predicted speed profiles.

      Weaknesses:

      The primary weakness of this work is that alternative objectives may provide similar speed profiles and thus be plausible objectives for human movement. For example, the authors tested an objective minimizing the steady-speed cost of transport. This cost function is consistent with the literature, but (as predicted) unlikely to explain acceleration and deceleration during gait. An objective more comparable to the energy-time hypothesis would be to minimize the net energy cost over the entire bout, including accelerations and decelerations. This may produce results similar to the energy-time hypothesis. However, a more complex model that incorporates non-mechanical costs (e.g., cost of body weight support) may be needed to test such objectives. Therefore, the energy-time hypothesis should be considered in the context of a simple model that may be incapable of testing certain alternative hypotheses.

      We have addressed some of this comment in Essential Revisions 4.

      We are unsure what is meant by “net energy over the entire bout, including accelerations and decelerations.” Our hypothesis uses total (gross) energy over the entire bout, and already includes accelerations and decelerations. If “net” refers to the customary definition of metabolic energy minus resting, then it differs from our gross cost (Fig. 6A) only in the amount of constant offset, namely resting cost. Removing the offset is equivalent to a decrease in C_T. As shown in Fig. 3, this would reduce peak speeds magnitudes but not change the shape of the speed, peak speed, and duration patterns. There is also another interpretation where the cost of walking includes only net energy, and the cost of time includes the resting metabolic rate (Fig. 6C). This interpretation yields the same predictions, the only difference is whether resting rate is treated as an energy or a time cost. We have not made further changes, because we are unsure what the reviewer meant. The difference between net and total is at most one of degree, not of qualitatively different behavior.

      We do not address the proposed “cost of body weight support” because we are unsure of the definition. There is a hypothesis by Kram & Taylor (1990) that defines a metabolic cost rate proportional to body weight divided by ground contact time. It is unclear if this is what reviewer is referring to, so we did not include it in the manuscript. However, IF this is what reviewer means, we do not consider the Kram & Taylor (“K&T”) cost to be a viable hypothesis for computational models. It is a correlation observed from data, which is inadequate as a model, for several reasons. First, in a model optimization, it leads to absurd predictions, because metabolic cost could then be reduced simply by increasing stance (contact) time. A model could do so simply by walking with very long double support phases, or running with a very brief aerial phase, both of which people clearly do not do. In walking, extended double support durations result in much higher metabolic cost (Gordon et al., APMR 2009). Models must operate quite literally on whatever objective they are given, and here, a literal interpretation of K&T makes absurd predictions.

      Another issue with the K&T cost is that it is not mechanistic. A mechanistic model is concerned with the forces and work performed by an actuator such as muscle. Muscles experience forces far greater than body weight, not captured by the K&T cost. Of course, overall cost for animal locomotion is roughly proportional to body weight, but what a model needs is a cost associated with its control inputs, e.g. actuator forces.

      We have also examined the K&T hypothesis in previous publications. In Schroeder & Kuo (Plos Comp Biol 2021), we used a simple model of running that minimizes an energetic cost dominated by mechanical work. Even though the model has no cost similar to K&T, its predicted metabolic cost is correlated with the K&T cost. Correlation does not imply causation, which is known in this model.

      We have also examined the K&T hypothesis in experimental data. In Riddick & Kuo (Sci Rep 2022), we examined human data and found that there are many variables that correlate quite well with metabolic cost, including the K&T correlate. We use human data to show how mechanical work could explain metabolic cost, and even if it does, the K&T cost appears as a correlate. In our interpretation, both model and data that experience an energetic cost proportional to mechanical work may have a number of variables correlated to energy cost. Those correlates need not have any causal influence.

      There are, of course, many similar correlates that could be or have been proposed to explain the metabolic cost of running. Most such correlates are not operational enough to work in a model, and it is also difficult to predict what a reader might consider plausible, even if we do not.

      We agree with this statement: “the energy-time hypothesis should be considered in the context of a simple model that may be incapable of testing certain alternative hypotheses.” In fact, in Discussion of limitations we listed other potential factors (e.g. forced leg motion, stability, 3D motion), and stated “We did not explore more complex models here, but would expect similar predictions to result if similar, pendulum-like principles of work and energetic cost apply.” We had also cited other models that include such factors and are compatible with the step-to-step transition concept. Finally, we already stated, “the Energy-Time hypothesis should be regarded as a subset of the many factors that should govern human actions, rendered here in a simple but quantitative form.” We believe this is already aligned with reviewer’s comment.

      An experimental design involving an intervention to perturb the valuation of time would provide stronger support for time being a critical factor influencing gait speed trajectories. The authors noted this limitation as an area of future work.

      While the results are compelling, the limited sample size and description of participants limit the obvious generalizability of the results. Older adults tend to have higher metabolic costs of walking than younger adults, which may alter the predicted time-energy relationships (Mian OS, et al., Acta physiologica. 2006). As noted in the introduction, differences in walking speeds have been observed in different living environments. General information on where participants lived (city, small town, etc...) may provide readers with insight into the generalizability of the paper's conclusions. Additionally, the experimental results figures show group-level trends, but individual-specific trends and the existence of exceptional cases are unclear.

      We wish to defend the “limited sample size.” The present sample size was (in our opinion) sufficient to test the hypothesis, and we have reported confidence intervals and other statistics where appropriate. (As always, it is up to the individual reader to decide whether they are convinced or not.) It is true that the data may be insufficient for other purposes, but we cannot anticipate or address all other purposes.

      We appreciate the relevant connection to aging. We have added to Discussion, “We do not know whether that family [of trajectories] also applies to older adults, who prefer slower steady speeds and expend more energy to walk the same speed (Malatesta, 2003). Perhaps an age-related valuation of time might explain some of the differences in speed.”

      We agree about the participants, and have added “Subjects were recruited from the community surrounding the University of Calgary; the city has a moderately affluent population of about 1.4 M, with a developed Western culture.”

      No specific reviewer recommendation was made about individual-specific trends, but there are several indicators already included in the manuscript. First, all trials from all subjects are shown in Fig. 4A. Any truly exceptional cases should be visible as substantial deviations from the group. Second, the normalization by peak speed in Fig. 4B shows how individuals tend to be fairly consistent in their preferred speeds, in that shorter and longer bouts of an individual are consistent with each other, even if some walk faster than others. Third, this observation is analyzed more quantitatively by the reduction in standard deviations with normalization (Results). Fourth, we will provide a data repository with all the data, to allow readers to inspect individuals more carefully (Data availability statement).

      The authors' interpretation of clinical utility is vague and should be interpreted with caution. A simple pendulum-based walking model is unlikely to generalize to patient populations, whose gait energetics may involve greater positive and negative mechanical work (Farris et al., 2015; Holt et al., 2000). Additionally, the proposed analytical framework based on mechanical work as a proxy for the metabolic cost may not generalize to patient populations who have heterogeneous musculotendon properties and increased co-contraction (e.g., children with cerebral palsy; Ries et al., 2018). Consequently, the valuation of time for an individual could be incorrectly estimated if the estimates of metabolic cost were inaccurate. Therefore, as the authors noted for their able-bodied participants, more precise measures of metabolic rates will be critical for translating this work into clinical settings.

      We agree, and did not intend to say that clinical populations must walk the same way, rather that the Normal patterns could be used as a basis of comparison. To make this clearer, we have amended the Discussion of clinical implications (new text emphasized): “it may be possible to predict the duration and steady speed for another distance, referenced from a universal family of walking trajectories. We have identified one such family that applies to healthy individuals with pendulum-like gait. Of course, some clinical conditions might be manifested by a deviance from that family, perhaps in the acceleration or deceleration phases, or in how the trajectories vary with distance. If quantified, such deviance might prove clinically useful… the characterization of distance-dependent speed trajectories can potentially provide more information than available from steady speed alone.”

      We agree that the valuation of time can be inaccurate if the metabolic cost is inaccurate. That is why we did not make a precise estimate of the valuation. We have amended the text to help clarify that our rough estimates are based on previous data. In addition, our general scientific intent is to reveal behavioral sensitivities, for example of walking duration to bout distance, as opposed to absolute numerical quantities.

    1. I believe Victor Margolin when he says that he developed his own system. That's what I did in the years before people started widely discussing personal knowledge systems online. Nobody taught me how to do it when I was in college. @chrisaldrich repeatedly tries to connect everyone's knowledge practices to an ongoing tradition that stretches back to commonplace books, but he overstates it. There is such a thing as independent development of a personal knowledge system. I know it because I've lived it. It's not so difficult that it requires extraordinary genius.

      Reply to Andy https://forum.zettelkasten.de/discussion/comment/16865#Comment_16865

      Andy, I'll take you at your word. You're right that none of it requires extraordinary genius--though many who seem to exhibit extraordinary genius do have variations of these practices in their lives, and the largest proportion of them either read about them or were explicitly taught them.

      With these patterns and practices being so deeply rooted in our educational systems for so long (not to mention the heavy influences of our orality and evolved thinking apparatus even prior to literacy), it's a bit difficult for many to truly guarantee that they've done these things independently without heavy cultural and societal influence. As a result, it's not a far stretch for people to evolve their own practices to what works for them and then think that they've invented something new. The common person may not be aware of the old ideas of scala naturae or scholasticism, but they certainly feel them in their daily lives. Commonplacing is not much different.

      By analogy, Elon Musk might say he created the Tesla, but it's a far bigger stretch for him to say that he invented a new means of transportation, or a car, or the wheel when we know he's swimming in a culture rife with these items. Humans are historically far better at imitation than innovation. If people truly independently developed systems like these so many times, then in the evolutionary record of these practices we should expect to see more diversity than we do in practice. We might expect to see more innovation than just the plain vanilla adjacent possible. Given Margolin's age, time period, educational background, and areas of expertise, there is statistically very little chance that he hadn't seen or talked about versions of this practice with several dozens of his peers through his lifetime after which he took that tacit knowledge and created his own explicit version which worked for him.

      Historian Keith Thomas talks about some of these traditions which he absorbed himself without having read some of the common advice (see London Review of Books https://www.lrb.co.uk/the-paper/v32/n11/keith-thomas/diary). He also indicates that he slowly evolved to some of the often advised practices like writing only on one side of a slip, though, like many, he completely omits to state the reason why this is good advice. We can all ignore these rich histories, but we'll probably do so at our own peril and at the expense of wasting some of our time to re-evolve the benefits.

      Why are so many here (and in other fora on these topics) showing up regularly to read and talk about their experiences? They're trying to glean some wisdom from the crowds of experimenters to make improvements. In addition to the slow wait for realtime results, I've "cheated" a lot and looked at a much richer historical record of wins and losses to gain more context of our shared intellectual history. I'm reminded of one of Goethe's aphorisms from Maxims and Reflections "Inexperienced people raise questions which were answered by the wise thousands of years ago."

    1. drawing on materials less often granted the legitimacy of academic preservation.

      This line reminded me of something a friend told me recently. The last few generations of humans will be the first generations that will have living memories (Videos and audio) of them living their lives. To think that in several hundred years, someones great great great great grandchild may be able to pull up a video of us today and say to their grandchildren that this is how we used to be/look/interact with our world is immensely interesting in my opinion. We don't have videos of what life was like in the 1500's. We do have paintings and records and thus can fill in the blanks with our imaginations, but future generations won't have to employ that technique nearly as much as we do.

    1. Part of the activation energy required to start any task comes from the picture you get in your head when you imagine doing it. It may not be that going for a run is actually costly; but if it feels costly, if the picture in your head looks like a slog, then you will need a bigger expenditure of will to lace up. Slowness seems to make a special contribution to this picture in our heads. Time is especially valuable. So as we learn that a task is slow, an especial cost accrues to it. Whenever we think of doing the task again, we see how expensive it is, and bail. That’s why speed matters.

      The story you tell yourself creates reality.

    1. Abstract

      This work has been published in GigaScience Journal under a CC-BY 4.0 license (https://doi.org/10.1093/gigascience/giac034 and has published the reviews under the same license. These reviews were as follows.

      Reviewer 1. Sean Walkowiak

      First review: Comment 1: The authors could more clearly and accurately present and discuss sequencing and assembly approaches, including the advantages and limitations of the ONT assembly presented here

      While the standards of 'quality' for assemblies are evolving, there are standard sets of 'science-based' criteria for considering the quality of a genome, such as the 14 criteria listed in the manuscript here: https://www.nature.com/articles/s41586-021-03451-0#Tab1. Many of these criteria are ambitious, particularly for wheat due to its size and complexity, and many criteria are not met using previous assembly approaches, or the approaches used in this study. It is true that CS and 10+ Wheat Genomes do not use long reads; however, these assemblies are valuable and have been rigorously validated using 10X Genomics, Hi-C, and long read data. They also perform well for TE content, BUSCO (as outlined by Tables 1 and 2 and Fig 3 in this manuscript), and they were actually used in this MS as a reference for guiding the ONT assembly. I would also expect that they have a better base pair accuracy than the assembly presented here. I therefore suggest that the authors revise their statement "these assemblies have been produced using short-read technologies and are therefore not up to the quality standard of current genome assemblies". If the authors wish to discuss assembly quality, which I recommend they should, I suggest focusing on advantages and limitations of each technology and assembly approach in a constructive way, perhaps with a stronger focus on the value of the ONT resource developed here. In regards to base pair accuracy, ONT is at a disadvantage to short reads or to PacBio. This is particularly true in the context of HiFi reads, which have increased accuracy over ONT and Illumina and have greater lengths than Illumina, but PacBio and HiFi are not discussed. This is not to say that PacBio is superior in every way, the reads from ONT are longer and these hold a significant value. As an example of differences between PacBio and ONT that might provide useful context to describe the differences between ONT and PacBio approaches, please see: https://pubmed.ncbi.nlm.nih.gov/33319909/, for differences between short read (TriTex) and PacBio, please see https://www.nature.com/articles/s41586-020-2947-8 . All of these approaches are valuable but have both advantages and limitations, with ONT also having many clear advantages and disadvantages. But these need to be clearly communicated and supported, either through the results of this study or through the literature. For example, in the discussion, the authors state that "ONT devices HAVE a real advantage over other long-read technologies". There is only one other long read sequencing technology, so are if you saying that ONT HAS a 'real advantage' over PacBio based on read length, this is valid, but can be stated more explicitly and with examples of the read lengths from this study and the literature. It is then stated that the "error rate is drastically reduced for nanopore", again this valuable and a great advancement in regards to ONT, but it would be wise not to dismiss that this error rate is still higher than PacBio HiFi, which again can be stated explicitly with support from the literature. While both of these concepts are important, after they are stated, they are not actually discussed or framed to highlight the work from this study. The true advantage of ONT, even over PacBio HiFi, is that the long reads can resolve more complex regions that span greater distances, which are abundant in wheat (see reference from above). The authors are presenting an exciting and valuable resource with this genome assembly and this assembly has advantages due to the application of ONT, for the reasons mentioned above regarding long complex regions, but these are not fully highlighted and the authors do not take full advantage of what this assembly has to offer. I think the authors should provide additional context and support related to the value and drawbacks of their ONT assembly. The advantages are discussed superficially at the gene level through a couple of examples (Fig 5), though none of these examples are supported with any significant biological data or validation analysis. There are many interesting features of genomes that are captured by ONT that are not captured well by short reads or PacBio, and it is unfortunate that these are not explored in any significant depth in the manuscript.

      Comment 2: Some of the 'highlighted features' in the manuscript could be better selected/executed

      This comment relates to the previous comment on having little detail on what the ONT genome is uniquely capable of providing over other approaches. Instead, the authors focus on some anomalies in the D genome as well as differences in the nanopore software for base calling. It is unclear to me what the objective is of the report on the D genome. I suspect that this may be due to differences in repeat content between D and the other subgenomes, or an artifact of the tools and analyses used. Page 6, Figures S1 and S2, may be a consequence of poor read filtering for reads that align ambiguously - i,e perhaps reads from A and B may crossmap at a greater likelihood than those from D due to differences/similarities in repeat content between subgenomes. Once reads are aligned, the alignments should be properly filtered using standard 'best practices for NGS'- I do not see that any filtering or analysis of cross mapping was performed, but I may have missed it. Once the alignments are filtered, read coverage dips and peaks can then be assessed statistically using tools such as CNVnator and cn.mops, which are designed specifically for comparative read depth analysis since depth may not be normally distributed, rather than arbitrarily looking at 2 times the median. There may be differences between genes and intergenic regions in terms of mapping accuracy, so it may be ideal to interrogate read depth for those separately. The increased gaps is also interesting and I wonder if this could be due to the read accuracy of ONT and read mapping and assembly biases when having similar subgenomes.

      Nevertheless, the results and discussion on the D genome are interesting but distracting and likely reflect that the authors should take more time to explore their data and its biases before presenting this information. In summary, I believe that additional work is needed to bring value to the read depth and D genome analysis should the authors choose to include this in the manuscript. While I agree that it would be useful to communicate that a significant gain was observed when basecalling using the more accurate basecaller, the emphasis on this is disproportionate to its value in the manuscript. The observation of a better assembly when using reads from a more advanced basecaller is not something new. As for the error rate of the ONT between organisms (yeast and wheat), with a sample size of 2, I do not think that this is worth presenting or discussing in any detail. While this may just be an artifact of the DNA quality itself from two experiments, I suspect that this may be a valid result from the manuscript and due to sequencing repeats, which are more abundant in wheat, in combination with how these basecallers self train to be more accurate. While this is certainly valid, it is not novel or interesting. This result comparing species was not tested with sufficient scientific rigor/evidence, it distracts from the central result of the manuscript, and just reaffirms something that we already known about the basecalling software and challenges of sequencing homopolymers and the importance of getting accurate reads using the more advanced basecalling methods.

      Comment 3: Why Renan? This comment relates to the other two comments on the selected areas of focus. The biological story, which was on gliadins, was of some value and highlighted some of the advantages of an ONT assembly, but this was not supported by any significant biological data. Renan is a well-known cultivar with abundant genomic data, mapping populations, trait data for diseases, etc. It is unfortunate that the authors could not use the genome to dig deeper to more thoroughly demonstrate the value of this assembly specifically in the context of ONT and genomics of wheat or the biology of wheat and Renan, specifically. With abundant QTL data available specifically for Renan, these could have easily been anchored to the assembly to highlight novel transcripts from the RNAseq from this study, just as an example. Even the comparisons of the Renan assembly to other available assemblies was mostly superficial and did not highlight in significant detail the value of having an ONT assembly or the value of having data specifically for Renan. While a detailed 'biological story' may be beyond the scope of this manuscript, there was minimal effort to highlight the value of the assembly, and this comment is more of a larger reflection that more could have been done to highlight the value of the genome to support the author's vague claims that the genome "will benefit the wheat community and help breeding programs".

      Minor Comments The absence of numbered lines made it difficult to provide more detailed feedback, but there are minor items throughout, so I suggest numbering the lines and also giving the manuscript a thorough review. I appreciate that the authors present and suggest methods for future assembly of complex genomes using ONT, but unlike the abstract states 'we also provide the methodological standards to generate high-quality assemblies of complex genomes'. I would argue that the standards used for ONT assembly are known and are not established here. I would also suggest caution when stating that the methods here should be considered the 'standard' for the reasons indicated in Comment 1 regarding other approaches used to assemble complex genomes, such as PacBio/HiFi, and the lack of a proper investigation/discussion/comparison of assembly quality.

      Page 2: last line - what is the abbreviation ca. ? Table 1: Busco is presented twice with different values. Table 1 and 2 use different versions of RefSeq, I would stick to one version. It is unclear to me what trend or result is that the authors are trying to present in figure 1, which I would say is common for circos plots. Presenting data 'for the sake of presenting it' is not terribly valuable and I would encourage the authors to use the figures to present a trend or result that is impactful. In addition, the data that is presented is not presented clearly, and is cryptic. The roman numerals in the figure caption for Figure 1 are not actually in the figure. The caption also indicate that the dots indicate lower and higher values, but not of what - perhaps density of gaps? The color scales are not presented for each track. Two of the color scale pallets also look similar.

      Page 6: 62% of exons were identical, which means 48% had SNPs, so the authors argue that SNPs are therefore rare at 48% of exons? I do not think that 48% of exons having SNPs is rare, I think it that this would mean that nearly half of exons have SNPs, so this is therefore common. Perhaps this statistic is misleading and the focus should instead be on the 0.7% divergence. How does this value compare with other within species comparisons of gene content and could this be an artifact of ONT accuracy? This question relates to a general comment that the authors could do better at bringing relevant comparisons or parallels in from the literature throughout the manuscript to bring value to any findings or insights they are presenting. Particularly in the context of other ONT assemblies.

      Page 7, capitalize the T for technology, it is part of the name of the company and is a proper noun. This is repeated elsewhere.

      Page 7: 'on wheat'? this statement could be written more clearly The way that the text is worded, it sounds like the basis for selecting the SmartDenovo assembly was the number of unknown bases, when I suspect it was actually a multitude of factors (BUSCO, gene or TE content, assembly stats, etc). While I do not question the selection of the assembly, I do suggest a clearer presentation of the information. I appreciate that the authors presented the data from multiple assemblers, one of the concerns with ONT is that the read accuracy is low and this may lead to issues in assembly of complex polyploids with similar subgenomes. I suspect that based on this study, it is clear that this is a valid concern for some assemblers, but may have been overcome in others. Though none of this is explored or discussed. Again, is there any information in the literature contrasting assemblers that could provide insights into what you observed?

      Searches at 90% identify and coverage for genes and TEs is not strict, especially with genomes that have highly identical subgenomes. If you reduce your thresholds enough, all features will map to your genome.....

      The choice of language is often objective or not representative of the results. For example, the 'extremely' similar TE content between Renan and CS. Why not say it is similar and actually report a value or a % difference. This would be more concise and informative than using vague and overzealous language. Page 8, short reads (dash or no dash?) The font sizes in Figure 2 are very small.

      The RNAseq is not really presented at all, except in the Materials and Methods. I thought the genes were ab initio predicted until I saw RNAseq in the materials and methods. I suggest at least making a note of RNAseq into the results and/or discussion since this additional effort does bring added value to the annotations and the manuscript. The discussion says de novo annotations, but I suggest explicitly stating that RNAseq was performed.

      Figure 3 C and D do not have horizontal axis labels, the top should be labelled as subgenome, bottom as chromosome, and the vertical axis (not the top) should be labelled as number of gaps and chromosome length. Same comment for labelling of vertical axis for panels A and B, horizontal axis should be labelled as genome assemblies, which are reflected in the pallet/legend. Note that many of the colours in this pallet are similar and difficult to differentiate, it may actually take less space to label the bars with each wheat line to make it less cryptic.

      How were the dotplots in figure 4 generated? Perhaps I missed it in the materials and methods. Also one of the axis have labels or units, etc.

      Much of the text in Figure 5 is too small and illegible.

      Page 10: The discussion is superficial and vague and should provide an accurate and pragmatic discussion of the results in the context of the literature. For example, the manuscript boasts a 'higher resolution'... but of what? Perhaps 'complex repetitive regions'? To reiterate my previous comment on the lack of literature support throughout the manuscript - Were these 'higher resolutions' of <complex repetitive regions> comparable to what was observed in the literature when ONT was applied to other systems? Again, these advantages of ONT and the assembly could be more thoroughly

      Re-review:

      The revised manuscript addresses the major concerns/comments. The assembly and its report are an exciting new resource for the wheat community. I only have one very minor comment below:

      When writing variety names in text and figures, it is important to be exact because there are many varieties with similar names internationally. CDC Stanley, not "Stanley"; CDC Landmark, not "Landmark"; "LongReach Lancer", not "Lancer", not "LongRead Lancer" - typo on line 308. I suggest performing a thorough check throughout.

    1. Whenever I read about the various ideas, I feel like I do not necessarily belong. Thinking about my practice, I never quite feel that it is deliberate enough.

      https://readwriterespond.com/2022/11/commonplace-book-a-verb-or-a-noun/

      Sometimes the root question is "what to I want to do this for?" Having an underlying reason can be hugely motivating.

      Are you collecting examples of things for students? (seeing examples can be incredibly powerful, especially for defining spaces) for yourself? Are you using them for exploring a particular space? To clarify your thinking/thought process? To think more critically? To write an article, blog, or book? To make videos or other content?

      Your own website is a version of many of these things in itself. You read, you collect, you write, you interlink ideas and expand on them. You're doing it much more naturally than you think.


      I find that having an idea of the broader space, what various practices look like, and use cases for them provides me a lot more flexibility for what may work or not work for my particular use case. I can then pick and choose for what suits me best, knowing that I don't have to spend as much time and effort experimenting to invent a system from scratch but can evolve something pre-existing to suit my current needs best.

      It's like learning to cook. There are thousands of methods (not even counting cuisine specific portions) for cooking a variety of meals. Knowing what these are and their outcomes can be incredibly helpful for creatively coming up with new meals. By analogy students are often only learning to heat water to boil an egg, but with some additional techniques they can bake complicated French pâtissier. Often if you know a handful of cooking methods you can go much further and farther using combinations of techniques and ingredients.

      What I'm looking for in the reading, note taking, and creation space is a baseline version of Peter Hertzmann's 50 Ways to Cook a Carrot combined with Michael Ruhlman's Ratio: The Simple Codes Behind the Craft of Everyday Cooking. Generally cooking is seen as an overly complex and difficult topic, something that is emphasized on most aspirational cooking shows. But cooking schools break the material down into small pieces which makes the processes much easier and more broadly applicable. Once you've got these building blocks mastered, you can be much more creative with what you can create.

      How can we combine these small building blocks of reading and note taking practices for students in the 4th - 8th grades so that they can begin to leverage them in high school and certainly by college? Is there a way to frame them within teaching rhetoric and critical thinking to improve not only learning outcomes, but to improve lifelong learning and thinking?

    1. https://www.youtube.com/watch?v=zCLCIw-HSJc

      I'm curious if you knew if Nelson, Engelbart or any of their contemporaries had/maintained/used commonplace books or card indexes as precursors of their computing work? That is, those along the lines of those most commonly used by academics, for example as described by Markus Krajewski in Paper Machines (MIT Press, 2011) or even Beatrice Webb's Appendix C on Note Taking in My Apprenticeship (Longmans, 1926) in which she describes a slip (or index card)-based database method of scientific note taking. I've always felt that Vannevar Bush held things back unnecessarily by not mentioning commonplace book traditions in As We May Think.

    1. Author Response

      Reviewer #1 (Public Review):

      The authors define regulatory networks across 77 tissue contexts using software they have previously published (PECA2, Duren et al. 2020). Each regulatory network is a set of nodes (transcription factors (TF), target genes (TG), and regulatory elements (RE)) and edges (regulatory scores connecting the nodes). For each context, the authors define context-specific REs, as those that do not overlap REs from any of the other 76 contexts, and context-specific regulatory networks as the collection of TFs, TGs, and REs connected to at least one context-specific RE. This approach essentially creates annotations that are aggregated across genes, elements, and specific contexts. For each tissue, the authors use linkage disequilibrium score regression (LDSC) to calculate enrichment for complex trait heritability within the set of all REs from the corresponding context-specific regulatory network. Heritability enrichments in context-specific regulatory network REs are compared with heritability enrichments in regions defined using other approaches.

      We thank the reviewers for the pertinent and precise summary of our paper.

      Reviewer #2 (Public Review):

      In this manuscript the authors develop a method, SpecVar, to perform heritability estimation from regulatory networks derived from gene expression and chromatin accessibility data. They apply this approach to public datasets available in ENCODE and Roadmap Epigenomics consortia as well as GWAS phenotype associations in UK Biobank. It promises to be a powerful method to interpret mechanisms from genetic associations. Below are some strengths and weaknesses of the paper.

      Strengths

      • The method performs heritability enrichment on two major genomic data types: gene expression and chromatin accessibility.

      • This method leverages gene regulatory networks to perform the heritability estimation, which may better capture complex disease architecture.

      • The authors perform an extensive comparison to other LDSC-based approaches using different tissue datasets.

      Weaknesses

      (1) This approach may represent a modest advance over existing LDSC methods when looking at other complex traits.

      (2) The authors only compare with LDSC using different functional annotations as input, which may not be appropriate. A more broad comparison with other heritability methods would be helpful.

      (3) The method seems to be applied to "paired" data, but this is still bulk profiles not paired single-cell RNA/ATAC data.

      The authors successfully applied a regulatory network approach to improving the heritability estimation of complex traits by using both gene expression and chromatin accessibility data. While the results could be further strengthened by comparing them to other network and non-network-based methods, it provides important insight into a few traits beyond the standard LDSC model with different functional annotations.

      Given that this method is based on the widely used LDSC approach it should be broadly applied in the field. However, the authors should consider adapting this to single-cell data as well as admixed human population genetic data.

      We thank the reviewer for the positive comment on our work by specifically pointing out that SpecVar is a powerful method to interpret mechanisms from genetic associations. We appreciate that the reviewer’s summarized “Strength” part well captures our major contribution in building an atlas of regulatory networks by integrating paired gene expression and chromatin accessibility data, leveraging regulatory networks to perform the heritability enrichment, and identifying relevant tissues and estimate relevance correlation. We also thank the reviewer for pointing out the weakness to further enhance our results. To address the comments, we (1) performed ablation studies and added more description to clarify the novelty of our methods; (2) conducted extensive comparison to another network-based method CoCoNet and non-network-based method RolyPoly; (3) discussed the promising direction in identification of relevant contexts at cell type level by leveraging single cell multi-omics profiles and application on admixed populations.

      Reviewer #3 (Public Review):

      Identifying the critical tissues and cell types in which genetic variants exert their effects on complex traits is an important question that has attracted increasing attention. Feng et al propose a new method, SpecVar, to first construct context-specific regulatory networks by integrating tissue-specific chromatin states and gene expression data, and then run stratified LD score regression (LDSC) to test if the constructed regulatory network in tissue is significantly associated with the trait, measured by a statistic called trait relevance score in this study. They apply their method to 6 traits for which there exists prior evidence on the most relevant tissues in the literature, and then further apply to 206 traits in the UK Biobank. They find that compared to LDSC using other sources of information to define context-specific annotations, their method can "improve heritability enrichment", "accurately detect relevant tissues", helps to "interpret SNPs" identified from GWAS, and "better reveals shared heritability and regulations of phenotypes" between traits.

      We thank the reviewer for the summary and appreciation of our efforts to address the important question: identifying the critical tissues and cell types in which genetic variants exert their effects on complex traits.

      However, I think it requires more work to understand where exactly the benefits come from and the statistical properties of their proposed test statistic (e.g., how to perform hypothesis tests with their relevance score and whether the false positive rate is under control). In addition, it's not clear to me what they can conclude about the shared heritability (which means genetic correlation) by comparing their relevance score correlation across tissues to the phenotypic correlation between traits.

      We thank the reviewer’s advice to do more work to enhance the statistical rigorousness of SpecVar. We have added the significant test of heritability enrichment and our proposed R score in the revision. We also clarified that SpecVar can use common relevant contexts and shared SNP-associated regulatory networks as potential explanation for the correlation between traits.

      They show that SpecVar gives much higher heritability enrichment than the other methods in the trait-relevant tissues (Fig. 2). The fold enrichment from SpecVar is extremely high, e.g., more than 600x in the right lobe of the liver for LDL. First, I think a standard error should be given so that the significance of the differences can be assessed. Second, it is very rare (hence suspicious) to observe such a huge enrichment. Since SpecVar is based on LDSC, the same methodology that other methods in comparison depend on, the differences to the other methods must come from the set of SNPs annotated for each tissue. I think it is important to understand the difference between the SpecVar annotated SNPs and those from other methods. For example, is the extra heritability enrichment mainly from the SpecVar-specific annotation or from the intersection narrowed down by SpecVar?

      The reviewer has pinpointed a question about one important advantage of our method to improve heritability enrichment. We addressed this question by first providing standard errors, p values, and q values of heritability enrichment. Second, we conduct the ablation analysis to study the source of extra heritability enrichment. This question greatly helps us to clarify the main contribution of our method.

      They propose to use the relevance score (R score) to prioritise trait-relevant tissues. In Fig. 3, they show tissue-trait pairs with the highest R scores, and from there they prioritise several tissues for each trait (Table 1). I can see that some tissue has an outstanding R score, however, it is not clear to me where they draw the line to declare a positive result. The threshold doesn't seem to be even consistent across traits. For example, for LDL, only the right lobe of the liver is identified although other tissues have R scores greater than 100, whereas, for EA, Ammor's horn and adrenal gland are identified although their R scores are apparently smaller than 100. It seems to me they use some subjective criteria to pick the results. It leads to a serious question on how to apply their R score in a hypothesis test: how to measure the uncertainty of their R score? What significance threshold should be used? Whether the false positive rate is under control? (Without knowing these statistical properties, readers won't be able to use this method with confidence in their own research.

      We thank the reviewer to raise the question about the hypothesis test of the R score. We used the block Jackknife stratagem to estimate standard errors, p values, and q values in our revision. We added the new result to the main text and they greatly enhanced the statistical rigorousness of our method.

      Another related comment to the above is to investigate false positive associations, they should show the results for all tissues tested to see if SpecVar tends to give higher R scores even in tissues that are not relevant to the trait. It would also be useful to include some negative control traits, such as height for brain tissues.

      We agree that negative control is important and the six phenotypes in our manuscript are negative for each other. For example, LDL is relevant to liver tissue and not relevant to brain tissue. Educational attainment is relevant to brain tissue but not relevant to liver tissue.

      Fig. 3 shows that tissues prioritised by LDSC-SAP and LDSC-SEG seem to make less sense than those from SpecVar. However, some of the results are not consistent with the LDSC-SEG paper (Finucane et al 2018). For example, LDL was significantly associated with the liver in Finucane et al (Fig. 2), but not in this study. How to explain the difference? (Question 3)

      We checked the results in Figure 3 and found that even though the liver was not ranked to be top 5 tissues, it has a significant P-value to LDL in our implementation. There is indeed some difference in heritability enrichment and P-value between the LDSC-SEG paper and our implementation. And the difference was from the different sets of tissues (77 tissues in our paper and 53 tissues in the LDSC-SEG paper) for the two applications.

      The authors highlight an example where SpecVar facilitates the interpretation of GWAS signals near FOXC2. They find GWAS-significant SNPs located in a CNCC-specific RE downstream of FOXC2 and reason these SNPs affect brain shape by regulating the expression of FOXC2. I think more work can be done to consolidate the conclusion. For example, if the GWAS signals are colocalised with the eQTL for FOXC2 in the brain. Also, note that the top GWAS signal is actually on the left of the CNCC-specific RE (Fig. 4b). A deeper investigation should be warranted.

      We agree that more work should be done to consolidate the regulation of FOXC2. In our revision, we used the HiChIP loop in the brain to support the SNP-associated regulation of FOXC2. We also thank the reviewer’s suggestion for the idea of eQTL colocalization and we conduct eQTL colocalization analysis on our method-revealed SNP-associated regulation to show our method can facilitate the fine mapping of GWAS signals. Lastly, brain shape is a complex trait and may be relevant to multiple tissues. Hence it is reasonable to suspect that the top GWAS signal may be active in other relevant tissues’ regulatory elements.

      They show that SpecVar's relevance score correlation across tissues can better approximate phenotypic correlation between traits. However, the estimation of the phenotypic correlation between traits is neither very interesting nor a thing difficult to do (it can be directly estimated from GWAS summary statistics). A more interesting question is to which extent the observed phenotypic correlation is due to common genetic factors acting in the shared tissues/cell types/pathways/regulatory networks between traits. Note that in their Abstract, they use words "depict shared heritability and regulations" but I don't seem to see results supporting that.

      We are sorry that we didn’t make it clear how SpecVar “depict shared heritability and regulations”. We added more results and one example in the UKBB application to show SpecVar can use common relevant contexts and shared SNP-associated regulatory networks as potential explanation for the correlation between traits.

      Line 396-402: "For example, ... heritability could select most relevant tissues ... but failed to get correct tissues for other phenotypes ... P-value could obtain correct tissues for CP ... but failed to get correct tissues for ... SpecVar could prioritize correct relevant tissues for all the six phenotypes." Honestly, I find hard to judge which tissues are "correct" or "incorrect" for a trait in real life. It would be more straightforward to compare methods using simulation where we know which tissues are causal.

      We thank the reviewers to pinpoint the improper statement of “correct”. It is difficult to find phenotypes with gold-standard relevant tissues and we used six relatively well-studied phenotypes with prior knowledge of possible relevant tissues in our paper. We revised the “correct” statement in our revision.

    1. Author Response

      Reviewer #2 (Public Review):

      Reinforcement learning (RL) theory is important because it provides a broad, mathematically proven framework for linking behavioral states to behavioral actions, and has the potential for linking realistic biological network dynamics to behavior. The most detailed neurophysiological modeling uses biophysical compartmental models with the theoretical framework of HodgkinHuxley and Rall to describe the dynamics of real neurons, but those models are extremely difficult to link to behavioral output. RL provides a theoretical framework that could help bridge across the still-underexplored chasm between behavioral modeling and neurophysiological detail.

      On the positive side, this paper uses a network of interacting neurons in region CA3 and CA1 (as used in previous models by McNaughton and Morris, 1987; Hasselmo and Schnell, 1994; Treves and Rolls, 1994; Mehta, Quirk and Wilson. 2000; Hasselmo, Bodelon and Wyble, 2002) to address how a simple representation of biological network dynamics could generate the successor representation used in RL. The successor representation is an interesting theory of hippocampal function, as it contrasts with a previous idea of model-based planning. Previous neuroscience data supports the idea that animals use a model-based representation (a cognitive map made up of place cells or grid cells) to read out potential future paths to plan their behavior in the environment. For example, Johnson and Redish, 2007 showed activity spreading into alternating arms of a T-maze before a decision is made (i.e. a model-based exploration of possible actions, NOT a successor representation), and Pfeiffer and Foster, 2013 showed that replay in 2-dimensions corresponds to future goal directed activity. Models such as Erdem and Hasselmo, 2012 and Fenton and Kubie, 2012 showed how forward planning of possible trajectories could guide performance of behavioral tasks. In contrast, the successor representation proposes that model-based activity is too computationally expensive and proposes that instead of reading out various possible model-based future paths when making a decision, that a simulated agent could instead learn a look-up table indicating the probability of future behavioral states accessible from a given state. In previous work, the successor representations accounted for certain aspects of experimental neuroscience data such as place cells responding to the insertion of barriers as seen by Alvernhe et al. and the backward expansion of place field seen by Mehta et al. The current paper is admirable for addressing the potential role of neural replay in training of successor representations and its relationship to other neural and behavioral data such as the papers by Cheng and Frank 2008 and by Wu et al. 2017.

      However, a lot of this same data could still be interpreted as indicating that animals use a model-based representation as described above. There's nothing in this paper that rules out a model-based interpretation of the results discussed above. In fact, the cited paper by Momennejad et al. 2017 shows that humans extensively use model-based mechanisms along with some use of a successor representation in addition to the model-based mechanism. The description in the article under review needs to avoid treating successor representations as if they are already the ground truth.

      To do this, throughout the paper, the authors need to repeatedly address the fact that the Successor Representation is just a theory and not proven experimental fact. And they need to repeatedly in all sections point out that the successor representations hypothesis can be contrasted with the theory that model-based neural activity could instead guide behavior and could be the correct account for all of the data that they address (i.e. such as the darkavoidance behavior). They should cite the previous examples of neural data that looks like model-based planning such as Johnson and Redish, 2007 in the T-maze and Pfeiffer and Foster, 2013 in open fields, and cite models such as Hasselmo and Eichenbaum, 2005; Erdem and Hasselmo, 2012 and Fenton and Kubie, 2012 that showed how forward replay or planning of possible trajectories could guide performance of behavioral tasks

      We thank the reviewer for the valuable feedback. We have adapted the manuscript throughout to discuss the important point that the SR is not the ground truth (e.g. the final paragraphs in the sections “Bias-variance trade-off” and “Leveraging replays to learn novel trajectories”). We also discussed more extensively the model-based literature and the suggested citations in the manuscript.

      The title and text repeatedly refers to a "spiking" model. They show spikes in Figure 2 and extensively discuss the influence of spiking on STDP, but they ought to more explicitly discuss the interaction of their spike generation mechanisms (using a Poisson process) and the authors should compare their model to the model of George, DeCothi, Stachenfeld and Barry which addresses many of the same questions but using theta phase precession to obtain the correct spike timing in STDP.

      Yes, that's a great suggestion. We have extended our discussion section. In particular, we added:

      In our work, we did not include theta modulation, but phase precession and theta sequences could be yet another type of activity within the TD lambda framework. Interestingly, more groups have recently investigated related ideas. A recent work \citep{George2022} incorporated the theta sweeps into behavioural activity, showing it approximately learns the SR. Moreover, theta sequences allow for fast learning, playing a similar role as replays (or any other fast temporalcode sequences) in our work. By simulating the temporally compressed and precise theta sequences, their model also reconciles the learning over behavioral timescales with STDP. In contrast, our framework reconciles both timescales relying purely on rate-coding during behaviour. Finally, their method allows to learn the SR within continuous space. It would be interesting to investigate whether these methods co-exist in the hippocampus and other brain areas. Furthermore, \citep{Fang2022} et al. recently showed how the SR can be learned using recurrent neural networks with biologically plausible plasticity.

      The introduction and start of the Results section are should have more citations to neuroscience data. The introduction currently cites only three experimental citations (O'Keefe and Dostrovsky, 1971; O'Keefe and Nadel, 1978 and Mehta et al. 2000) and then gives repeated citations of previous theory papers as if those papers define the experimental data that is relevant to this study. The article should review actual neuroscience literature, instead of acting as if a few theory papers in the last five years are more important sources of data than decades worth of experimental work. The start of the results section makes a statement about the role of hippocampus and only cites Stachenfeld et al. 2017 as if it were an experimental paper. The introduction, start of results and discussion need to be modified to address actual experimental data instead of just prior modeling papers. They need to add at least a paragraph to the introduction discussing real experimental data. There are numerous original research papers that should be cited for the role of hippocampus in behavior so that the reader doesn't get the impression all of this work started with the paper by Stachenfeld et al. 2017. For example, the introduction should supplement the citations to O'Keefe and Mehta with other experimental papers including those that they cite later in the paper. They should also cite other seminal work of Morris et al. 1982 in Morris water maze and Olton, 1979 in 8-arm radial maze and work by Wood, Dudchenko, Robitsek and Eichenbaum on neural activity during spatial alternation. At the start of the Results, instead of only citing Stachenfeld (which should have reduced emphasis when speaking about experiments), they should again cite O'Keefe and Nadel, 1978 for the very comprehensive review of the literature up to that time, plus the work of Morris and Eichenbaum and Aggleton and other experimental work.

      We thank the reviewer for the suggested citations. We have added many citations in order to discuss the experimental literature more thoroughly.

      This article is admirable for addressing how to utilize a continuous representation of space and time, which Kenji Doya also addressed in his NeurIPS article in 1995 and Neural Computation 2000 (which should be cited). To emphasize the significance of this continuous representation, they could note that reinforcement learning (RL) theory models still tend to use a discretized grid-like map of the world and discrete representation of time that does not correspond to the probabilistic nature of place cell response properties (Fenton and Muller) and the continuous nature of the response of time cells (Kraus et al. 2013).

      We thank the reviewer for this important comment and this is indeed one of the main strengths of the proposed framework. We have now emphasised this point, by adding the following paragraph to the Discussion:

      “Importantly, the discount parameter also depends on the time spent in each state. This eliminates the need for time discretization, which does not reflect the continuous nature of the response of time cells (Kraus et al. 2013).”

      I think the authors of this article need to be clear about the shortcomings of RL. They should devote some space in the discussion to noting neuroscience data that has not been addressed yet. They could note that most components of their RL framework are still implemented as algorithms rather than neural models. They could note that most RL models usually don't have neurons of any kind in them and that their own model only uses neurons to represent state and successor representations, without representing actions or action selection processes. They could note that the agents in most RL models commonly learn about barriers by needing to bang into the barrier in every location, rather than learning to look at it from a distance. The ultimate goal of research such as this should to link cellular level neurophysiological data to experimental data on behavior. To the extent possible, they should focus on how they link neurophysiological data at the cellular level to spatial behavior and the unit responses of place cells in behaving animals, rather than basing the validity of their work on the assumption that the successor representation is correct.

      We thank the reviewer for this suggestion, we have now extended the Discussion to include a paragraph on the “Limitations of the Reinforcement Learning framework” which we reproduce here:

      We have already outlined some of the perks of using reinforcement learning for modelling behaviour, including providing clear computational and algorithmic frameworks. However, there are several intrinsic limitations to this framework. For example, it needs to be noted that RL agents that only use spatial data do not provide complete descriptions of behavior, which likely arises from integrating information across multiple sensory inputs. Whereas an animal would be able to smell and see a reward from a certain distance, an agent exploring the environment would only be able to discover it when randomly visiting the exact reward location. Furthermore, the framework rests on fairly strict mathematical assumptions: typically the state space needs to be markovian, time and space need to be discretized (which we manage to evade in this particular framework) and the discounting needs to follow an exponential decay. These assumptions are overly simplistic and it is not clear how often they are actually met. Reinforcement Learning is also a sample-intensive technique, whereas we know that some animals, including humans, are capable of much faster or even one-shot learning. \ Regarding the specific limitations of our model, we can note that even though we have provided a neural implementation of the SR, and of the value function as its read-out (see Figure 5-figure supplement S2, the whole action selection process is still computed only at the algorithmic level. It may be interesting to extend the neural implementation to the policy selection mechanism in the future.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      _Reply to the reviewers _

      Note: the three reviewers who provided comments were identified as Reviewers 2-4

      Reviewer #2

      1) I could not open any of the movies (while those associated with the BioRXiv preprint were fine). Some of the movies could be combined to minimize download/open clicking sequences.

      • The movies were uploaded as .avi files, as per Review Commons instructions, and we tested our ability to view them on several computers at our institution before submission. We are relieved the reviewer was able to access the .mp4 formatted movies via BioRXiv. We will ask the Review Commons Managing Editor to make sure there are no problems with the videos uploaded with the revised manuscript.*

      2) I really dislike reviewing papers without line numbers

      • Line numbers have been added to the revised version.*

      3) The manuscript could be made more relevant to malaria researchers by briefly discussing red cell invasion by merozoites (a single constriction and force against the cell cortex), migration of ookinetes (multiple constrictions during mosquito gut penetration) and sporozoites (long distance migration), but this is not a must.

      • Constrictions during ookinete migration are now mentioned on lines 265-269, and the discussion of the constriction at the moving junction has been broadened to include other apicomplexan parasites lines 270-278.*

      4) I would limit reporting of numbers to two digits, e.g. instead of 46.3% make it 46%; 2.56 +/- 0.38 to 2,6 +/- 0,4 etc

      > We have adjusted all numbers in the text and figures to the appropriate number of significant figures based on measurement precision.

      5) Millions of deaths, please rewrite, more like around 1 million from malaria and cryptosporidium; use citation (WHO)

      > Done (line 40)

      6) Motility: please don't mention flagella, which are used for swimming, in the same sentence / phrase / logic connection as lamellipodia, which are used for substrate based migration

      > The sentence has been rewritten to make clear that cilia and flagella are not organelles involved in the substrate-dependent motility of other eukaryotic cells (lines 47-49).

      7) In Figure 1B, I can see one microsphere and it's not clear if it moves completely back to the original position. In the movie it looks like it goes completely back, maybe exchange the last panel of the figure with a last frame from the movie? Or maybe better: replace with frames from movie 2, which is more striking and shows many beads being displaced?

      > As suggested, Figure 1B now shows frames from the other movie (former Video 2), where bead movement is more obvious.

      8) Please add the entire figure S1 to Figure 1. This is important for readers to understand and 'deserves' full figure status. Same for Figure S2.

      *> We have moved most of former Figure S1 into a new main Figure 2, as suggested. We left the two graphs as Supplemental data (new Figure S1), since these graphs simply show that parasite motility in fibrin is similar to the previously described motility of parasites in Matrigel. *

      *> Figure S2 has been moved to the main text, as suggested (in new Figures 3 and 6). *

      9) I would encourage the authors to elaborate more on the data on Figure S2. It appears that motile parasites did mostly not exert forces above the level for non-motile parasites; for how much motility did they observe forces? The meaning of the x-axis does not become clear. Are those individual parasites per time point or time points of one parasite or of the analyzed matrix volumes over several parasites? How many parasites where observed? This is stated more clearly later but needs to be done already here.

      > We have moved the data in former Suppl. Figure S2 into the main figures, broken it into two parts (Figures 3 and 6B-E) and included a new 3D volume view and additional explanatory detail in the figure legends and text to clarify these points of confusion (lines 100-116, 500-507, 564-570).

      10) Please change 0.042 um into 42 nm etc

      *> Done, lines 113-116. *

      11) Please move some of the data in Figure S8 to the main figures e.g. Figure 4, where it would make a nice contrast / comparison to the mic2 mutant. Please also put a WT for comparison.

      > Done; see revised Figure 6.

      12) I wonder if the defect in directional migration of the mic2 mutant is also partly due to the parasite not being able to squeeze through narrow matrix pores and hence is deflected more often. While I understand (and agree) with the authors observation (interpretation) of the wt parasites not squeezing but pulling, it's hard to think that such squeezing would not still play a part.

      *> The idea that the parasite needs to squeeze its way through pores in the matrix is intuitively appealing (and, in fact, what we had expected to see) but there is currently no data to support it. If squeezing were occurring, we should see an outward deformation of the matrix as the parasite pushes on the matrix fibers, but this is something we have never observed. We therefore think it is unlikely that the loss of directional migration is due to an inability to squeeze through pores in order to “stay on track”. *

      13) Hueschen et al is now on BioRXiv

      > The BioRXiv citation has been added (lines 293, 320).

      14) The shaving off of antibodies could be brought into context to the work on sporozoites by Aliprandini Nat Micro 2018 and on trypanosomes by Enstler Cell 2007 (but not a must)

      *> The two studies mentioned are intriguing and may be related to the well-documented anterior to posterior flux and shedding of GPI-anchored proteins from the surface of gliding Toxoplasma tachyzoites. What we are showing here is slightly different: the fluorescent antibodies on the cell surface seem to be “shaved” backwards at the constriction, much like surface bound antibodies are shaved backwards at the moving junction during invasion (Dubremetz 1985). In other words, there is a discontinuity in the density of surface staining at the constriction/junction. All of these processes may be related, but this is only speculation at this point and since the shaving of antibody at the constriction is a minor point of the paper (meant only to illustrate another similarity between 3D motility and invasion), we would prefer not to try to tie it to these other observations which may or may not be related. *

      15) Anterior-posterior flux: best experimental evidence for this is Quadt et al. ACS Nano 2016 for Plasmodium and Stadler MBoC 2017 for Toxoplasma. The common observations and differences could be discussed as they pertain to the current study

      > These two papers are now cited in our discussion of the linear motor model along with our speculation that the constriction reflects the motility-relevant zone of engagement of this rearward flux with ligands in the matrix (lines 319-322).

      16) The loss of mic2 could lead to the loss of the capability to form discrete adhesion sites that reveal themselves as the observed rings in 3D. I suggest to be careful to hypothesize that the absence of this and MyoA reveals a completely different motility mechanism. To me it seems more likely that the absence of the proteins means that the existing mechanism doesn't work perfectly any more, ie the highly tuned migration machinery misses a key part and malfunctions.

      *> The paragraph in question offered possible explanations for how parasites lacking the constriction could in fact move at normal speeds, not that motility was negatively affected. We have tried to make this more clear in the revision (lines 352-354), before describing the 3 possible explanations. *

      17) Maybe reflect on whether 'search strategy' might be a better word than 'guidance system'

      *> We have replaced the term “guidance system” in the title (lines 1-2), abstract (lines 33-36) and introduction (line 75) with more conservative references to the ability of the parasite to move directionally. The only place the term “guidance system” remains is in the final paragraph of the discussion, which is more speculative in nature, and where we now suggest it to be “part of” a guidance system. *

      Reviewer #3

      1) Extracellular matrix choice. The authors track the parasite movement first on Matrigel and next on fibrin. The authors exemplify the fibrin matrix on an image on Suppl. Fig 1 that shows a relatively quite large pore size, similar or greater than parasite size. Was the analysis done on parasites touching the fibers?

      *> Previous Suppl Figure 1A showed a confocal image at only one z-plane which did indeed give the impression that the pores are relatively large. We have changed this image to a more informative maximum intensity projection (New Figure 2A) and included a video showing the entire imaging volume (new Video 4), which makes clear that the matrix contains many small fibers and that the pores are smaller than the previous single z-plane suggested, so the parasite is likely to be near to or in contact with fibers of the matrix at all times. In Suppl Figure 1D we purposely used a less dense matrix in order to make the matrix deformation more obvious to the eye. The density of the matrix in Fig. 1D has been added to the legend. *

      2) Lack of movement of parasites. In many figures of the articles it is revealed that the majority of parasites in fibrin remain immobile (Suppl Fig 1, Fig 2, Video 5, Suppl Fig 2, Suppl Fig 8). The number of immobile parasites in Matrigel seem to be lower than in fibrin (Suppl Fig 1B) although no quantification is shown. How does the movement in fibrin and Matrigel compare? How does this compares with movement in stiff substrates in 2D? Could the lack of movement be caused by the large pore site in fibrin?.

      > We have added a panel to Suppl. Figure S1 showing that the proportions of parasites moving in fibrin vs Matrigel are not significantly different. In fact, none of our measured motility parameters are different between fibrin and Matrigel. Not all parasites move during the 80s of capture used for these matrix comparisons; some of the parasites are likely dead, but others may have simply not initiated motility during this time window. We typically see between 30-50% movement in 3D motility assays of this duration and similar numbers in 2D trail assays although we have not explored the effect of 2D substrate stiffness.

      3) Considering parasite movement: The authors consider that 3SD is a cutoff for considering parasite displacement. However, several timepoints fall behind this cutoff in the control without parasites and the knockouts with restricted movement.

      > We chose three standard deviations from the mean as our cutoff, in order to eliminate 99.7% of the noise. Since we calculate 16807 vectors per comparison, this leaves us with ~50 vectors above the cutoff even in samples with no moving parasites. Not surprisingly, these vectors are found at random locations in the volume. New Figures 3 and 6B-E and the associated text (lines 100-116, 500-507, 564-570) hopefully clarify this point adequately; it is quite obvious in Figure 3C which vectors correspond to parasite-induced displacements and which correspond to random noise.

      4) Imaging: Although the authors show a very detailed an illustrative table of the imaging acquisition conditions in table 1, it is unclear which microscope the authors used, as two microscopes are described in the methods section, a Nikon Eclipse TE300 widefield microscope and a Nikon AIR-ER confocal microscope. Which images were taken in each system? For the location of Table1 in the manuscript it seems that most images were taken with the Nikon Eclipse. Although this microscope has control over z, the images are quite noisy. How does the lack of confocallity might interfere with the analysis?

      > The high temporal resolution needed for 3D force mapping of cells that move several microns per second meant that all these experiments were done using a widefield microscope equipped with a piezo-driven z-stage. The fastest confocal we tested was not as fast as the widefield. However, spatial resolution suffered as a result of having to use widefield, particularly in z,* and this did indeed make our data more noisy as suggested by the reviewer. This may be why we were unable to detect fibrin deformation in the knockout parasites. The only data collected on the confocal microscope were those shown in new Figure 2A; we have clarified this on lines 421-427. Future studies will explore other imaging modalities such as light sheet microscopy in an attempt to achieve better spatial resolution while maintaining the high frame rates required for force mapping. *

      5) Nuclear constriction. The authors did not show any image or video exemplifying this.

      The images in Suppl. Figure 6 have been replaced with data that show the nuclear shape more clearly.

      6) Knockouts: The authors did not explain how did they generated the knockouts in the methods or did now show the efficacy of the knockout in any figure. If these knockout strains were a gift (I did not find it on the manuscript), the authors should indicate this more explicitly and reference the manuscript where they were described for the first time.

      > Both of the stable knockout lines used were generous gifts from Dr. Markus Meissner. We cited the original papers describing these lines in the text and thanked Dr. Meissner for providing them in the Acknowledgements section. We have now included an additional citation at the first mention of each of the knockouts (lines 174, 188) to make it even clearer where they came from.

      7) Discussion: Although the experimental methodology is sound the authors seem to make many assumptions and speculations on the discussion as how the appearance of this ring/constriction on the parasite translates into the helical movement of the parasite or the coupling of the ring with the cytoskeleton. Live imaging of actin dynamics or mathematical modelling could be used to support their claims.

      > We imaged parasites expressing the actin chromobody but were unable to visualize a ring of actin at the constriction. However, due to the speed of the parasites and the need for a fast frame rate (~15 ms per image) to reconstruct the 3D image volumes, the actin chromobody signal could be under our threshold of detection. We need to develop new, more sensitive ways to visualize proteins at the constriction, and this will be a major focus of our work going forward.

      *> We fully concur that mathematical modeling such as the work recently done by Hueschen et al on actin flow during motility and by Pavlou et al on the role of parasite twist during invasion has much to offer our understanding of these processes. Similar approaches may provide support to the speculations (not claims!) we offer in the discussion and, although beyond the scope of the current study, are a direction we intend to take this work in the future – particularly if we are able to improve the signal-to-noise in our force mapping. *

      8) Quantification of experiments missing: Overall, the main figures lack quantification that sometimes can be found in the supplemental information and sometimes is missing. I would suggest including quantifications next to the events described in the main figures). Likewise, some of the supplemental figures lack quantification (Suppl Fig 7, how many parasites showed this protein trail?)… Overall, the authors should indicate how many parasites were quantified in each figure. As they usually refer to number of constrictions. This is overall a problem in main figures 3 and 5. Or for example in Suppl Fig 5: How many parasites were quantified in this figure? The authors only show number of constrictions, and as the authors described, a parasite might have more than one constriction.

      > We have added further detail on the number of events/parasites quantified to both the figure legends and text throughout the manuscript, including the specific examples noted by the reviewer.

      9) Videos: The videos lack scale of time. Although this that can be found in main figures, it would be helpful to have the annotation in the videos. Likewise, some references for positions in videos, such as the cross found on Fig1 would be helpful for parasites that present little movement.

      > Time stamps have been added to all videos as suggested, and crosshairs have been applied to new Figure 1B and Suppl. Figures 7 and 8 to make the movement of the parasites more obvious. *

      *

      Reviewer #4

      1) I am not sure about the premise that the "linear model" of gliding motility predicts uniformly forward direction. Previous videos of 2D gliding show sporadic motility, changes in direction, or even reversal of direction are not infrequent. However, the current model could explain these behaviors if one or more of the following conditions occur: 1) myosin motors might be coordinating activated to initiate motility, followed by relaxation, 2) actin fibers might be transiently arrayed in clusters that change density and polarity over time, or 3) adhesins, necessary to generate traction, might vary in density and spatial orientation across the surface of the parasite. Changes in these properties would be expected result in zones that promote or disfavor local forces needed for motility - and reversal of direction could occur when forward forces relax and external elastic forces predominate.

      > The potential explanations offered by the reviewer for the frequent changes in direction of zoite motility are intriguing and worth exploring experimentally. The ability of actin fibers to periodically reverse polarity, or the presence of counteracting elastic forces are not components of the “standard” linear motor model of motility but, if they occur, could explain the patch gliding phenomenon and help refine our understanding of motility. Since the data in this manuscript do not in the end either strongly support or disprove the linear motor model – this may ultimately require higher resolution force mapping methods that can detect the forces responsible for forward motion – we have de-emphasized potential problems with the model in the introduction and deleted specific discussion of patch gliding as one of these problems (lines 61-64).

      2) The model favored here: "we propose that force is generated, at least in part, by the rearward translocation of the subset of actin filaments that are coupled to adhesins at the circular ring of attachment" does not seem fundamentally different from the current model - other than it focuses the forces at a critical junction that the parasite migrates through. It seems to me that this is a refinement of the current model and not a replacement. As such, the authors might focus on how their data improve the model rather than pointing out prior deficiencies (although I get that editors like this style).

      > We agree with the reviewer and have modified the text to be more circumspect on this issue* (lines 319-331). *

      3) The finding that the absence of MIC2 affects the constriction formed by inward pull on the matrix is quite convincing and interesting. However, mutants that cannot form the constriction, still move at similar speeds. This suggest that the inward force is different from the motor itself and affects its ability to impart direction, rather than the ability to move per see. The interpretation of the MyoA defect is complicated since motility is certain to be disrupted, the potential role of an independent inward force may no longer be detectable.

      > We agree with the reviewer on this point as well: the forces we have observed to date cannot explain forward motion. We stated this previously and have now emphasized the point further *(lines 322-324, 352-357). Because the parasite is moving forward, the forces responsible must be there but are likely below our threshold of detection. In order to visualize these forces, we are going to need new imaging modalities that can achieve better signal-to-noise than our current setup at the high frame rates required for force mapping. That said, we new data we have added to the manuscript are at least consistent with the narrow diameter ring of the constriction making a contribution to the parasite’s forward motion (new Suppl. Figure 10 and lines 347-351) *

      4) Although I agree with the authors that there are striking parallels between motility in 3D and cell invasion, I am not certain about their conclusion that the construction seen during cell entry is due to the parasite pulling inwardly. When entering the host cell, the parasite must also navigate the dense subcortical actin network, which likely also aids in forming the constriction that is observed. It would be interesting to record this pattern under conditions where host cell actin is destabilized while parasite motility is intact- for example using cytochalasin D to treat wild type host cells during invasion by resistant parasites.

      *> We do not conclude that the constriction during invasion is due to the parasites pulling inwardly, but we do propose that this possibility needs to be considered based on the noted similarities between invasion and motility and our clear (and somewhat surprising) demonstration that the moving parasite pulls on the matrix at the constriction during motility. During invasion, the parasite may indeed have to squeeze through the dense subcortical network – or it may use secreted proteins to loosen up the network so that no squeezing is required. We just don’t know, and our purpose here was simply to put this alternative possibility on the table because we believe it is a viable possibility that follows from the data presented. *

      > We thank the reviewer for the suggestion of testing what happens when cytoD resistant parasites invade in the presence of cytoD; this is a clever idea that we will likely pursue in future work.

      5) Not all of the color patterns shown in Figure 1A are consistent with the model. For example, GAP40 (yellow) does not appear in the model, there are two MLC boxes, but they are different shades, and ELC1/2 does not appear in the model.

      > We thank the reviewer for catching this error; it has now been fixed.

    1. Discussion, revision and decision


      Decision

      Verified with reservations: The content is scientifically sound, but has shortcomings that could be improved by further studies and/or minor revisions.

      Dr. Bañuelos: Verified manuscript

      Dr. Morris: Verified with reservations


      Revision

      Response to Reviewer 1 (Dr. Bañuelos)

      1. Most importantly, I would like to see an introduction that explains the authors’ general arguments about grading changes – including the trajectory of these changes at Dalhousie and why this arc contributes to our knowledge of the history of higher education more broadly. Then, the authors might continually remind us of the arc they present at the outset of their paper – especially when they are highlighting a piece of evidence that illustrates their central argument. To me, the quotes from students and faculty responding to grading changes are among the most interesting parts of the paper and placing these in additional context should make them shine even more brightly!

      Our Response: Thank you so much for your thoughtful review. We have added a larger new introduction section of the paper (paragraphs 1-5 in the latest draft are new) that outlines the general importance of the topic, the Canadian context, details on Dalhousie University, and our overall thesis statement (i.e., most decisions were to improve the external communication value of grades). Moreover, we have added three new student quotes form the Dalhousie Gazette to build a stronger picture for student reactions, and to build a better case for our overall thesis statement (i.e., that changes in grading were often to increase the external communication value of grades). Moreover, throughout we have added some details on the overall funding trajectory for institutions in Canada that created some pressure to standardize grading. We think that these changes have improved the manuscript.

      1. I’d like to read a little more about Dalhousie itself – why it is either a remarkable or unremarkable place to study changes in grading policies. Is it representative of most Canadian universities and thus, a good example of how grading changes work in this national context? Is it unlike any other institution of higher education and thus, tells us something important about grades that we could not learn from other case studies? I don’t think this kind of description needs to be particularly long, but it should be a little more involved than the brief sentences the authors currently include (p.3, paragraph 1) and should explain the choice of this case.

      Our Response: This comment revealed that two additional pieces of context were needed for the introduction: (a) some national context for higher education policy in Canada and (b) some extended description of Dalhousie University when compared to other universities in Canada. To this end, two new paragraphs have been added to the paper (paragraphs 2 & 3 in the current draft).

      Notably, Jones (2014) notes that “Canada may have the most decentralized approach to higher education than any other developed country on the planet” (pg 20). With this in mind, any historical review of education policy is by necessity specific to province and institution – that is, the information can be placed in its context, but resists wide generalization to the country as a whole. In the newest draft, we tried to describe the national, provincial, and institutional context in some more detail in paragraphs 2 & 3.

      1. I’d also like to know more about the archival materials the authors used. The authors mention that they drew from “Senate minutes, university calendars, and student newspapers” (p. 3), but what kinds of conversations about grades did these materials include? At various points, the authors engage in “speculation” (e.g. p.4) about why a particular change occurred. This is just fine and, in fact, it’s good of the authors to remind us that they are not really sure why some of these shifts happened. But, they might go one step further and tell us why they have to speculate. Were explicit discussions of grading changes – including in inter- and intradepartmental letters and memo, reports, and other documents – not available in these archives? Why are these important discussions absent from the historical record?

      Our Response: We have added a new paragraph (paragraph 4) to the paper discussing the sources in some more detail. It is true that the verbatim discussions are frequently absent from the record, especially earlier in history – or if they exist, we have not found them! Instead, we frequently are reviewing meeting minutes or committee reports, which are summaries of discussions. As we now note in the paper, “Thus, the sources used showed what policy changes were implemented, when they were implemented, and a general sense of whether there was opposition to changes; however, there were notable gaps in faculty and student reactions to grade policy changes, as these reactions were frequently not written down and archived.”

      This gap was most apparent in the Senate minutes around the 1940s, where I (the first author) could not find any direct discussions of why changes were implemented. Under the 1937-1947 heading, we more clearly indicate that the rationale for the changes was absent from the Senate minutes during this period. I add some further speculation on why these records might be absent, based on summaries from Waite (1998b); specifically, the university president of the time often made unilateral decisions, circumventing Senate, which might account for why the changes are absent from the records.

      This will hopefully make the limitations of what can be learned from this approach more apparent.

      1. At various points, the authors make references to the outside world – for example, WWII (p. 5), the Veteran’s Rehabilitation Act (pp. 6-7), and British versus American grading schemas (p. 6). But, these references are brief and seem almost off-handed. I know space is limited, but putting these grading changes in their broader context might help make the case for why this study is interesting and important. Are the changes in the 1940s, for example, related to the ascendance of one national graduate education model over another (e.g. American versus British)? Are there any data on how many Canadian undergraduates enrolled in British versus American graduate programs over time? If so, I would share any information you might have on these broader trends.

      Our Response: To our knowledge, there isn’t any comparable report to what we’ve written here documenting the transition from British “divisions” to American “letter grades” in Canadian Universities, making our report novel in this regard. It might well be that a similar historical arc exists in many of the 223 public and private universities in Canada, but we don’t believe such data exists in any readily accessible way – excepting perhaps undergoing a similar deep dive into historical documents at each respective institution! So, we do not have the answer to your question: “Are there any data on how many Canadian undergraduates enrolled in British versus American graduate programs over time?” However, we did add one reference which provided a snapshot point of comparison in 1960, noting in the paper “Baldwin (1960) notes that the criteria for “High First Class” grades in the humanities was around 75-80% at Universities of Toronto, Alberta, and British Columbia in 1960, suggesting that Dalhousie’s system was similar to other research-intensive universities around this time.” That said, there are a few major national events related to the funding of universities in Canada that we have elaborated on in the text to address the spirit of your recommendation for describing the national context:

      a) In the “Late 1940s” section of the paper, we added: “Though Dalhousie had an unusually high proportion of veterans enrolled relative to other maritime universities during this period (Turner, 2011), the Veteran’s Rehabilitation Act was a turning point for large increases in enrollment and government funding Canada-wide, at least until the economic recession of the 1970s (Jones, 2014).”

      b) In the 1990s, there were major government cuts to funding, creating challenging financial times for the university. We discuss the funding pressures that likely contributed to standardization of grading during this time by saying the following in the 1980s-2000s section: “Starting in in the 1980s-1990s there were major government cuts to university funding nation-wide, with the cuts becoming more severe in the 1990s (Jones, 2014; Higher Education Strategy Associates, 2021). Because of the nature of the funding formulas, cuts in Nova Scotia were especially deep. Beyond tuition increases, university administrators knew that obtaining external research grants, Canada Research Chairs, and scholarship funding was one of the few other ways for a university to balance budgets, so there was extra pressure to be competitive in these pools. […] The increased standardization was likely related to increased financial pressures at this time – standardization is an oft-employed tool to deal with ever-increasing class sizes with no additional resources.”

      c) In the 2010s section of the paper, we added context to how universities in country-wide have become increasingly dependent on tuition fees for funding: “Following the 2008 recession, federal funding decreased again (Jones, 2014; Higher Education Strategy Associates, 2021); however, this time universities tended to balance budgets by increasing tuition and international student fees. This trend towards increased reliance on tuition for income is especially pronounced in Nova Scotia, which has the highest tuition rates in the country (Higher Education Strategy Associates, 2021). Thus, the university moved closer to a “consumer” model of education, so it makes sense that a driving force for standardization was student complaints.”

      1. This is a very nitpicky concern that doesn’t fit well elsewhere, so please take it with a grain of salt. I was surprised at the length of the reference list – it seemed quite short for a historical piece! I wonder, again, if more description of the archival material - including why you looked at these sources, in particular, and what was missing from the record – would help explain this and further convince the reader that you have all your bases covered.

      Our Response: In the introduction section, paragraph 4, we describe our sources in more detail including what is likely missing from the record and why we used them. Regarding the length of the reference list, we did add ~12 new references to the list in the course of making various revisions, which partially addresses your concern. Beyond this though, it’s worth noting that some of the sources more extensive than they seem, even though they don’t take up much space in the reference list (e.g., there is one entry for course calendars, but this covers ~100 documents reviewed!). Moreover, there were many dead-ends in the archives that are not cited (e.g., reviewing 10 years of Senate minutes in the 1940s produced little of relevance), so the reference list is curated to only those sources where relevant materials were found.

      Reviewer response to revisions

      The new introduction to the piece addresses many of my previous questions about the authors’ general arguments, the Dalhousie context, and the source material. Thank you for addressing these! Reading this version, it is much clearer that the key argument is that standardized, centralized grading practices were “to improve the external communication value of the grades, rather than for pedagogical reasons” (p. 6). I also really enjoyed the added quotes from students in the Dalhousie Gazette.

      The authors’ response to Reviewer 2 really gave me a better sense of why they wrote this piece and also helped me to more clearly put my finger on what was troubling me in the first round. It still reads a little like a report for an internal audience – which is just fine and, in fact, can be extremely useful for historians of the future. But, as Reviewer 2 notes, this means it does not really seem like a piece of historical scholarship. I do worry that shaping it into this form would take an extensive revision and might not be in the spirit of what the authors intended to do.

      A different version of this article might start with this idea that grades were standardized for external audiences and in response to financial pressures. It would then develop a richer story behind the sudden importance of these external audiences and the nature (i.e. source, type) of financial pressures Dalhousie was facing. It would highlight the impact such changes had on students and their future careers/graduate experiences. It could then connect these trends to other similar changes for external audiences and the increasing interconnectedness of American, Canadian, and British systems through graduate education. It might even turn to sociological theories of organizational change and adaptation and make an argument for when (historically) similar forms of decoupling were likely to occur in the Canadian higher education system. Finally, it might connect these grading changes to current trends – including accusations of grade inflation and accepted best practices for measuring learning outcomes.

      But, it doesn’t seem that the authors necessarily want to do this, which I can understand and respect. I think there is enormous value in a piece of scholarship like this existing – both for internal audiences and for future historians. Indeed, imagine if every university had a detailed history of its grading policies like this available somewhere online! Comparing such practices across institutions would certainly tell us a lot about why grading currently looks the way it does.

      Decision changed

      Verified manuscript: The content is scientifically sound, only minor amendments (if any) are suggested.


      Response to Reviewer 2 (Dr. Morris)

      The authors dove headfirst into Dalhousie’s archives, unpacking the subtle shifts in grading policy. Their work seems to be comparable to archaeologists, digging deep beneath mountains of primary sources to find nuggets of clues into Dalhousie’s grading evolution. I particularly liked when the authors were able to link these changes to student voices, as seen in moments when they referenced student publications.

      Ultimately, I kept coming back to one main comment that I wrote in the margins: “So what?” I would humbly suggest that the authors reflect on why this history matters to them. Granted, they do this in the conclusion, where they touch on Schneider & Hutt’s argument that grades evolved to increasingly be a form of external communication with audiences beyond school communities. Sure. But I want more. I wanted to see a new insight that this microhistory of Dalhousie significant to the history of Canada or the history of education more generally.

      If the authors are so inclined, there might be several approaches to transform this manuscript. I would suggest the following. First, instead of tracing the entire history of grading at the institution, choose one moment of change that you think is the most important. Perhaps in the 1920s and the lack of transparency in grading, or the post-war shift toward American grading. Second, show me – don’t tell me – what Dalhousie was like at this moment. Paint a picture of the institution with details about student demographics, curriculum, educational goals, the broader town, etc. Make the community come alive. Show me what makes Dalhousie unique from other institutions of higher ed. Once you establish that picture, perhaps you could link the change in grading practices to subtle changes at the university community, thereby establishing a before and after snapshot. This will require considerable amounts of work, and the skills of a historian. You will have to find primary and secondary sources that go far beyond what you’ve relied on thus far.

      In the end, I found myself wanting the authors to humanize this manuscript, meaning I wanted them to show me that changes in grading practices have tangible effects on real-life human beings. A humanization of their research would mean going narrower and deeper; or, in other words, eliminating much of what they have documented.

      However, if that is too tall of an order, I would ask that the authors clarify for themselves who this manuscript is for. Is this a chronicling of facts for an internal audience at Dalhousie’s faculty, alumni, and students? Fine. But my guess is that even members of the Dalhousie community want to read something relatable.

      I am suggesting revisions, although not because of objective errors. History is more of an art, in my opinion. With that in mind, I would suggest that the authors paint a more vivid picture (metaphorically) of Dalhousie, showing me how changes one moment of change in grading practices impacted the lives of human beings.

      Our Response: Thank you very much for taking the time to read our paper and provide your thoughts and recommendations. It may be helpful to begin by describing why I (the first author) decided to write this paper. Ultimately, I wrote this paper to satisfy my own personal curiosity and to connect with other people at my own place of employment by exploring our shared history. At present day, Dalhousie has a letter grading scheme with a standardized percentage conversion scheme that all instructors used. I wanted to know why this particular scheme was used, but I quickly realized that nobody at Dalhousie really knew how we ended up grading this way! There was an institutional memory gap, and a puzzle that was irresistible to me. So, I wrote this paper for the most basic of all academic reasons: Pure curiosity. I do very much recognize that the subject matter is very niche, perhaps too niche for a traditional journal outlet. Thus, my publishing plan is to self-publish a manuscript to the Education Resources Information Center (ERIC) database and a preprint server as a way of sharing my work with others who might be interested in what I found. Nonetheless, I believe in the importance and value of peer review, especially since I am writing in a field different than most of my scholarly work. That is why I chose PeerRef as a place to submit, so that I could undergo rigorous peer review to improve the work while still maintaining the niche subject matter and focus that drives my passion and curiosity for the project. Of course, if you feel the whole endeavor is so flawed that it precludes publication anywhere, then we can consider this a “rejection” and I will not make any further edits through PeerRef.<br /> The core of your critique suggested that I should write a fundamentally different paper on different subject matter. While I don’t necessarily disagree that the kind of paper you describe might have broader appeal, it would no longer answer the core research question I wanted an answer to: How has Dalhousie’s grading changed over time? So, I must decline to rewrite the paper to focus on a single timeframe as recommended. All this said, I did try my best to address the spirit of your various concerns to improve the quality of the manuscript. Below, I will outline the various major changes to the manuscript that we made to improve the manuscript along the lines you described, while maintaining our original vision for the structure and focus of the paper. The specific changes are outline below:

      a) Two new paragraphs (now paragraphs 1-2 of the revised manuscript) were added to explain the “so what” part of the question. Specifically, we describe why we think the subject matter might be of interest to others and summarize the general dearth of historical information on grading practices in Canada as a whole.

      b) Consistent with recommendations from the other reviewer, we now state a core argument (i.e., that most major grading changes were implemented to improve the external communication value of the grades) earlier in the introduction in paragraph 5 and describe how various pieces of evidence throughout the manuscript tie back to that core theme.

      c) In an attempt to “humanize” the manuscript more, we added more student quotes from the Dalhousie Gazette throughout the paper so that readers can get a better sense of how students thought about grading practices at various times throughout history. Specifically, three new quotes were added in the following sections: 1901-1936, late 1940s, 1950s-1970s. We also added this short note about the physical location where grades used to be posted: “Naturally, this physical location was dreaded by students, and was colloquially referred to as “The Morgue” (Anonymous Dalhousie Gazette Author, 1937).”

      d) Early in the paper, we describe why we chose Dalhousie and the potential audience of interest: “As employees of Dalhousie, we naturally chose this institution as a case study due to accessibility of records and because it has local, community-level interest. The audience was intended to be members of the Dalhousie community; however, it may also be a useful point of comparison for other institutions, should similar histories be written.”

      e) We have described some of the limitations of our sources in paragraph 4, which may explain why the manuscript takes the form it does – it has conformed to the information that is available!

      f) We have linked events at Dalhousie to the national context in some more detail, by detailing some national events related to the funding of universities in Canada. See our response to Reviewer 1, #4 above for more details on the specific changes.

      g) Consistent with your stylistic recommendations, we have changed various spots throughout the paper from the present tense (e.g., “is”) to the past tense (e.g., “was”), and were careful in our new additions to maintain the past tense, when appropriate. If there are any spots that we missed, let us know the page number / section, and we will make further changes, as necessary.

      h) We retained the first person in our writing – this may be discipline-specific, but in Psychology (the first author’s home discipline), first person is acceptable in academic writing. If you feel strongly about this, we can go through the manuscript and remove all instances of the first person, but we would prefer to keep it, if at all possible.

      Hopefully this helps address the spirit of your concerns, and I look forward to hearing your thoughts in the second round of reviews.

      Decision changed

      Verified with reservations: The content is scientifically sound, but has shortcomings that could be improved by further studies and/or minor revisions.

    1. Joint Public Review:

      In this work Malis et al introduce a novel spin-labeling MRI sequence to measure cerebrospinal fluid (CSF) outflow. The glymphatic system is of growing interest in a range of diseases, but few studies have been conducted in humans due to the requirement for and invasiveness of contrast injections. By labeling one hemisphere of the brain the authors attempt to assess outflow through the superior sagittal sinus (SSS), one of the major drainage pathways for CSF, signal changes across time were assessed to extract commonly used metrics. Additionally, correlations with age are explored in their cohort of healthy volunteers. The authors report the movement of labeled CSF from the subarachnoid space to the dura mater, parasagittal dura, and ultimately SSS, evidence of leakage from the subarachnoid space to the SSS, and decreases in CSF outflow metrics with older age.

      1. I don't think that the description of Parasagittal dura in figure 1 is correct. There is no anatomical structure at the top of SSS that is known as PSD. The location of the lymphatic structures is also incorrect. Please review "Anatomic details of intradural channels in the parasagittal dura: a possible pathway for flow of cerebrospinal fluid" Neurosurgery 1996 Fox at al. There is usually no obvious tissue between the upper wall of the SSS and the calvarium, which can also be seen in the authors' fig 2A and 2B. All of the tissues located lateral to the SSS are known as PSD. Also, the SSS wall is not as thick as the authors stated and is known as PSD in this region. For this reason, the authors need to revise Fig 1 and it should be changed to PSD in the areas referred to as the SSS wall in the article.

      2. The authors described tagged CSF in two pathways: from dura mater to PSD and SAS into the SSS and directly from SAS to SSS. Flow from dura mater to PSD and SAS in the main and supplement cannot be seen. Only a flow from PSD to SSS can be seen. Also, regular dura cannot carry flow-collagen-rich fibrous tissue, except parasagittal dura. There is no flow from dura to the CSF in the figures.

      3. The authors have conducted many tests to prevent venous contamination. However, measurements were made based on SSS flow rates in all tests. Small parenchymal venous structures, and small cortical-SAS veins might be tagged due to different flow patterns and T2- Relaxation times.

      4. The rate of CSF formation in humans is 0.3 - 0.4 ml min-1. ( Brinker et al 2014. Fluids Barriers CNS). We can assume that the absorption rate is also similar to the CSF formation for the entire system brain and Spine. Therefore, the absorption rate of this very small amount of CSF by SSS is very low in seconds. It is hard to detect by MR and especially CSF flow from the PSD to SSS. The authors concluded that using this technique the rate averaged less than a couple of seconds, rather than on the order of hours or days as previously reported with the use of intrathecal administration of GBCA (Ringstad et al., 2020).

      5. Overall, I think that the CSF flow from the PSD to the CNS described by the authors - the CSF flow, might be the venous flow that drains into the SSS slowly, predominantly in the rich venous channels, venous lacunae, and previously described channels in the PSD. Additional explanations are needed.

      6. The study is generally well described and to the best of my knowledge an innovative approach. The findings are broadly consistent with what might be expected from the literature and the authors make a good argument in support of their findings. However, the lack of validation is a major limitation of the presented work. In introducing a novel technique a comparison with an existing approach, such as Gd enhanced contrast techniques, or phase contrast would have been expected. Several considerations could have been mentioned/addressed in more detail e.g. what effect labeling efficiency, tortuosity of vessels, lack of gating, the effectiveness of the intensity thresholding to remove the signal from blood, etc may have on the quantification, etc. Without a more thorough validation, it is difficult to evaluate the findings. While scans were conducted on two volunteers to assess reproducibility this is a very small sample and it is notable that scans were conducted consecutively, which might be expected to reduce variance relative to scans further apart e.g. on different dates, scanned by a different operator and no information is provided on how the two scans were positioned (i.e. separately vs copied from the first to the second scan), some metrics showed large percentage differences, which were more pronounced in one subject than the other. Without further data, it is difficult to interpret the reproducibility results. No assessment of the effect of physiological parameters e.g. breathing, cardiac pulsations, or factors affecting glymphatic clearance e.g. amount of sleep the evening before was given.

      7. Given these limitations it is hard to adequately assess the likely impact or utility. In recent years several groups have published work e.g. doi.org/10.1038/s41467-020-16002-4 , doi.org/10.1016/j.neuroimage.2021.118755 assessing the blood-CSF barrier. However, previous work has generally focused on larger structures, and by labeling in the oblique-sagittal plane it is unclear how drainage and blood flow rates may affect the presented values here.

      8. Some validation data would greatly increase the value of the reported work. I would therefore encourage the authors to consider acquiring some additional datasets to compare measures of CSF draining against another method e.g. 2-D or 4-D phase contrast, or Gd-based contrast-enhanced techniques. Some additional points to consider are noted below.

      8. Abstract

      CSF outflow may also be imaged with phase contrast MRI (albeit in a limited way).<br /> Demographics would fit better in Results, breakdown could be given for the young and old groups i.e. n, ages, sex.<br /> Conclusion - unless further validation can be provided I think some of the claims should be toned down.

      9. Introduction

      The authors emphasise the role of Nedergaard, however, there was some relevant earlier work (e.g. Rennels et al, PMID: 2396537).

      10. Methods

      It would be more conventional to summarise the volunteer characteristics in the Results.<br /> Given the age difference between the two groups, and the fact that for conventional ASL we know of differences in labelling efficiency and the need for a different post-labelling duration in more elderly patients how did the authors account for this?<br /> More broadly what would the effect of differences in labeling efficiency be, given the labeling plane is unlikely to be perpendicular to the draining vessels?<br /> While the authors mention circadian effects there is no mention of controlling for other factors before the scan e.g. caffeine consumption, smoking, etc.<br /> Various mechanisms have been hypothesised to drive glymphatic pulsations. Assessing how physiological signals correlated with the flow may have been a useful proof of concept. Why was it not considered necessary to use a gated acquisition? Did the authors consider the potential impact of respiratory and cardiac pulsations on their measurements?<br /> ROI segmentation - manually selected by two raters, was this done independently and blinded? How were consensus ROIs agreed?<br /> Intensity values outwith MEAN +/- 2 SD were excluded from further analyses. This is justified as removing pulsatile blood. However, was this done independently for tag-on and tag-off? Does this mean slight differences were present in the number of voxels between the two?<br /> The starting points and parameter ranges are given in Eq'n 3, how were the ranges defined? Was there a reason for constraining the fit to positive values only, is there a risk of bias from this?<br /> While the main results appear to have a reasonable sample size n=2 for the reproducibility analysis is very limited. Additional datasets would be useful in properly interpreting the results.

      11. Results<br /> While the authors have taken some measures to reduce potential contamination from blood I would be concerned about the risk of surface vessels affecting the signal, and there does not seem to be an evaluation of how effective their measures are.<br /> The labeling pulse is applied in the oblique sagittal orientation, but in tandem with differing rates of blood flow and CSF drainage from the labeling plane does that not risk circulating flow from other slices potentially affecting the values?<br /> Figure 4. The authors focus on the parasagittal dura, but in both the subtraction image and panel C showing different slices at TI=1250 ms some movement appears visible in the opposing hemisphere. Similarly in S2 If the signal does represent CSF movement then this seems counterintuitive and should be explained.<br /> In Figures 4 and 5 the angulation of the TIME-SLIP tag pulse seems quite different. What procedure was used to standardise this, and what effect may this have on the results?

      12. Discussion<br /> Phrasing error 'which will be assessed in future studies'.<br /> I would suggest that some of the claims of novelty be moderated e.g. 'may facilitate establishment of normative values for CSF outflow' seems a stretch given multiple pathways exist and this is only considered one.<br /> More consideration should be given to some of the points mentioned in the results. The lack of validation should be properly discussed.

    1. Author Response

      Reviewer #1 (Public Review):

      This manuscript describes the generation and characterization of a mouse knockout model of Cep78, which codes for a centrosomal protein previously implicated in cone-rod dystrophy (CRD) and hearing loss in humans. Previous work in cultured mammalian cells (including patient fibroblasts) also indicated roles for CEP78 in primary cilium assembly and length control, but so far no animal models for CEP78 were described. Here, the authors first use CRISPR/Cas9 to knock out Cep78 in the mouse and convincingly demonstrate loss of CEP78 protein in lysates of retina and testis of Cep78-/- animals. Next, by careful phenotypic analysis, the authors demonstrate significant defects in photoreceptor structure and function in these mutant animals, which become more severe over a 9 (or 18) month period. Specifically, TEM analysis demonstrates ultrastructural defects of the connecting cilium and photoreceptor outer segments in the Cep78 mutants, which is in line with previously reported roles for CEP78 in CRD and in regulating primary cilia assembly in humans. In addition to a CRD-like phenotype, the authors also convincingly show that male Cep78-/- animals are infertile and exhibit severe defects in spermatogenesis, sperm flagella structure and manchette formation (MMAF phenotype). Furthermore, the authors provide evidence for an MMAF phenotype from a male individual carrying a previously reported CEP78 c.1629-2A>G mutation, substantiating that CEP78 is required for sperm development and function in mammals and supporting previously published work (Ascari et al. 2020).

      Finally, to identify the underlying molecular mechanism by which CEP78 loss causes MMAF, the authors perform some biochemical analyses, which suggest that CEP78 physically interacts with IFT20 and TTC21A (an ortholog of Chlamydomonas IFT139) and might regulate their stability. The authors conclude that CEP78 directly binds IFT20 and TTC21A in a trimeric complex and that disruption of this complex underlies the MMAF phenotype observed in Cep78-/- male mice. However, this conclusion is not fully justified by the data provided, and the mechanism by which CEP78 affects spermatogenesis therefore remains to be clarified.

      Specific strengths are weaknesses of the manuscript are listed below.

      Strengths:

      Overall, the phenotypic characterisation of the Cep78-/- animals appears convincing and provides new evidence supporting that CEP78 plays an important role in the development and function of photoreceptors and sperm cells in vertebrates.

      Weaknesses:

      1) The immunoprecipitation experiments of mouse testis extracts that were used for the mass spectrometry analysis in Table S4 were performed with an antibody against endogenous CEP78 (although antibody details are missing). One caveat with this approach is that the antibody might block binding of CEP78 to some of its interactors, e.g. if the epitope recognized by the antibody is located within one or more interactor binding sites in CEP78. This could explain why the authors did not identify some of the previously identified CEP78 interactors in their IP analysis, such as CEP76 and the EDD-DYRK2-DDB1-VprBP complex (Hossain et al. 2017) as well as CEP350 (Goncalves et al. 2021).

      We thank Reviewer #1 (Public Review) for agreeing with us on Cep78 plays an important role in photoreceptors and sperm cells development. We also appreciate Reviewer #1 (Public Review) for pointing out the weaknesses which helped us improve our study.

      For the immunoprecipitation experiments of mouse testis extracts, the antigenic sequence of the Cep78 antibody used is p457-741 (NP_932136.2). Cep78 was reported to bind DD-DYRK2-DDB1-VprBP complex, the 1-520aa is responsible for Cep78’s interaction with VprBP, and deletion of p450-497 didn’t affect Cep78’s interaction with VprBP, indicating importance of Cep78 (1-450aa) in interaction with VprBp (Hossain et al. 2017). Our anti-Cep78 antibody is generated using antigen sequence p457-741, the binding of p1-450aa to VprBP is not expected to be blocked by our anti-Cep78 antibody. However, VprBp was not detected by our IP-MS experiment. C-terminal region (395-722aa) of Cep78 overlaps with our Cep78 antibody’s antigenic region (p457-741), and was reported to interact with Cep350 (Goncalves et al. 2021). As a polyclonal antibody, our anti-Cep78 antibody didn’t block the interaction with p457-741, because we still identified Cep350 in our IP-MS. Thus, immunoprecipitation experiments using our Cep78 antibody identified some of the previously known interactors, and the interaction with VprBP may not be blocked by our Cep78 antibody.

      The detailed antibody information has now been added to Supplementary Table S7 in our revised supplementary materials.

      2) Figure 7A-D and page 18-25: based on IPs performed on cell or tissue lysates the authors conclude that CEP78 directly binds IFT20 and TTC21A in a "trimeric complex". However, this conclusion is not justified by the data provided, nor by the previous studies that the authors are referring to (Liu et al. 2019 and Zhang et al. 2016). The reported interactions might just as well be indirect. Indeed, IFT20 is a known component of the IFT-B2 complex (Taschner et al., 2016) whereas TTC21A (IFT139) is part of the IFT-A complex, which suggests that they may interact indirectly. In addition, the IPs shown in Figure 7A-D are lacking negative controls that do not coIP with CEP78/IFT20/TTC21A. It is important to include such controls, especially since IFT20 and CEP78 are rich in coiled coils that tend to interact non-specifically with other proteins.

      Thank Reviewer #1 (Public Review) for the comment on protein interaction between Cep78, Ift20, and Ttc21a. As the reviewer pointed out, IFT20 is a known component of the IFT-B2 complex (Taschner et al., 2016) whereas TTC21A (IFT139) is part of the IFT-A complex. Both IFT20 and TTC21a are located at peripheral areas of IFT-B and IFT-A (PMID: 32456460), and are not core components of IFT-A or IFT-B. It is still possible that these two proteins interact with each other. Actually, Liu et al. have revealed interaction between Ift20 and Ttc21a in human sperm (PMID: 30929735). Additionally, to mediate trafficking of ciliary axonemal components, the IFT machinery is recruited to the distal appendages (PMID: 30601682), which is adjacent to the distal end of the (mother) centriole wall, where at the (mother) centriole wall was reported to be located (PMID:35543806). Cep78 may interact with Ift20 and Ttc21a at centriole during cilliogenesis.s

      To rule out the nonspecific interaction between Cep78 and Ttc21a or Ift20, we added additional negative controls of Gapdh (Figure 7D) and Ap80-NB-HA (Supplementary Figures S7A-C) in co-IP as the reviewer suggested, and found that the interaction between Cep78 and Ttc21a or Ift20 is specific. To examine if Cep78, Ift20 and Ttc21a formed a complex, we fractionated testicular protein complexes using size exclusion chromatography, and found that Cep78, Ift20 and Ttc21a co-fractioned at the size between158 kDa to 670 kDa (Figure 7E), supporting the formation of a trimeric complex. And our immunofluorescent analysis by SIM also showed co-localization between Cep78 and Ift20 or Ttc21a (Figure 7F). All these data support the interaction among Cep78, Ttc21a and Ift20. In the revised manuscript, we rephrased “direct interaction” as “interaction” at page 18, line 393 in the revised manuscript.

      3) In Figure 7D, the input blots show similar levels of TTC21A and IFT20 in control and Cep78-/- mouse testicular tissue. This is in contrast to panels E-G in the same figure where TTC21A and IFT20 levels look reduced in the mutant. Please explain this discrepancy.

      Thank you for pointing this out. Deletion of Cep78 caused down-regulation of Ttc21a and Ift20 proteins. To better reveal the change of interaction between Ttc21a and Ift20, we have to normalize their interaction against expression levels. To achieve this, we increased the amount of total Cep78-/- testicular proteins to ensure that Ttc21a and Ift20 in the input are at similar levels between Cep78+/- and Cep78-/- testes. Using 3 times the amount of the Cep78+/- testicular proteins for Cep78-/- testicular proteins, we detected similar protein levels of Ttc21a and Ift20 between Cep78-/- and Cep78+/- testes, and the interaction between Ttc21a and Ift20 was shown to be down-regulated after Cep78 deletion. Consistently, the analysis of GAPDH as a loading control in input proteins showed more Cep78-/- testicular proteins than Cep78+/- testicular proteins subjected to analysis. To avoid confusion, we have added description of “The amount of Cep78-/- testicular proteins used was 3 times of that of Cep78+/- proteins” in the legend of Figure 7D in the revised version of manuscript.

      4) The efficiency of the siRNA knockdown shown in 7J-M was only assessed by qPCR (Figure S4), but this does not necessarily mean the corresponding proteins were depleted. Western blot analysis needs to be performed to show depletion at the protein level. Furthermore, it would be desirable with rescue experiments to validate the specificity of the siRNAs used.

      Thank the reviewer for the suggestion. To validate the specificity of the siRNAs used, we performed rescue experiments using rescue plasmid with siRNA targeting sequence synonymously mutated (Supplementary Table S6). The efficiency of siRNA knockdown and effects of rescue experiments were evaluated by both qPCR (Supplementary Figures S4.A-C) and Western Blot (Figures 7.J-K, Supplementary Figures S4.D-E, H-I). The results showed that siRNAs significantly reduced the expression of Cep78, Ift20, and Ttc21a at both mRNA (Supplementary Figures S4.A-C) and protein levels (Figures 7.J-K, Supplementary Figure S4.A-C). Meanwhile, with siRNA treatment, the rescue plasmids rescued the expression of Cep78, Ift20, and Ttc21a at both mRNA (Supplementary Figures S4.A-C) and protein levels (Figures 7.J-K, Supplementary Figures S4.D-E, H-I) compared with the control groups.

      In the rescue experiments, we further evaluated whether the effects are specific for Cep78, Ift20 and Ttc21siRNAs in the regulation of cilia and centriole lengths. The results showed that suppression of cilia and centriole lengths by Cep78, Ift20 and Ttc21siRNAs could be rescued by overexpression of rescue plasmids of Cep78syn-HA, Ift20syn-Flag and Ttc21asyn-Flag (Figures 7.N-S).

      5) Figure 7I: the resolution of the IFM is not very high and certainly not sufficient to demonstrate that CEP78, IFT20 and TTC21A co-localize to the same region on the centrosome, which one would have expected if they directly interact.

      Thank the reviewer for the constructive comments. To better demonstrate co-localization of CEP78, IFT20 and TTC21A on the centrosome, we overexpressed Cep78-Halo, Ift20-mCherry and Ttc21a-mEmerald in NIH3T3 cells by lentivirus, and acquired super-resolution images with SIM (N-sim, Nikon, Tokyo, Japan). The SIM results showed that Ift20 and Ttc21a co-localized with Cep78 (Figure 7F). Cep78 was previously reported to localize at the centriole (Goncalves et al., 2021). The co-localization of Cep78, Ift20 and Ttc21a indicated possible important roles of Cep78 in the regulation of Ift20 and Ttc21a in centriole. Our interaction analysis revealed that Cep78 interacted with Ift20 and Ttc21a (Figure 7A-C, Supplementary Figure S7), and formed a complex with Ift20 and Ttc21a (Figure 7E). Loss of Cep78 down-regulated the expression of and interaction between Ift20 and Ttc21a (Figures 7D, G-M).

      6) It is not really clear what information the authors seek to obtain from the global proteomic analysis of elongating spermatids shown in Figure 3N, O and Tables S2 and S3. Also, in Table S2, why are the numbers for CEP78 in columns P, Q and R so high when Cep78 is knocked out in these spermatid lysates? Please clarify.

      Thank the reviewer for the comments. Our global proteomic analysis showed that majority of differentially expressed proteins were down-regulated (Figure 3N), and many proteins are centrosome- and cilia-related proteins and important for sperm flagella and acrosome structures (Figure 3O), which provide insights of downstream molecular events in sperm flagella and acrosome defects after Cep78 deletion.

      As to the quantification of CEP78 expression in TMT-based proteomics analysis, the ratio between Cep78-/- and Cep78+/- is relatively high due to the ratio compression effect, a well-known phenomenon in TMT-based proteomics analysis (PMID: 25337643). The actual difference in protein expression is usually higher than the ratio calculated by TMT signals. Actually, our Western blot analysis of CEP78 protein showed absence of expression in Cep78-/- testis. Although TMT labelling has the disadvantage of ratio compression (PMID: 32040177,PMID: 23969891), it is widely used quantitative proteomics analysis, and is demonstrated to be able to identify key pathways and proteins (PMID: 30683861, 33980814).

      7) Figure 1F and Figure 4K: the data needs to be quantified.

      Thank the reviewer for this suggestion. For Figure 4K, we stained Cep78+/- and Cep78-/- spermatids with anti-Centrin 1 to measure the centriole length. The statistical data of centriole length were provided (Figure 4L), showing significantly increased centriole lengths in Cep78-/-spermatids.

      For Figure 1F, we quantified the immunofluorescence intensities of cone arrestin of light-adapted retinas of Cep78+/- and Cep78-/- mice at 3-month. The results indicate that immunofluorescence intensity of the cone arrestin was lower in Cep78-/- mice.

      8) Figure 2A: It is difficult to see a difference in connecting cilium length in control and Cep78-/- mutant retinas based on the images shown here.

      Thank you for your suggestion, we have stained retinal cryosections from Cep78+/- and Cep78-/- mice with anti-Nphp1 to visualize connecting cilium, and the data are provided in the revised Figure 2A-B.

      Reviewer #2 (Public Review):

      In this report, the authors have described the generation and characteristics of Cep78 mutant mice. Consistent with the phenotype observed in patients carrying the mutations in CEP78, Cep78 knock-out mice show degeneration in photoreceptors cells as well as defects in sperm. The author further shows the CEP78 protein can interact with IFT120 and TTC21a. Mutation in CEP78 results in a reduction of protein level of IFT120 and TTC21A and mislocalization of these two proteins, offering mechanistic insights into the sperm defects. Over all the manuscript is well written and easy to follow. Phenotyping is thorough. However, improvement of the background section is needed. In addition, some of the conclusion is not sufficiently supported by the data, warranting further analysis and/or additional experiments. The Cep78 KO mice model established by the author will be a useful model for further elucidating the disease mechanism in human and developing potential therapy.

      My comments are the following:

      1) Introduction. The statement that "CRD usually exists with combination of immotile cilia defects in other systems" is not correct. CRD due to ciliopathy can have cilia-related syndromic defects in other systems but it is a relatively small portion of all CRDs and the most frequently mutated genes are not cilia-related genes, such as ABCA4, GUCY2D, CRX.

      Thank the reviewer for the comments. We agree with the reviewer that only a small portion of CRDs are due to cilia defects and can have cilia-related syndromic defects in other systems. We corrected this statement in Line 4, Page 77-79 of the revised version of our manuscript. In our revised version, the statement has been changed to “A small portion of CRDs are due to retina cilia defects, and they may have cilia-related syndromic defects in other systems[1].”

      2) Introduction: Page 4 CNGB1 encodes channel protein and not a cilia gene. It should be removed since it does not fit.

      Thank the reviewer for the comment. According to the reviewer’s suggestion, we removed the description of “mutations in CNGB1 cause CRD and anosmia [3]” at Page 4, Line 81 in the revised manuscript.

      3) Page 5, given the previous report of CEP78 patients with retina degeneration, hearing loss, and reduced infertility, the statement of "we report CE79 as a NEW causative gene for a distinct syndrome...TWO phenotypes....." Is not accurate.

      Thank the reviewer for the comments. We have removed the statement of “NEW” causative gene in Page 5, Line 104 of the revised version of our manuscript. The revised sentence is “In this study, based on results of a male patient carrying CEP78 mutation and Cep78 gene knockout mice, we report CEP78 as a causative gene for CRD and male sterility.”

      4) Figure 1F, the OS of the cone seems shorter, which might be the reason for weaker arrestin staining in the mutant compared to the heterozygous. Also, it would be better to quantify the staining to substantiate the statement.

      Thanks for this suggestion. For Figure 1F, we have quantified the immunofluorescence intensity of cone arrestin in Cep78+/- and Cep78-/- light-adapted retinas at 3-month. The results indicate that immunofluorescence intensity of the cone arrestin was significantly lower in Cep78-/- mice.

      5) Figure 1K, panel with lower magnification would be useful to get a better sense of the overall structure defect of the retina. Is the defect observed in the cone as well?

      Thank the reviewer for the comment. As suggested by the reviewer, we have provided images of lower magnification to show the overall structure by TEM, showing disruption of most outer segment in Cep78-/- retina. It is difficult to distinguish whether the disordered outer segment structure belongs to a cone or a rod cell. The images are now provided as Figure 1L in the revised manuscript.

      We observed the abnormality of photopic b-wave amplitudes (Figure 1B, E) and decreased intensity of cone arrestin in light-adapted retinas (Figure 1F, G) in Cep78-/- mice, which indicate that the function of cone cells is damaged.

      6) Figure 2A, NPHP1 or other markers specifically label CC would be more useful to quantify the length of CC. Also need to provide a notation for the red arrows in Figure 2. In addition, the shape of CC in the mutant seems differ significantly from the control. It seems disorganized and swollen.

      Thank the reviewer for the suggestion. According to the reviewer’s suggestion, we have stained anti-Nphp1 in retinal cryosections from Cep78+/- and Cep78-/- mice to visualize connecting cilium, and quantified the length of CC. The results showed that connecting cilia were shorter in Cep78-/- mice. These data are showed in Figure 2A-B.

      Besides, we observed that upper parts of connecting cilia were swelled with disorganized microtubules in TEM (Figure 2E-G). The red arrows in Figure 2E-G indicated swelled upper part of connecting cilia and disorganized microtubules of Cep78-/- photphoreceptors, we added this description in the figure legend.

      7) Evidence provided can only indicate direct interaction among CEP78/IFT20/TTC21A.

      Thanks for the comment. To further validate the interaction between Cep78 and Ttc21a or Ift20, we performed reciprocal co-IP between Cep78 and Ttc21a or Ift20 by overexpression (Figure 7A-C), and also added relevant negative control of Gapdh (Figure 7D) and Ap80-NB-HA (Supplementary Figures S7A-C) in co-IP as negative controls to avoid non-specific interaction. Besides, we provided evidence that Cep78, Ift20 and Ttc21a formed a complex, as they all co-fractioned in a testicular protein complex at the size between158 kDa to 670 kDa using size exclusion chromatography (Figure 7E). Additionally, we performed super-resolution analysis of immunofluorescent localizations, and observed co-localization between Cep78 and Ttc21a or Ift20 by SIM. With these data, we think that Cep78 interacts with Ttc21a and Ift20 and they form a complex. We rephrased “direct interaction” as “interaction” in the manuscript.

      Reviewer #3 (Public Review):

      Authors were aiming to bring a deeper understanding of CEP78 function in the development of cone-rod dystrophy as well as to demonstrate previously not reported phenotype of CEP78 role in male infertility.

      It is important to note, that the authors 're-examined' already earlier published human mutation, 10 bp deletion in CEP78 gene (Qing Fu et al., 10.1136/jmedgenet-2016-104166). This should be seen as an advantage since re-visiting an older study has allowed noting the phenotypes that were not reported in the first place, namely impairment of photoreceptor and flagellar structure and function. Authors have generated a new knockout mouse model with deleted Cep78 gene and allowed to convey the in-depth studies of Cep78 function and unleash interacting partners.

      The authors master classical histology techniques for tissue analysis, immunostaining, light, confocal microscopy. They also employed high-end technologies such as spectral domain optical coherence tomography system, electron, and scanning electron microscopy. They performed functional studies such as electroretinogram (ERG) to detect visual functions of Cep78-/- mice and quantitative mass spectrometry (MS) on elongating spermatids.

      The authors used elegant co-immunoprecipitation techniques to demonstrate trimer complex formation.

      Through the manuscript, images are clear and support the intended information and claims. Additionally, where possible, quantifications were provided. Sample number was sufficient and in most cases was n=6 (for mouse specimens).

      The authors could provide more details in the materials and methods section on how some experiments were conducted. Here are a few examples. (i) Authors have performed quantitative mass spectrometry (MS) on elongating spermatids lysates, however, did not present specifically how elongating spermatids were extracted. (ii) In the case of co-IPs authors should provide information on what number of cells (6 well-plate, 10 cm dish etc) were transfected and used for co-IPs. Furthermore, authors could more clearly articulate what were the novel discoveries and what confirmed earlier findings.

      The authors clearly demonstrate and present sufficient evidence to show CEP78/Cep78 importance for proper photoreceptor and flagellar function. Furthermore, they succeed in identifying trimer complex proteins which help to explain the mechanism of Cep78 function.

      The given study provides a rather detailed characterization of human and mouse phenotype in response to the CEP78/Cep78 deletion and possible mechanism causing it. CEP78 was already earlier associated with Cone-rod dystrophy and, this study provides a greater in-depth understanding of the mechanism underlying it. Importantly, scientists have generated a new knock-out mouse model that can be used for further studies or putative treatment-testing.

      CEP78/Cep78 deletion association with male infertility is not previously reported and brings additional value to this study. We know, from numerous studies, that-testes express multiple genes, some are unique to testes some are co-expressed in multiple tissues. However, very few genes are well studied and have clinical significance. Studies like this, combining patient and animal model research, allow to identify and assign function to poorly characterized or yet unstudied genes. This enables data to use in basic research, patient diagnostics and treatment choices.

      We would like to thank Reviewer #3 (Public Review) for positive comments on our work.

      As to the suggestions to provide some details in the materials and methods by the reviewer, we added the description of STA-PUT method for spermatids purification at Page 34, Line 729-741 in the revised manuscript, the amount of cells used for co-IPs “10 cm dish HEK293T were transfected (Vazyme, Nanjing, China) wit 5μg plasmid for each experimental group.” at Page 36, Line 783-784 in the revised manuscript.

      We also highlighted our new discovery and ensured that all previous published findings are accompanied by references, we added “We further explored whether c.1629-2A>G mutation in this previously visited patient would disturb CEP78 protein expression and male fertility. Blood sample was collected from this patient and an unaffected control for protein extraction.” at Page 17, Line 335. We also added “The major findings of our study are as follows: we found CEP78 as the causal gene of CRD with male infertility and multiple morphological abnormalities of the sperm flagella using Cep78-/- mice. A male patient carrying CEP78 c.1629-2A>G mutation, whom we previously reported to have CRD [8], was found to have male infertility and MMAF in this study. Cep78 formed a trimer with sperm flagella formation enssential proteins IFT20 and TTC21A (Figure 8), which are essential for sperm flagella formation[16, 18]. Cep78 played an important role in the interaction and stability of the trimer proteins, which regulate flagella formation and centriole length in spermiogenesis. ” at the first paragraph of discussion, which is Page 21, Line 447-456 of our revised manuscript.

    1. Author Response

      Reviewer #1 (Public Review):

      This excellent manuscript challenged the premise that NF-kappaB and its upstream kinase IKKbeta play a role in muscle atrophy following tenotomy. Two animal models were used - one leading to enhanced muscle-specific NF-kappaB activation and the other a muscle-specific deletion. In both models, there was no significant relationship to observed muscle changes following tenotomy. Overall this work is significant in that it challenges the existing dogma that NF-kappaB plays a crucial role in muscle atrophy.

      Surprisingly the authors noted that there were basal differences observed in the phenotypes of their models that were sex-dependent. They note that male mice lose more muscle mass after tenotomy and specifically type 2b fiber loss.

      Overall this is an outstanding study that challenges the notion that NF-kappaB inhibitors are likely to improve muscle outcomes following injuries such as rotator cuff tears. Its main weakness is that there were no pharmacological arms of investigation; this fails to definitively exclude the hypothesis that inhibition may exert some effect in healing, perhaps in surrounding non-muscle matrix tissue that in turn may assist in healing.

      Thank you for your careful and thoughtful review. We agree that the finding that NFkb is not driving tenotomy-induced atrophy is both surprising and interesting. We look forward to further uncovering the atrophic mechanisms responsible. We also agree that an investigation using pharmacological NFkb inhibitors will improve our understanding of the full scope of the role of NFkb in the tenotomy pathology. As you and another reviewer note, this work has only blocked NFkb signaling in the mature muscle fiber and thus cannot assess the role of NFkb in satellite cell, fibroblast, immune cell activation etc in the healing response. However, we avoided using these inhibitors in this study due to the potential for these systemic effects to obscure the role of NFkb in the muscle fiber. While we believe that a pharmacological investigation is beyond the scope of this study, it will make an excellent follow on investigation.

      Reviewer #2 (Public Review):

      The primary strength of this paper is a rigorous approach to 'negative' data. Did the authors definitively prove that NF-kB has no role in the tenotomy-induced atrophy? Probably not entirely, since there are limitations of the mouse model and the knockdown mice. There cannot be complete elimination of load since mice heal with some scar tissue, and the knockdown is not complete elimination. However, even with these limitations, this presents important findings that tenotomy, which induces mechanical unloading of the muscle-tendon unit, provides a unique biomechanical environment for the muscle to undergo atrophy, which warrants a more in-depth look given that these injuries are unique and extremely common. It must be mentioned that the results are entirely supported by their data and that even though the model is not 'perfect' it truly supports that NF-kB has a limited role in atrophy. The sex-mediated differences based on autophagy are a secondary hypothesis and are interesting but possibly less clinically relevant based on the differences shown.

      We appreciate your thoughts on the “negative” data in this study. A manuscript in which the data refute your hypothesis and that of the field is difficult to write. There is a higher burden of validation and closer scrutiny of limitations. We agree that the model does have some limitations, but overall strongly supports a limited role for NBkb in tenotomy-induced muscle atrophy.

      The important next step for this group and others is to evaluate the 'how and why' of tenotomy atrophy if not through NF-kB. Is it that there are many redundant processes that the muscle may have to circumnavigate the NF-kB pathway given that it is so ubiquitous that the authors didn't see a difference? Could it be differences in axial vs appendicular muscle? Or should there be a closer look at the mechanosensors in the muscle cells to determine if there are other key drivers of atrophy? Regardless, this paper shows that tenotomy-induced muscle atrophy is unique and supports the conclusion that muscle has many ways to atrophy based on the injury it undergoes.

      We agree that the major next step for this work is to investigate the mechanism(s) responsible for tenotomy-induced atrophy. Autophagy in particular needs a more thorough investigation using autophagic inhibitors in naive wildtype mice to investigate its role in the sex-specificity of tenotomy-induced atrophy. The question of axial vs. appendicular muscle is intriguing. There could also be an upper vs. lower body difference that is worth exploring in future work.

      Reviewer #3Public Review):

      The authors provided thorough analyses of muscle morphology, biochemistry, and function, which is a major strength of the study. However, there are some key confounding variables authors failed to address. For example, the difference in the estrous cycle in female animals was not controlled. The study could have been significantly improved by controlling sex hormone levels or at least testing differences in response to injury.

      We appreciate your careful and insightful review of our work. We designed this study to assess the role of myofiber NFkb in tenotomy-induced atrophy, which led us to a rigorous assessment of morphology, biochemistry and function, which we agree is the strength of the study. We also agree that a major limitation of this study is that the secondary observations of sex-specificity and autophagic signaling are not as well controlled or supported. This is because these observations were made at the end of the study when the histological analyses were completed by the blinded rater. The sex-specificity in the basophilic puncta that the rater observed sparked us to reconsider the sex-specificity in our other data and to stain for autophagic vesicles. As you suggest, to rigorously assess sex-specificity it would be good to control of estrous cycle and analysis of sex hormones which would require initiation of another study, planning for these variables in advance. We think this is beyond the scope of the current question of the role of NFkb in tenotomy-induced atrophy but think it should be undertaken as a follow on to eliminate confounding variables of genetic manipulation and tamoxifen treatment.

      However, since we still need to report the sex specificity we observed while ensuring that our findings are not misconstrued, we reviewed the language in the manuscript to emphasize that these are retrospective observations that require further investigation. We have also added discussion of these variables and their potential influence on the results to the Discussion.

      Discussion: “Additionally, it is important to note that estrous cycle was not controlled in these mice and sex hormone levels weren’t measured in this study. These preliminary observations, though intriguing, will require more rigorous follow up evaluations to define the interaction between sex, tenotomy, and autophagy in naïve wildtype mice.”

      Furthermore, more data are needed to link NFkB signaling and autophagy to make any kind of conclusions. Overall, in the current form of the manuscript, the presented data seem underdeveloped, and the addition of more supporting data could significantly improve the quality of the manuscript and enhance our understanding of NFkB signaling and muscle wasting in rotator cuff injury.

      We agree that more data are needed to complete the picture of autophagy in tenotomy-induced muscle atrophy. The p62 and LC3 positive intracellular puncta in male tenotomized muscle are distinctive, but only limited conclusions can be drawn physiologically because 1) they are only present in a fraction of fibers and 2) it is impossible to tell whether they result from increased autophagic flux or altered vesicle processing. Western blot for LC3 (and now p62) indicates only small changes in total protein, but since these proteins are synthesized and degraded during active autophagy, it is possible for their levels to remain constant while flux increases. Direct measures of autophagic flux would require treating mice with an autophagosome block which would require initiation of another study. However, we agree with the reviewer that we can add some additional measures to better characterize the instantaneous state.

      We have added analysis of p62 protein expression to LC3 since p62 protein content in muscle can be decoupled from LC3 (PMID: 27493873). We also added expression data for genes involved in autophagy (Lc3b, Gabarapl1, Becn1, Bnip3, and Atg5). Finally, we have commented on the limitations of our data in the Discussion.

      Discussion: “Evidence for autophagy regulating tenotomy-induced atrophy has been mounting over recent years (Bialek et al., 2011; Gumucio et al., 2012; Joshi et al., 2014; Ning et al., 2015; Hirunsai & Srikuea, 2021). The evidence presented here supports this contention, but we find surprisingly small effect sizes for all markers investigated. This could be because we are not directly assessing autophagic flux and so are missing some temporal dynamics since synthesis and degradation are ongoing simultaneously.”

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Summary of changes

      We thank all three reviewers for their constructive feedback on our manuscript. We have now perfomed extensive experiments, analyses, and rewriting of our manuscript to address all their concerns. We believe that these changes significantly improve the rigor of our conclusions and the clarity of our discussion. We highlight below key experiments, analyses, and re-writing in the revised manuscript, which is followed by a detailed point-by-point response. 1) We have now performed experiments using alternative uORF donor sequences to demonstrate the robustness of uORF repression to changes in uORF length.

      2) By mutating out near-cognate start codons within uORF2, we have now demonstrated that near-cognate start codon initiation within uORF2 does not impact repression.

      3) To quantify the dynamic range of our dual luciferase assay, we have now mutated out the NLuc start codon. We find that repressive uORF2 constructs have expression levels that are still > 20-fold above the no-startcodon control values.

      4) We have now analyzed ribosome profiling coverage on uORFs (supplementary figure 5), and we show that several uORFs with known elongation stalls lack evidence of 40S and 80S subunit queueing 5′ to stalls, consistent with our collision-induced ribosome dissociation model.

      5) We have now provided detailed discussion of footprint length choice in our modeling and the role of codon choice in our experiments.

      6) We have now added a new main figure that provides a graphical representation of reactions considered in our kinetic modeling. This figure will make our modeling assumptions more transparent and accessible to readers with less computational expertise.

      Reviewer #1:

      Summary

      Bottorff et al test several models of uORF-mediated regulation of main ORF translation using the uORF2 of CMV UL4 gene, a system that has been previously experimentally characterized by the authors. They first train a computational model to recapitulate the observed experimental effects of mutations in uORF2, and then use the model to infer which uORF parameters may confer buffering against reduced ribosome loading that typically occurs upon biological perturbation. The authors then find that: i) the uORF2 confers buffering, ii) the uORF2 mechanism adjusts to computational predictions for the collision-mediated 40S dissociation model of uORF-mediated regulation. Significance

      This manuscript represents an interesting effort to distinguish mechanisms of uORF-mediated regulation based on mathematical modeling, and might be useful for the translation community. My expertise: Regulation of translation.

      We thank Reviewer 1 for a succinct summary of our main conclusions and highlighting the significance of our work to the translation community.

      Major comments 1) Figure 4 (Figure 5 in revised version): Which is the dynamic range of the WT vs the no-stall construct? In the WT construct, main ORF translation is already quite repressed, and detecting further repression may be more difficult than in the no-stall construct. In other words, the differences that authors are detecting between the WT and no-stall constructs might be due to a potential lower dynamic range of the WT construct

      To measure the dynamic range of our reporter assay, we have now mutated the start codon of the NLuc reporter ORF. We reasoned that this construct provides a lower bound on measurable NLuc signal. The resulting noNLuc-start-codon reporter expression was at least 20-fold lower than WT construct (Fig. S1A). Importantly, we also see that the raw NLuc signal of the WT construct is at least 20-fold over the background (Fig. S1B). Thus, the differential response of WT and no-stall constructs is not simply due to lower dynamic range of the WT construct.

      2) The authors conclude that uORF2 follows the collision-mediated 40S dissociation model, based on fitness of their experimental results with predictions from their mathematical modeling regarding distance between uORF2 initiation codon and the stalling site. But can the authors actually directly prove that there are no 40S subunits accumulating behind the stalled 40S using Ribo-Seq or TCP-Seq?

      We have now examined existing 80S Ribo-seq and 40S TCP-seq datasets to determine whether queued 40S or 80S ribosomes can be detected at known stall sites. Stern-Ginossar et al. (2012) performed 80S Ribo-seq during hCMV infection. In this dataset, while the stall at the UL4 termination codon has a very high ribosome density, few elongating ribosomes are seen queued behind the stalled 80S, consistent with an absence of 80S ribosome queuing (Fig. RR1). By contrast, another well-studied elongation stall in the Xbp1 mRNA shows ~30 nt periodic peaks in ribosome density indicative of ribosome queues (Fig. RR2). An important caveat is that queued ribosomes could be systematically underrepresented in standard Ribo-seq datasets due to incomplete nuclease digestion (Darnell et al., 2018; Subramaniam et al., 2014; Wolin and Walter, 1988).

      Since there is no 40S TCP-Seq dataset during hCMV infection, we examined other known stalls on human mRNAs (Fig. RR3 below; Fig. S5 in our manuscript). We examine small ribosomal subunit profiling data from human uORFs with conserved amino acid-dependent elongating ribosome stalls (Figure S5A). Ribosome density read counts are low across all of these uORFs, showing no evidence of ribosome queuing. Subtle queues might not be observed given these low read counts from insufficient capture of small ribosomal subunits. Nevertheless, we do not observe any evidence of queueing upstream to elongating ribosome stalls in this data. We note these observations in our Discussion section as follows (lines 688-712): “Although our data from UL4 uORF2 does not support the queuing-mediated enhanced repression model (Fig. 1C) (Ivanov et al., 2018), this model might describe translational dynamics on other mRNAs. Translation from near-cognate start codons is resistant to cycloheximide, perhaps due to queuing-mediated enhanced initiation, but sensitive to reductions in ribosome loading (Kearse et al., 2019). Loss of eIF5A, a factor that helps paused elongating ribosomes continue elongation, increases 5′ UTR translation in 10% of studied genes in human cells, augmented by downstream in-frame pause sites within 67 codons, perhaps also through queuing-mediated enhanced initiation (Manjunath et al., 2019). There is also evidence of queuing-enhanced uORF initiation in the 23 nt long Neurospora crassa arginine attenuator peptide (Gaba et al., 2020) as well as in transcripts with secondary structure near and 3′ to start codons (Kozak, 1989). Additional sequence elements in the mRNA might determine whether scanning ribosome collisions result in queuing or dissociation. Small subunit profiling data (Wagner et al., 2020) from human uORFs that have conserved amino acid-dependent elongating ribosome stalls do not show evidence of scanning ribosome queues (Fig. S5A), consistent with the collision-mediated 40S-dissociation model. Subtle queues might not be observed given these low read counts from insufficient capture of small ribosomal subunits.”

      3) Experimental data in Figures 2, 4 and 5 include 3 technical replicates. Sound conclusions typically require biological replicates. Further, the number of replicates in Figure 6 has not been indicated.

      As suggested by the reviewer, we have now included biological replicates for all luciferase assays [Figures 2, 5, 6, and 7 that were previously 2, 4, 5, and 6] that were technical replicates in the previous version. This replication does not alter any of our conclusions. We have now included the number of biological replicates for Figure 7 (former Figure 6).

      Minor comments 1) Figure 4 (Figure 5 in revised version): It is strange that a PEST sequence had to be introduced in the construct of part B in order to observe reliable differences, but not in constructs of parts A and C. Can the authors explain?

      We introduced the PEST sequence for part B because we wanted to measure the reporter response to treatment with a drug that reduces translation initiation. The PEST sequence increases the turnover rate of the reporter protein. Without the PEST sequence, the luminescence signal will be dominated by the reporter expression before the drug was added. However, in parts A and C, initiation rate was altered through genetic mutations and measuring their expression under basal conditions does not require a PEST sequence. Except in situations where a quick dynamic response needs to be measured such as in the drug treatment in part B, reporters without PEST sequences are simpler to interpret due to the absence of proteasome-mediated degradation and higher overall signal.

      2) Figure 6 (Figure 7 in revised version): Unfortunately, the authors find no other human uORFs with terminal diproline motifs that are so essential for main ORF repression as uORF2. In this light, can the authors comment further on the usefulness of their findings for human genes? Have the authors searched for viral RNAs with similar features? Please, notice that the gene PPP1R37 has not been mentioned in the main text.

      The UL4 and human uORFs differ in their sequence determinants of translational repression. UL4 uORF2 represses translation entirely through nascent peptide-mediated stalling. While the terminal diproline motif in UL4 uORF2 is necessary for main ORF repression, it is not sufficient. A number of other residues in the UL4 uORF2 peptide play a critical role in repression (Cao and Geballe, 1996; Matheisl et al., 2015). Thus, it is not surprising that human uORFs that we identified based solely on the presence of terminal diproline motifs confer only modest decrease in repression upon mutating the terminal proline. The human uORFs containing these terminal diprolines may partially repress translation via nascent peptide effects, but the majority of the repression likely arises from siphoning of scanning ribosomes from the main ORF (Fig. 1A in our manuscript) and inefficient termination following translation of consecutive prolines (Cao and Geballe, 1996; Cao and Geballe, 1998; Janzen et al., 2002; Matheisl et al., 2015). Our current understanding of features in nascent peptide that mediate translational repression (Wilson et al., 2016) is insufficient to bioinformatically identify elongation-stall containing uORFs in human or viral genomes, so we simply looked for terminal diprolines. Despite this limitiation, we note that the modeling approaches and experimental perturbations developed in our work can be applied to study ribosome kinetics on any repressive uORF, independent of the mRNA or peptide sequence underlying the repression. As suggested by Reviewer 1, we have now included all the studied uORFs in the main text.

      Reviewer #2:

      Summary

      In this paper, the authors are exploring the uORF regulatory mechanism. They first discussed five general models how uORFs might work to repress and buffering main ORF translation, then they mainly focus on the UL4 uORF2 for the potential mechanism. They use both computer modeling and experimental validation with reporter assay in 293t cell line. Based on their model, and few experimental results when they change the translation initiation rate and/or length of dORF, they propose it may work through 40S dissociation model, since the buffering effect is not uORF length sensitive. Significance

      It is an interesting area, using modeling with experiment validation to understand uORF regulation mechanism, the kinetics and interplay between different translation steps, it will help us to understand uORF buffering in stress conditions. Also bring modeling method with reporter validation to the translation field, will provide clues to the molecular mechanism study, especially in complex situation.

      We thank Reviewer 2 for a comprehensive summary of our work and noting the uniqueness and usefulness of our experiment-integrated modeling approach to the translation field.

      Major comments • Are the key conclusions convincing? The modeling for different mechanisms is insightful, but some modeling parameters and experimental validation are not conclusive and validation of few of them can enforce the conclusions.

      We have now performed key validation experiments suggested by Reviewer 2, notably: 1. mutating out of nearcognate start codons in the UL4 uORF2 coding sequence and 2. increasing UL4 uORF2 length using two unrelated protein coding sequences. Please see responses to specific comments below for further details.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Yes, the part about queuing and length sensitive is not convincing to me, it should be modified and reduce the statement strength.

      We agree about reducing the statement strength and have altered our statements as suggested by the reviewer. Specifically, we have now expanded the rationale for the choice of footprint lengths of 40S subunits. Please see responses to specific comments below for further details.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. Yes, please see the specific concerns • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. They will need to re-think about the modeling, and validation in Figure 5, there are validation experiments that can be done in weeks and in a cost-efficient manner that can enforce the conclusions.

      We have performed the experiments suggested by the reviewer. See responses below.

      • Are the data and the methods presented in such a way that they can be reproduced? Most of them are good • Are the experiments adequately replicated and statistical analysis adequate? Yes Specific concerns 1) It is a bit confusing to me in Figure 2C, the reporter assays, that non-start codon reporter and non-stall reporter has same expression level. In theory, the non-stall reporter still has uORF there, so it should repress main ORF expression, and have lower expression level than the non-start reporter, where there is no uORF, no repression. In other uORFs they tested in Figure 6 (Figure 7 in revised version), the non-stall reporters are lower than non-start reporter. Since data they use to build the model is Figure 2B, and calculate the parameters for the whole paper, I just want to make sure it is making sense. I noticed there is another CTG in frame on the 4th codon, this may be alternative start codon in the non-start reporter to trigger some repression.

      To address Reviewer 2’s concern about alternative start codon usage in the non-start reporter, we have now mutated out all near-cognate start codons known to initiate translation with high frequency (Kearse and Wilusz, 2017). These near-cognate start codons consisted of Leu4 CTG, Leu11 CTG, Leu14 TTG, and Leu15 CTG and were mutated to CTA, CTA, TTA, and CTA, respectively. We find that removing the uORF2 near-cognate start codons does not significantly alter NLuc expression (Fig. S1A). This experiment merely rules out one possible source of these similar expression levels. We expect that uORF2 no-start and no-stall reporters’ very similar NLuc expression levels can be rationalized for the several following reasons: 1. uORF2 initiation frequency is quite low. We estimate it to be 5% or less in our modeling based on previous measurements (Cao and Geballe, 1995). Thus, the maximum theoretically possible difference in NLuc expression between no-start and no-stall reporters is 5% or less. 2. Further, re-initiation after uORF2 translation is frequent. We estimate it to be around 50% within our manuscript, which will further decrease repression in the no-stall mutant. Thus, we expect the no-stall mutant to decrease the flux of scanning ribosomes at the main ORF by 2-3% compared to the no-start mutant. 3. Finally, a subtle but important point to note is that our reporter assays are measuring NLuc expression and not the flux of scanning ribosomes at the main ORF NLuc start codon. Since NLuc ORF has a strong start codon context (GCCACC) and the flux of scanning ribosomes is already high for the no-start and no-stall mutants, slight changes in the flux of scanning ribosomes are unlikely to impact NLuc expression. This is because start codon selection is not rate-limiting for protein expression under these conditions. This last point is clearly seen in high throughput reporter assays where the mutations which impact reporter expression in a non-optimal context have little or no effect in an optimal context (see Fig. 5B, 5C in Noderer et al., 2014).

      Thus, in summary, even if the flux of scanning ribosomes is decreased by 3-5% by the no-stall uORF2 mutant compared to the no-start uORF2 mutant, we expect the effect on NLuc expression to be negligible and below the limit of our experimental resolution (which is ~10% based on the standard error across technical replicates).

      Regarding the different behavior of the human uORFs in our manuscript and UL4 uORF2, note the response to Reviewer 1 regarding the usefulness of our human uORF findings.

      2) All the modeling and prediction the authors do are based on average, but we know translation is very heterogeneous. For each ribosome or each 40S, the kinetics varies a lot, the authors should discuss about this part.

      We now discuss translation heterogeneity in the Discussion section in lines 781-794 as follows: “Translation heterogeneity among isogenic mRNAs has been observed in several single molecule translation studies (Boersma et al., 2019; Morisaki et al., 2016; Wang et al., 2020; Wu et al., 2016; Yan et al., 2016). This heterogeneity may arise from variability in intrasite RNA modifications (Yu et al., 2018), RNA binding protein occupancy, or RNA localization. We do not capture these sources of heterogeneity in our modeling since the observables in our simulations are averaged over long simulated time scales and used to predict only bulk experimental measurements. However, our models studied here can readily extended through compartmentalized and state-dependent reaction rates (Harris et al., 2016) to account for the different sources of heterogeneity observed in single molecule studies.”

      3) For modeling related with the queuing-mediated model in Figure 1C. they use 30nt as the ribosome length to count the potential queuing to start codon. But 30nt is the 80S protected fragment with specific conformation. The protected fragment for 80S will change based on different status of ribosome conformation or elongation step. More importantly, for queuing, it is 40S, so they may have a different size. Based on previous 40S ribosome profiling (Archer, Stuart K., et al. Nature 535.7613 (2016): 570-574. And other papers), the length can vary from 19nt to very long, so I don’t think the 30nt length can be used to model queuing in 40S and length sensitivity in the uORF working mechanism.

      We thank Reviewer 2 for highlighting this issue of footprint length heterogeneity that we had not previously addressed. In our modeling, we assume homogenous ribosome footprints. While, heterogeneous ribosome footprints have been observed for small ribosomal subunits (Bohlen et al., 2020; Wagner et al., 2020; Young et al., 2021) and elongating ribosomes (Lareau et al., 2014; Wu et al., 2019), we believe that our modeling of homogenous footprint length is appropriate for the following three reasons: First, with respect to the small ribosomal subunit footprint heterogeneity, we note that TCP-seq studies include crosslinking of eukaryotic initiation factors (eIFs). The presence of these eIFs is thought to be the main source of heterogeneity in scanning ribosome footprints (Bohlen et al., 2020; Wagner et al., 2020). Although crosslinking is often performed, it is not necessary to obtain scanning ribosome footprints, and homogenous 30 nt footprints are observed in the absence of crosslinking (Bohlen et al., 2020). Notably, figure S2 of Bohlen et al. (2020), reproduced as Fig. RR4 below, shows that scanning SSU footprint lengths are tightly distributed around 30 nt when crosslinking is not used.

      Second, in the context of the strong, minutes-long UL4 uORF2 elongating ribosome stall (Cao and Geballe, 1998), collided ribosomes will wait for long periods of time relative to normal elongating or scanning ribosomes. Thus, we expect that associated eIFs dissociate from these dwelling ribosomes as they typically do during start codon selection or during translation of short uORFs (Bohlen et al., 2020). Third, a significant fraction of mRNAs exhibit cap-tethered translation in which eIFs must dissociate from ribosomes before new cap-binding events, and therefore collisions, can occur (Bohlen et al., 2020). Based on above three points, we believe that modeling the footprint of only the scanning ribosomes, and not the associated eIFs, using a single 30 nt length is biologically reasonable. Footprint length heterogeneity of elongating ribosomes is much less drastic than that observed for scanning ribosomes and likely arises from different conformational states such as an empty or occupied A site (Lareau et al., 2014; Wu et al., 2019). While the different elongating ribosome footprints arise from differences in mRNA accessibility to nucleases, it is unclear whether the distance between two collided ribosomes changes across different ribosome conformations. For instance, the queues of elongating ribosomes observed at the Xbp1 mRNA stall occur at regular ~30 nt periodicity (Fig. RR2). Additionally, the stalled elongating ribosome is stuck in a pretranslocation state and has a defined, ~30 nt footprint (Wu et al., 2019), which only leaves room for 1 5′ queued ribosome within UL4 uORF2 whose footprint is conformation sensitive. Finally, a small degree of scanning footprint heterogeneity is also accounted for by our modeling of backward scanning which effectively introduces heterogeneity to collided scanning ribosome location on mRNAs (Figures 6A, S2D in our manuscript). We have now summarized the above points in the Discussion section of the revised manuscript (lines 713-740).

      4) For Figure 5B (Figure 6B in revised version), besides the modeling length part I have mentioned above, when the authors increase the length of uORF, the sequence is also changed, which may introduce other side effect. So, if the authors want to conclude about the queuing part, they should rethink about the length for both modeling and validation, plus control for the sequence they added to increase the length of uORF, for example use different sequence when manipulate the length.

      As suggested by the Reviewer, we have now varied the length of uORF2 using a different, unrelated donor sequence encoding the FLAG peptide and observe similar results (Fig. S4 in our manuscript) to our original experiment with the YFP-encoding sequence (Fig. 6B in our manuscript). A slight trend towards derepression with longer uORFs is observed in both cases. This effect might arise due to decreased stall strength caused by higher nascent peptide protrusion out of the exit tunnel leading to cotranslational folding (Bhushan et al., 2010; Nilsson et al., 2015; Wilson et al., 2016) or nascent chain factors (Gamerdinger et al., 2019; Weber et al., 2020) exerting a pulling force on the peptide. Importantly, we do not see the periodic change in repression predicted by the queueing model (Figure 6A, yellow-green lines).

      Minor comments • Specific experimental issues that are easily addressable. 5) It is unclear how the luciferase assays were analyzed considering the background noise. If the NLuc expression is low, close to the background, then how to extract or normalize the background will influence the expression level, thus fold change for different reporter/condition.

      To account for the luciferase background, we subtracted background from measured data values. To show that expression is rarely close to background (from mock transfections), we included a supplementary figure showing raw NLuc and FLuc values (Fig. S1B). Also note the response to Reviewer 1 regarding a no-start-codon control having a 20-fold lower signal than the WT UL4 uORF2 construct.

      • Are prior studies referenced appropriately? yes • Are the text and figures clear and accurate? Mostly good • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? Have a main figure about the modeling part.

      As suggested by the Reviewer, we have now added visual representations of the reactions as a new main figure (Fig. 3). We also moved the modeling workflow figure from the supplementary set of figures to this main figure (Fig. 3). We thank the reviwer for this suggestion that greatly improves the presentation of our modeling methodology

      • Place the work in the context of the existing literature (provide references, where appropriate). Recent years, there has been a lot of study about small open reading frames, while for uORFs are known to repress translation, the regulatory mechanism is not known yet, there are just different models not validated yet (Young & Wek, 2016). Also, under normal conditions and stress conditions, uORF can play both repressive and stimulative role in main ORF translation (Orr, Mona Wu, et al. NAR 48.3 (2020): 1029-1042.). This paper is the first study to put all the uORF working hypothesis with buffering effect together, they use modeling to explain how under each hypothesis, buffering may happen or not. >• State what audience might be interested in and influenced by the reported findings. It will be interesting to people, who study molecular biology, biochemistry for translation regulation, especially uORFs. The modeling people may also find it interesting, how they could adapt modelinbeew keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. I have extensive experience working in the translation regulation field and I feel extremely comfortable to discus all the experimental part including individual reporters as well as genome wide. But I do not consider an expert in the modelling section of this work.

      Reviewer #3 :

      Summary Small ORFs are prevalent in eukaryotic genomes with variety of functions. Recent technological advances enable their detection, yet our understanding on the mode of action remains quite rudimentary. The manuscript by Bottorff, Geballe and Subramaniam aims at elucidating the function of UL4 uORF in the CMV, and thus, it is on timely and topical research. The authors measure the uORF -controlled expression of the well-studies UL4 uORF and kinetically model the initiation behavior. Within a second uORF, a diproline pair controls initiation of the downstream main ORF sensing ribosomal collisions between a scanning small subunit and an 80S positioned at the canonical start of the main ORF. The stalling at both proline codons is envisioned as a kinetic window to sense any elongation-competent 80S at initiation and thus, control the ribosomal load and expression. Such diproline tandems are present in some uORFs in human transcriptome, hence representing more pervasive control mechanism. Significance I am unable to comment in depth on the modeling algorithms and simulations as this is outside of my expertise. The experiments are reasonably designed to test various models of uORF regulation and set the frame for the modelling. The idea that various stress factors would decrease canonical initiation and consequently would reflect the number of initiating ribosomes are adequately tested by varying the number of initiating ribosomes. The discovery of the two terminal prolines, that are also found in other human uORFs, is appealing mode of controlling stalling-driven downstream initiation. However, the lack of experimental support with the human uORFs may indicate additional contributions. This raises the question as to whether the proline codon identity plays a role? Since codons are read with different velocity which is mirrored by the tRNA concentration. It would be good to address whether special proline codons have been evolutionarily selected in CMV and whether the kinetics of stalling strongly depends on the codon identity. Are both prolines in the tandem using the same codon? Along that line, are the same proline codons used in the human diproline-containing counterparts? Consequently, the P to A mutation may have altered the codon usage and could be the reason for the nonlinear effect in the human sequenced. In this case, it would make sence to use Ala-codons with similar codon usage as the natural prolines?

      We thank the Reviewer for raising this point about the role of codon usage. The tandem proline residues do not use the same codon (CCG then CCT). The two C-terminal proline residues in uORF2 are necessary for the elongating ribosome stall (Bhushan et al., 2010; Degnin et al., 1993; Wilson et al., 2016), but it has been previously shown that the identity of the codon does not significantly impact repression (Degnin et al., 1993). The human uORFs generally have 1 of the 2 Pro codons in common with the uORF2 Pro codons. Given that most of the human uORF P to A mutations behave similarly (Figure 7) irrespective of the original proline codon, we believe that codon usage does not impact repression by these uORFs. Moreover, as explained in response to Reviewer 1 and 2’s questions, we believe that the human uORFs containing terminal diprolines may partially repress translation via nascent peptide effects, but the majority of the repression likely arises from efficient siphoning of scanning ribosomes from the main ORF by the uORF (Fig. 1A in our manuscript).

      References

      Bhushan, S., Meyer, H., Starosta, A.L., Becker, T., Mielke, T., Berninghausen, O., Sattler, M., Wilson, D.N., and Beckmann, R. (2010). Structural Basis for Translational Stalling by Human Cytomegalovirus and Fungal Arginine Attenuator Peptide. Molecular Cell 40, 138–146.

      Boersma, S., Khuperkar, D., Verhagen, B.M.P., Sonneveld, S., Grimm, J.B., Lavis, L.D., and Tanenbaum, M.E. (2019). Multi-Color Single-Molecule Imaging Uncovers Extensive Heterogeneity in mRNA Decoding. Cell 178, 458–472.e19.

      Bohlen, J., Fenzl, K., Kramer, G., Bukau, B., and Teleman, A.A. (2020). Selective 40S Footprinting Reveals Cap-Tethered Ribosome Scanning in Human Cells. Molecular Cell 79, 561–574.e5.

      Cao, J., and Geballe, A.P. (1995). Translational inhibition by a human cytomegalovirus upstream open reading frame despite inefficient utilization of its AUG codon. J Virol 69, 1030–1036.

      Cao, J., and Geballe, A.P. (1996). Coding sequence-dependent ribosomal arrest at termination of translation. Molecular and Cellular Biology 16, 603–608.

      Cao, J., and Geballe, A.P. (1998). Ribosomal release without peptidyl tRNA hydrolysis at translation termination in a eukaryotic system. RNA 4, 181–188.

      Darnell, A.M., Subramaniam, A.R., and O’Shea, E.K. (2018). Translational Control through Differential Ribosome Pausing during Amino Acid Limitation in Mammalian Cells. Molecular Cell 71, 229–243.e11.

      Degnin, C., Schleiss, M., Cao, J., and Geballe, A. (1993). Translational inhibition mediated by a short upstream open reading frame in the human cytomegalovirus gpUL4 (gp48) transcript. Journal of Virology.

      Gaba, A., Wang, H., Fortune, T., and Qu, X. (2020). Smart-ORF: a single-molecule method for accessing ribosome dynamics in both upstream and main open reading frames. Nucleic Acids Research.

      Gamerdinger, M., Kobayashi, K., Wallisch, A., Kreft, S.G., Sailer, C., Schlömer, R., Sachs, N., Jomaa, A., Stengel, F., Ban, N., et al. (2019). Early Scanning of Nascent Polypeptides inside the Ribosomal Tunnel by NAC. Mol Cell 75, 996–1006.e8.

      Han, P., Shichino, Y., Schneider-Poetsch, T., Mito, M., Hashimoto, S., Udagawa, T., Kohno, K., Yoshida, M., Mishima, Y., Inada, T., et al. (2020). Genome-wide Survey of Ribosome Collision. Cell Reports 31, 107610.

      Harris, L.A., Hogg, J.S., Tapia, J.-J., Sekar, J.A.P., Gupta, S., Korsunsky, I., Arora, A., Barua, D., Sheehan, R.P., and Faeder, J.R. (2016). BioNetGen 2.2: advances in rule-based modeling. Bioinformatics 32, 3366–3368.

      Ivanov, I.P., Shin, B.-S., Loughran, G., Tzani, I., Young-Baird, S.K., Cao, C., Atkins, J.F., and Dever, T.E. (2018). Polyamine Control of Translation Elongation Regulates Start Site Selection on the Antizyme Inhibitor mRNA via Ribosome Queuing. Mol Cell 70, 254–264.e6.

      Janzen, D.M., Frolova, L., and Geballe, A.P. (2002). Inhibition of translation termination mediated by an interaction of eukaryotic release factor 1 with a nascent peptidyl-tRNA. Mol Cell Biol 22, 8562–8570.

      Kearse, M.G., and Wilusz, J.E. (2017). Non-AUG translation: a new start for protein synthesis in eukaryotes. Genes Dev 31, 1717–1731.

      Kearse, M.G., Goldman, D.H., Choi, J., Nwaezeapu, C., Liang, D., Green, K.M., Goldstrohm, A.C., Todd, P.K., Green, R., and Wilusz, J.E. (2019). Ribosome queuing enables non-AUG translation to be resistant to multiple protein synthesis inhibitors. Genes Dev 33, 871–885.

      Kozak, M. (1989). Circumstances and mechanisms of inhibition of translation by secondary structure in eucaryotic mRNAs. Mol Cell Biol 9, 5134–5142.

      Lareau, L.F., Hite, D.H., Hogan, G.J., and Brown, P.O. (2014). Distinct stages of the translation elongation cycle revealed by sequencing ribosome-protected mRNA fragments. eLife 3, e01257.

      Manjunath, H., Zhang, H., Rehfeld, F., Han, J., Chang, T.-C., and Mendell, J.T. (2019). Suppression of Ribosomal Pausing by eIF5A Is Necessary to Maintain the Fidelity of Start Codon Selection. Cell Reports 29, 3134–3146.e6.

      Matheisl, S., Berninghausen, O., Becker, T., and Beckmann, R. (2015). Structure of a human translation termination complex. Nucleic Acids Res 43, 8615–8626.

      Morisaki, T., Lyon, K., DeLuca, K.F., DeLuca, J.G., English, B.P., Zhang, Z., Lavis, L.D., Grimm, J.B., Viswanathan, S., Looger, L.L., et al. (2016). Real-time quantification of single RNA translation dynamics in living cells. Science 352, 1425–1429.

      Nilsson, O.B., Hedman, R., Marino, J., Wickles, S., Bischoff, L., Johansson, M., Müller-Lucks, A., Trovato, F., Puglisi, J.D., O’Brien, E.P., et al. (2015). Cotranslational Protein Folding inside the Ribosome Exit Tunnel. Cell Reports 12, 1533–1540.

      Noderer, W.L., Flockhart, R.J., Bhaduri, A., Diaz de Arce, A.J., Zhang, J., Khavari, P.A., and Wang, C.L. (2014). Quantitative analysis of mammalian translation initiation sites by FACS-seq. Mol Syst Biol 10, 748.

      Stern-Ginossar, N., Weisburd, B., Michalski, A., Le, V.T.K., Hein, M.Y., Huang, S.-X., Ma, M., Shen, B., Qian, S.-B., Hengel, H., et al. (2012). Decoding Human Cytomegalovirus. Science 338, 1088–1093.

      Subramaniam, Arvind R., Zid, Brian M., and O’Shea, Erin K. (2014). An Integrated Approach Reveals Regulatory Controls on Bacterial Translation Elongation. Cell 159, 1200–1211.

      Wagner, S., Herrmannová, A., Hronová, V., Gunišová, S., Sen, N.D., Hannan, R.D., Hinnebusch, A.G., Shirokikh, N.E., Preiss, T., and Valášek, L.S. (2020). Selective Translation Complex Profiling Reveals Staged Initiation and Co-translational Assembly of Initiation Factor Complexes. Mol Cell 79, 546–560.e7.

      Wang, H., Sun, L., Gaba, A., and Qu, X. (2020). An in vitro single-molecule assay for eukaryotic cap-dependent translation initiation kinetics. Nucleic Acids Res 48, e6.

      Weber, R., Chung, M.-Y., Keskeny, C., Zinnall, U., Landthaler, M., Valkov, E., Izaurralde, E., and Igreja, C. (2020). 4EHP and GIGYF1/2 Mediate Translation-Coupled Messenger RNA Decay. Cell Reports 33, 108262.

      Wilson, D.N., Arenz, S., and Beckmann, R. (2016). Translation regulation via nascent polypeptide-mediated ribosome stalling. Current Opinion in Structural Biology 37, 123–133.

      Wolin, S.L., and Walter, P. (1988). Ribosome pausing and stacking during translation of a eukaryotic mRNA. EMBO J 7, 3559–3569.

      Wu, B., Eliscovich, C., Yoon, Y.J., and Singer, R.H. (2016). Translation dynamics of single mRNAs in live cells and neurons. Science 352, 1430–1435.

      Wu, C.C.-C., Zinshteyn, B., Wehner, K.A., and Green, R. (2019). High-Resolution Ribosome Profiling Defines Discrete Ribosome Elongation States and Translational Regulation during Cellular Stress. Molecular Cell 73, 959–970.e5.

      Yan, X., Hoek, Tim A., Vale, Ronald D., and Tanenbaum, Marvin E. (2016). Dynamics of Translation of Single mRNA Molecules In Vivo. Cell 165, 976–989.

      Young, D.J., Meydan, S., and Guydosh, N.R. (2021). 40S ribosome profiling reveals distinct roles for Tma20/Tma22 (MCT-1/DENR) and Tma64 (eIF2D) in 40S subunit recycling. Nat Commun 12, 2976.

      Yu, J., Chen, M., Huang, H., Zhu, J., Song, H., Zhu, J., Park, J., and Ji, S.-J. (2018). Dynamic m6A modification regulates local translation of mRNA in axons. Nucleic Acids Research 46, 1412–1423.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this paper, the authors are exploring the uORF regulatory mechanism. They first discussed five general models how uORFs might work to repress and buffering main ORF translation, then they mainly focus on the UL4 uORF2 for the potential mechanism. They use both computer modeling and experimental validation with reporter assay in 293t cell line. Based on their model, and few experimental results when they change the translation initiation rate and/or length of dORF, they propose it may work through 40S dissociation model, since the buffering effect is not uORF length sensitive.

      Major comments:

      • Are the key conclusions convincing?<br /> The modeling for different mechanisms is insightful, but some modeling parameters and experimental validation are not conclusive and validation of few of them can enforce the conclusions.
      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?<br /> Yes, the part about queuing and length sensitive is not convincing to me, it should be modified and reduce the statement strength.
      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.<br /> Yes, please see the major concerns
      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.<br /> They will need to re-think about the modeling, and validation in Figure 5, there are validation experiments that can be done in weeks and in a cost-efficient manner that can enforce the conclusions.
      • Are the data and the methods presented in such a way that they can be reproduced?<br /> Most of them are good
      • Are the experiments adequately replicated and statistical analysis adequate?<br /> Yes

      I have some major concerns about the paper:

      1. It is a bit confusing to me in Figure 2C, the reporter assays, that non-start codon reporter and non-stall reporter has same expression level. In theory, the non-stall reporter still has uORF there, so it should repress main ORF expression, and have lower expression level than the non-start reporter, where there is no uORF, no repression. In other uORFs they tested in Figure 6, the non-stall reporters are lower than non-start reporter. Since data they use to build the model is Figure 2B, and calculate the parameters for the whole paper, I just want to make sure it is making sense. I noticed there is another CTG in frame on the 4th codon, this may be alternative start codon in the non-start reporter to trigger some repression.
      2. All the modeling and prediction the authors do are based on average, but we know translation is very heterogeneous. For each ribosome or each 40S, the kinetics varies a lot, the authors should discuss about this part.
      3. For modeling related with the queuing-mediated model in Figure 1C. they use 30nt as the ribosome length to count the potential queuing to start codon. But 30nt is the 80S protected fragment with specific conformation. The protected fragment for 80S will change based on different status of ribosome conformation or elongation step. More importantly, for queuing, it is 40S, so they may have a different size. Based on previous 40S ribosome profiling (Archer, Stuart K., et al. Nature 535.7613 (2016): 570-574. And other papers), the length can vary from 19nt to very long, so I don't think the 30nt length can be used to model queuing in 40S and length sensitivity in the uORF working mechanism.
      4. For Figure 5B, besides the modeling length part I have mentioned above, when the authors increase the length of uORF, the sequence is also changed, which may introduce other side effect. So, if the authors want to conclude about the queuing part, they should rethink about the length for both modeling and validation, plus control for the sequence they added to increase the length of uORF, for example use different sequence when manipulate the length.

      Minor comments:

      • Specific experimental issues that are easily addressable.<br /> It is unclear how the luciferase assays were analyzed considering the background noise. If the NLuc expression is low, close to the background, then how to extract or normalize the background will influence the expression level, thus fold change for different reporter/condition.
      • Are prior studies referenced appropriately?<br /> yes
      • Are the text and figures clear and accurate?<br /> Mostly good
      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?<br /> Have a main figure about the modeling part.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.<br /> It is an interesting area, using modeling with experiment validation to understand uORF regulation mechanism, the kinetics and interplay between different translation steps, it will help us to understand uORF buffering in stress conditions.<br /> Also bring modeling method with reporter validation to the translation field, will provide clues to the molecular mechanism study, especially in complex situation.
      • Place the work in the context of the existing literature (provide references, where appropriate).<br /> Recent years, there has been a lot of study about small open reading frames, while for uORFs are known to repress translation, the regulatory mechanism is not known yet, there are just different models not validated yet (Young& Wek, 2016). Also, under normal conditions and stress conditions, uORF can play both repressive and stimulative role in main ORF translation (Orr, Mona Wu, et al. NAR 48.3 (2020): 1029-1042.). This paper is the first study to put all the uORF working hypothesis with buffering effect together, they use modeling to explain how under each hypothesis, buffering may happen or not.
      • State what audience might be interested in and influenced by the reported findings.<br /> It will be interesting to people, who study molecular biology, biochemistry for translation regulation, especially uORFs. The modeling people may also find it interesting, how they could adapt modeling to complex biology process and contribute to the understanding.
      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.<br /> I have extensive experience working in the translation regulation field and I feel extremely comfortable to discus all the experimental part including individual reporters as well as genome wide. But I do not consider an expert in the modelling section of this work.
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We appreciate the efforts the two reviewers had invested in reviewing our manuscript. Their constructive comments will help improve the paper overall.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer #1 (Evidence, reproducibility and clarity):

      The main point of the current report is that 6mA is present in DNA of Hydractinia, and is introduced randomly into the genome by DNA polymerases, originating from degradation of maternally provided RNA via nucleotide salvage pathway. The authors observed that 6mA levels are changing over development and peak at 16-cell stage, with a sudden decrease to 'background levels' at 64 cell stage, a stage when zygotic genome gets activated. The 6mA drop is Alkbh1 dependent, since upon K/D of Alkbh1, 6mA levels were significantly higher than in control embryos. Authors also observed that AlkbH1 K/D delays zygotic genome activation (ZGA) to later stages, but without any noticeable consequences for the proper development. To demonstrate that 6mA is not controlled via direct DNA methylation, they show that K/D of two potential DNA methyl transferases N6amt1 and Mettl4 does not have any effect on 6mA levels. Supporting their hypothesis, authors demonstrate high activity and imperfect selectivity towards non-modified nucleotides of salvage pathway during embryo development using EU labeling experiments.<br /> In general, the provided data support their model, however, the paper needs some improvements to include missing information and controls before publication.

      Major comments:<br /> 1. Fig 1A shows a schematic where D3-6mA is added to only QTRAP but not QQQ experiment, usually QQQ methods also require isotopic standards for each component quantified to normalize for ionization differences and provide true quantitative information. Why did authors not use dA isotope? The ionization suppression is more pronounced at high concentrations of the components, which is true for dA in the current set up. How do authors control or at least test this?

      We have limited resources of isotopic-labelled standards. Therefore, we initially used QQQ without these standards to obtain data that covered many time points in development to identify the general pattern and key time points of high and low 6mA. Once the QQQ indicated that the 16-cell stage has the highest 6mA and that this drops to background at the 64-cell stage (and remains so later on), we performed QTRAP with the isotope-labeled standard control only for these two stages. Looking at the data resulting from both techniques, it appears that they essentially revealed the same pattern. Since the main focus of the study is on 16- and 64-cell embryos, we feel that the contribution of performing all stages by QTRAP would be marginal. We have performed control experiments to assess ionization suppression for dA and found that it was insignificant. We will add the corresponding data to the Materials and Methods section.

      Fig S1 show that quantification works well, but were the total DNA amounts comparable to the gDNA amount used in actual samples? If yes, please indicate so.

      Yes, the amounts were the same (2mg). We will change the methods sections accordingly.

      1. In line 68 and in fig 1B, 1C there is a mysterious 'Neg. Ctrl 'sample. It is unclear what was the sample and more interestingly in fig 1B the levels in this sample are 0.015% but in fig 1C it is much below 0.001%. Why there is such a striking difference for the identical sample.

      Negative controls were the same amounts (2 µg) of oligonucleotides without 6mA, DNAse-treated exactly like the samples. Figure 1B shows that QQQ is not sensitive enough to reliably detect 6mA concentrations below 0.02%, incapable to distinguish the background 6mA in the negative control from the level of 6mA in the 64-cell stage and later. Therefore, we utilized D3-6mA labelled QTRAP (Figure 1C) and determined that the level in the 64 cells stage embryos was actually ~0.01%. In the negative control, the amount was considerably lower, around 6 ppm (0.0006%).

      1. As I can see authors measured natural isotopologue of 6mA, however traces of contaminant bacterial DNA originating even from recombinant DNA degradation enzymes also have 6mA, giving background signal. In their LC/MS experiments, did authors check if the 6mA comes truly from the gDNA and not from contaminant during DNA purification and processing before MS?

      Yes, we did. As control for the level of 6mA contamination from the enzymatic digestion (sourced from bacteria), we also performed digestion of the negative control (see also answer to previous comment).

      1. Fig 1D in the legend: authors should indicate that samples were already RNAse treated, and Line 80 in the text mentions a second RNase treatment (fig S1C) to confirm the specificity of the DNA staining.

      The samples were indeed RNase-treated. We will modify the legend and the reference to figure 1D on line 80 accordingly.

      1. In lines 86-87, authors compare the LC/MS and sequencing based quantifications, and say they are consistent. Can authors make a figure analogous to fig 1B but using sequencing data?

      The data are already provided in Figure S1E. However, we used a Venn diagram to denote that these figures were generated by a different type of analysis (SMRT-sequencing as opposed to QTRAP). They are consistent but not identical.

      1. Fig 3B and 3C, controls showing the validity of EU staining, are required, such as RNAse treated sample with a signals disappearing; or control embryos without EU, thus having only background signal.

      Indeed, Fig 3C shows an RNase treated sample in which the EU signal is abolished as expected.

      1. Fig 3D specificity control is missing, control embryos without EdU having only background signal.

      The control is provided in Figure 3B. It shows a sample without EdU (treated with EU) and shows the background signal.

      1. Fig 4A legend: 'rescue solution (see text)'. Please describe in the legend what the solution was. Moreover, I did not find clear explanation in the text either, my only guess was from the materials in methods section, where authors used both shAlkbh1 and Alkbh1 mRNA with silent mutations.

      The reviewer is right, this was indeed the control that was used. We will modify the text to clarify this point.

      1. Fig 4B shows many data points per condition and the legend says EU signals (in triplicate), was these triplicate animals with multiple cells, where EU signal from each cell was plotted as a point? Please specify in the legend.

      Yes, triplicate embryos and each cell used as point. The legend will be adapted.

      1. Lines 169-170 state 'the lack of premature ZGA following N6amt1/Mettl4 knockdown (Figure S7B) indicate a lack of methyl transferase that maintains 6mA through embryogenesis' while an experiment indeed demonstrates that these are not the major players in this process, it does not prove these are not DNA methyl transferases. The absence of evidence is not the evidence of absence. I think authors should at least soften this conclusion.

      We agree and will tone down the relevant statement.

      1. Discussion section describes many experimental data that belong to Results section.

      This is a point also raised by Reviewer #2. We will move these points to the results and expand the discussion.

      1. Fig S8 I think should be a part of the main figure since it is one of the important experiments to prove the high activity and somewhat low selectivity of salvage pathway in the embryos during the critical early stages.

      We had originally left it out to save space. We prefer to leave this decision with the editor.

      1. Fig 5C the model is confusing, authors should improve it.

      It is difficult to describe a complex story using a single static model. Therefore, we will add an animation to the supplemental material to clarify the model.

      1. Fig S8 negative controls showing the specificity of CuAAC staining are missing: control animals/ embryos without EU.

      We will redo these experiments and include appropriate controls.

      1. Authors may find this reference useful: PMID: 32355286.

      We will add this ref.

      1. It is known that in mammals ADAL protein is the one which demethylates m6A nucleotide to clear it from the nucleotide pools and prevent it entering into the salvage pathway (PMID: 29884623). Does Hydractinia Symbiolongicarpus have an ADAL analog? If yes then it would be important to see if knock down/overexpression of this enzyme has any effect on the timing of ZGA. In principle, passively introduced 6mA may be regulatory to proper time the ZGA, and is controlled via an activity of Adal and Alkbh1.

      The gene is present in the Hydractinia genome. We could perform the experiments recommended. We will knock the gene down and look at the effect of this manipulation on ZGA.

      1. Material and methods are missing information:<br /> a. Line 370-371 provide references to the protocols listed or describe the steps.<br /> b. Line 373 standard column based purification protocol, what is it either explain or provide a reference.

      References will be provided.

      Minor points:<br /> Line 79 : 'Fig 1D and S1B', Did authors meant 'Fig1D and S1C'?<br /> Fig 5A Y axis title is missing.<br /> Line 379: 3D1-6mA should be D3-6mA please correct the other appearances as well.<br /> Line 405: terms : dsDNA solutions and standard solutions are confusing please rephrase.<br /> Line 410: Cleaned embryos, what does cleaned mean, be specific.<br /> Line 413: PTx is mentioned, please explain what is it.<br /> Line 415 and line 440 : HCl was washed and embryos were neutralized, I guess it should state : HCl was neutralized and embryos were washed with...'<br /> Line 431: ' before fixed by incubation in PAGA-T..." did authors meant : 'before fixation with PAGA-T...?<br /> Line 435: Permeabilization was done by further washes the fixed embryos with...", did authors meant: Permeabilization was done by an additional wash of the fixed embryos with...?<br /> Line 440: The HCL was washed with what solution?<br /> Line 446: For how long were the PTx washes?<br /> Lines 458-460: the sentence is confusing.<br /> Line 500: 'then used detect' should be 'then used to detect'

      We will adopt all minor points above.

      Reviewer #1 (Significance):

      There are many high profile papers describing the existence of 6mA in gDNA of different organism including insects and mammals. However, there is no proof that it has any biological function. Indeed, recent reports (PMID: 32355286 and 32203414) indicate that in mammalian cells, 6mA is indeed primarily incorporated by DNA polymerases and originates from a salvage pathway. The present report is the first in vivo evidence that confirms this to be the case more generally and, importantly, demonstrates a 6mA effect on ZGA. Hence, this is an important and timely report, which will be interesting to the field, as well as a broad audience to clarify the role of 6mA and the mechanism whereby it is introduced into gDNA.<br /> My expertise: Biochemistry and biology of DNA and RNA modifications, including 6mA. Fair expertise: bioinformatics analysis.

      Reviewer #2 (Evidence, reproducibility and clarity):

      The manuscript reports developmental dynamics of DNA 6mA in the cnidarian Hydractinia symbiolongicarpus. The authors describe an event of a seemingly random accumulation of this DNA modification in 16-cell stage embryos of Hydractinia symbiolongicarpus followed by an apparent clearance of 6mA by the 64-cell stage. Interestingly, the depletion of cnidarian orthologue of the putative 6mA 'demethylase', Alkbh1, results in delay in zygotic transcription accompanied by high levels of DNA 6mA in 64-cell stage cnidarian embryos. The authors suggest that the 6mA they observe originates from random misincorporation of recycled degraded m6A-marked ribo-nucleotides during early cnidarian embryogenesis.<br /> Overall, most of the experiments are performed at high technical level and the paper is generally nicely written. Despite this, in my opinion, the manuscript would benefit from incorporation of several addition controls and answering a number of points on the description/presenation of the data.<br /> Major comments:

      1. In the present version of the manuscript, the authors demonstrate the negative correlation between the presence of 6mA in the genome of cnidarian embryos and transcription. Although, the depletion of Alkbh1 leads to the delay in ZGA, strictly speaking, this effect may be independent of the catalytic function of Alkbh1. Therefore, to make a statement that m6A "random incorporation into the early embryonic genome inhibits transcription" the authors should use a catalytically inactive form of this enzyme as a control in the corresponding experiments and/or (ideally) perform in vitro transcription assays using 6mA-containing substrates.

      We could perform shRNA-mediated Alkbh1 KD and try rescue ZGA by co-injecting a catalytically-inactive Alkbh1 mRNA.

      The suggested in vitro experiment would be inconclusive for two reasons: first, Hydractinia polymerase may respond differently to 6mA; second, 6mA-mediated transcription inhibition could be indirect, requiring the in vivo context. We would like to add that transcription inhibition of 6mA has been demonstrated in vitro using yeast DNA polymerase as cited in the paper.

      1. Despite several experiments suggesting that random incorporation of recycled ribonucleotides occurs in cnidarian embryos, the source of 6mA in their DNA seems currently unclear. Would it be possible to directly test the author's hypothesis by comparing the levels of 6mA upon maternal (and possibly zygotic) depletion of the cnidarian orthologue of RNA m6A methyltransferase Mettl3 in cnidarian embryos? Alternatively, the authors could incubate the embryos in medium supplemented with labeled ribo-m6A followed by checking the levels of DNA 6mA in the embryonic DNA?

      We show that maternal mRNAs are already methylated in the early embryo (Figure 5). Therefore, it would indeed make sense to ablate Mettl3 from the maternal tissue while maternal mRNAs are methylated. However, in the absence of a conditional knockout technique in Hydractinia, this would require generation of CRISPR-Cas9 mutants that would likely die early in their development, long before reaching sexual maturity.

      Instead, we are happy to perform the other experiment suggested by the reviewer to directly demonstrate m6A to 6mA transition.

      Minor comments:<br /> 1. It would be nice to complement Fig. 4, 5, and S7 with immunostaining of the corresponding embryos for 6mA.

      6mA immunostaning is not compatible with EU labeling because, first, they require different types of fixation (PAGA-T vs formaldehyde); second, immunostaining requires RNase treatment to remove m6A which would also remove the EU signal.

      1. The current Discussion contains references for several figures with experimental results. I suggest separating these experimental data from the Discussion. The authors should, in my opinion, make an additional Results chapter and, if possible, expand the Discussion section (that is currently minimal) speculating on significance of their results for different biological systems.

      This has also been requested by Reviewer #1. We will follow the reviewer's recommendation.

      1. The present Title reads like a clear overstatement (at least currently, please see major comments above). The Title should also reference the organism where the observations have been made.

      Following the revision, we believe that both random incorporation of 6mA and a delay in zygotic transcription will be well supported by our data. We will add the organism's name to the title as suggested.

      Reviewer #2 (Significance):

      The presence and significance of DNA 6mA in animal genomes is a very interesting and highly controversial topic. Although a number of studies suggest that relatively high levels of this DNA modification occur in multicellular eukaryotes in different biological/functional contexts, other reports challenged these observations attributing them to different experimental artifacts. In this context, the current paper that provides high quality novel experimental data on the developmental dynamics of DNA 6mA in cnidarian is extremely interesting and timely. Moreover, the author's results and the hypotheses on the function/origin of 6mA in cnidarian embryogenesis may provide a conceptual framework for the interpretation of other 6mA/m6A-related studies performed on different experimental models. Thus, this manuscript will definitely be of interest for a wide range of researchers working in the fields of epigenetics, cancer biology and developmental biology.<br /> I strongly believe that this is an interesting and important study that definitely deserves to be published in a high impact journal.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      Reviewer #2 suggested four experiments, three of which are either impossible in our system or expected to reveal insignificant information. First, the reviewer suggests ablating Mettl3 from the maternal tissue. While being a good idea in principle, there is no conditional ablation technique available for Hydractinia. Generating CRISPR-Cas9 mutants would likely result in embryonic lethality, long before sexual maturation has been reached.

      Second, the reviewer proposed to perform in vitro experiments with m6A-containing substrates. These experiments are unlikely to reveal useful data since the Hydractinia polymerase may respond differently to methylated adenine than commercially available polymerases. Also, transcription inhibition may be indirect, depending on the in vivo context that cannot be mimicked in vitro.

      Finally, the reviewer suggested expressing a catalytically-dead Alkbh1 in the background of endogenous Alkbh1 knockdown to demonstrate that its function depends on the enzymatic activity to remove 6mA from the genome. While we could perform the experiment (see our reply above), the information emanating from it would arguably be outside the scope of this study.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      The newly identified azyx-1 ORF was named peu-1 in the initial submission of this manuscript, a name that was under consideration with WormBase, who supervise nomenclature of C. elegans genes. In consultation with WormBase, the locus was named azyx-1 instead (the final decision being “azyx-1 will be attributed to F42G4.11. It will be released in WS287 at the beginning of 2023”). We updated this nomenclature in our submission files, including in reviewer comments pasted below. Please note that other than this, no changes whatsoever were made to the reviewer comments.

      2. Description of the planned revisions

      REV #3: Specific thoughts for consideration:

      Figure 5, Moderate is really minor/moderate with other metrics, and severe is definitely moderate with other metrics. Thus, I'm not sure if normal vs. moderate is needed. This really is a minor point as it doesn't impact results/overall story/importance.

      This was also pointed out by reviewer #1. We will rename classification more mildly so.

      REV #1 Fig. 5 Even the 'severe' muscle disruption is quite mild (say, in comparison to loss of talin). Perhaps rephrase these categories? The moderate and severe categories also do not look different to me. Show what the muscle cells look like in zyx-1 deletion and overexpression animals. Is there a way to use quantitative imaging to score these? Can azyx-1 phenotypes be rescued or enhanced by expression (or RNAi) of zyxin in the muscle? Also, clarify what age animals are being tested in the muscle and burrowing assay.

      We agree and will rename the classes in milder terms. Qualitative scoring (which was done blinded) is the standard in the field as was done according to Dhondt et al. (2021 Dis Model Mech). When tested for muscle integrity and burrowing capacity, animals were day 1 adults. This is mentioned in the Methods section of the current manuscript and will also be included in the captions of the revised figures.

      REV #2: I am not convinced by the data presented in Figure 5. There does not seem to be much to distinguish the five genotypes, but I concede that I am not used to looking at this type of data. But why was the muscle phenotype not also examined in the azyx-1 rescue lines?

      Because other reviewers that are familiar with these data point out that the observed differences of panels A-B are indeed milder that what is usually seen, we will rename classifications in the manuscript (see responses above). Because the azyx-1 deletion mutant does not differ from controls in the muscle phenotype, there is no phenotype to rescue for this readout, and no rescue strains were generated.

      We are not sure what the reviewer may struggle with in (assumedly) panel C (~‘to distinguish the five genotypes’). The positive control (zyx-1) behaves as expected in the burrowing assay, with our own mutants within that range, also as expected. All data were scored blinded to avoid any bias and statistical analysis supports the interpretations, all granting confidence to the observed differences. However, because reviewer#3 also would prefer another representation of the data shown in this panel (see below), we will provide an updated panel representation in the revised manuscript.

      REV #3: Figure 5C- Hard to read. Would displaying lines/tragectories make it easier to understand? Would displaying as violin plots for each timepoint/condition make it easier to visualize? Basically in black and white and in color this is hard to visually process.

      We will work on another representation for the revised manuscript, since reviewer2 also seemed to struggle with this panel representation.

      REV #1: Fig. S2 Match font sizes on Y-axes. Also, indicate any statistical differences and statistics used.

      Figure adjustments will be implemented in the revised manuscript as requested.

      REV#1: Fig. S3 C, indicate any statistical differences and statistics used.

      Figure adjustments will be implemented in the revised manuscript as requested.

      REV #2: I am not convinced by the "overexpression" experiments. These are not well controlled, since no evidence is presented that AZYX-1 is being overexpressed in these lines. Also, since we know that extrachromosomal transgenic lines are highly variable, one would need to test the effect of several independent lines to ensure that the effects that the authors observe are indeed associated with AZYX-1 overexpression and not simply an idiosyncratic effect of the genetic background of a given strain. Finally, there does not seem to be an obvious mechanism by which overexpression of AZYX-1 can impact ZYX-1 function. That doesn't rule out an effect, but based on the data as it is, it is premature to propose such a mechanism. The authors need to show that multiple overexpression lines do reproducibly overexpress AZYX-1 and that this results in reproducible effects of zyx-1 phenotypes.

      The extrachromosomal strains are indeed variable, but because the background is wild type (in contrast to a deletion mutant background for rescue strains), an overdose of the target provided is expected. As requested in the cross-consultation reviewer communication, we will include quantitative data in our revised manuscript that shows that the used strains (LSC1950, LSC1960, LSC2000) indeed are overexpressors.

      REV #2: The data presented in Figure 4F needs to be quantified using the same format as was presented in Figure 4B.

      Due to the different genetic background of the strains, this is not possible in the exact same way (the red signal of LSC1998 & LSC1999 is not unique to zyxin). We understand that in essence, the reviewer would like us to include a more quantitative representation of these data, and will update the figure accordingly.

      REV #2: What is the difference between the overexpression transgenic lines and the "rescuing" transgenic lines? In the Materials and Methods, the same concentration of plasmid was used in injections - so these likely give the same approximate level of transgenic expression.

      The genetic background: a rescue line adds wt DNA back to a mutant background, while in an OE strain it is added into a wt background. While this can already be derived from the genotype details in Supplemental Table S1, we apologize for not specifying this in the methods section, as it is common practice in the field. These specifications will be added to the revised manuscript.

      REV #2: I am not clear what features are being used to characterise the myofibril structures into the three categories. Can the authors annotate the images to indicate the diagnostic features?

      The reviewer is correct that manual classification is rather poorly defined in general, which is why it is scored blinded (here as per Cothren et al., 2018 Bio Protoc). We adhered to the reference images by Dhondt et al. (2021, Dis Mod Mech) with visual assessment based on how tightly organized (~parallel) myofilaments are organized, assessing overall increases of bends or breaks in individual myofibers as leading to a less aligned pattern (cf. Fig. 1 of Dhondt et al.). We will add this information more explicitly to the Methods section of the revised manuscript.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      REV #1: Fig. 4 would be better if the control (A) and azyx-1OE (B) worms were more similar in age and size

      The panels of this figure were not to the exact same scale, we apologize if the reviewer found this confusing. We have rescaled the panels so that this is less confusing. The animals are all day 1 adults.

      REV #1: Abstract: Clarify what is meant by 'putative syntenic conservation' or rephrase, simply stating that the existence of an ORF overlapping with the 5' region of zyxin is conserved

      This has been rephrased according to request.

      REV #1: Line 24: Clarify these are synthetic phenotypes (not caused by loss of zyx-1/azyx-1 alone). Loss of zyx-1 alone results in very mild phenotypes.

      While the original sentence already pointed this out, we rephrased the text to make clear that these observations require the dystrophic mutant background.

      REV #1: Line 28: Start new paragraph

      The new paragraph was started a sentence earlier, according to rev#2 request.

      REV #1: Line 31: Not clear what is meant by 'post-transcriptional regulation can be further propagated'- maybe reword to 'alternative and overlapping open reading frames (ORFs) arising from polycistronic mRNA can regulate translation' or something simpler like that.

      This has been rephrased according to request.

      REV #1: Line 56-57: Is this because most C. elegans transcripts start with the splice leader SL1 or SL2 rather than the adjacent 5' sequence? Is that relevant for zyx-1? Recommend commenting briefly on this.

      We did not look into this for all possible u(o)ORFs in C. elegans, which also is not the focus of the manuscript, so we cannot make general statements. As part of the annotation procedure of azyx‑1, WormBase verified that indeed several pieces of evidence, including available phyloCSF data for exon 1, SL1s, RNASeq and Nanopore data, all support its annotation, as well as its translation from the zyx-1 long transcripts (albeit with different start and in different reading frame).

      REV #1: Line 78: Delete the word 'other'

      Done

      REV #1: Line 122: zyx-1

      Done

      REV #1: Line 137: 'lead' should be 'led'

      Done

      REV #1: Line 158: rephrase 'only the long ones' to indicate which isoforms more precisely

      Done (these are a/e, cf. Luo et al. 2014, Development)

      REV #1: Line 195: Rephrase. Unclear what is meant by 'highlights the evasiveness of non-canonical ORFs from functional annotation'

      Done; this was rephrased to “This exemplifies how non-canonical ORFs can escape functional annotation, …”.

      REV #1: Various locations: I think it will be more clear to the reader to consistently refer to the burrowing assay as 'burrowing assay' rather than chemotaxis. I recommend adding a brief description of the burrowing assay to the results section.

      Wording has been updated, we can provide a short context sentence to the results section of the revised manuscript.

      REV #2: I'm not sure how to interpret the significance of the u/ouORFs across short and large phylogenetic distances. One would presume that there might not be primary amino acid conservation if the regulation simply takes by interference with ribosome scanning and translocation. Here some statistical analysis would help with assessing the significance of these observations. How unusual is it to find u/uoORFs in the 5' UTRs of gene encoding zyxin family members versus in general for the species analysed?

      This is indeed the very question we are asking in the manuscript, and there is a clear reason why we refrain from making significance statements. At the moment, all relevant available metadata are used for the analysis in the manuscript, leading to the communication of the synteny-related findings as they are currently presented. This is due to the dependency on translatomics data to find credible u(o)ORFs, and there aren’t very many translatomics datasets available, only for a limited set of species so far. Our manuscript contains all relevant OpenProt data, which are derived from only 9 animal species so far. As shown in Table S4, 14 zyxin orthologs belonging to 7 species have associated u(o)ORFS, for two species only overlapping ORFs are present in the database. While more and more datasets will undoubtedly become available in the next years, the findings in the manuscript are as complete as currently possible: we do find evidence of u(o)ORFs associated with zyxin orthologs in these species, some of which are evolutionarily distantly related to C. elegans.

      REV #2: The authors state that there is evidence for synteny and coding region conservation. The data supporting this assertion is not well presented. Presentation and analysis of multiple sequence alignments of the putative homologues involved would strengthen the assertion of synteny considerably.

      We apologize if the reviewer misunderstood: we discuss likely syntenic conservation, not coding region conservation. The latter is not mentioned in our manuscript, and in fact not convincing indeed. This is not surprising given the bigger sequence diversity observed at the N terminus of zyxins and the partial overlap of these coding sequences, and in line with observations of several others in the RiboSeq community that many identified uORFs are conserved between orthologous genes, but poorly conserved at the amino acid level (e.g. community-driven communication by Mudge et al., BioRxiv 2021 and references therein).

      REV #2: The authors are oddly coy about the molecular details of the 27 bp deletion used to study the loss of azyx-1 function. In the absence of these details, it is not possible to assess the validity of these experiments. We need to be given the full molecular details of the allele - precisely which nucleotides are deleted? And how do they affect the coding regions of zyx-1 and azyx-1?

      I am also confused about why the authors made a deletion allele rather than mutating the AUG of AZYX-1? This would be a cleaner experiment to interpret. Based on the data presented, there are two possible interpretations in addition to the one suggested by the authors: 1) the 27 bp deletion impacts zyx-1 expression due to its impact on the zyx-1 coding region (the coding regions of azyx-1 and zyx-1 overlap); 2) the deletion mutation deletes critical transcriptional control elements. A simpler mutation of the azyx-1 AUG via CRISPR might allow them to rule out the possibility that they have simply compromised a transcriptional control element or damaged the coding region of ZYX-1.

      As mentioned above and as will be included more clearly so in the revised manuscript: the deletion is 182-155bp (27bp) upstream of the zyx-1a start site. This was a mutant that could easily be generated via CRISPR, so we proceeded with this one. This edit rules out option1 (there is no change of the zyxin coding region), but (as also considered but addressed differently in the manuscript; see below) retains alternative interpretation 2. There are no regulatory regions or transcription factor binding sites known for the (a)zyx-1 locus (verified in current WormBase version WS285), but that does certainly not fully rule out the possibility either. Rather than creating a series of azyx-1 mutants, be they SNP or small deletion mutants, that would suffer from the exact same duality in possible interpretation, we chose to combine the deletion mutant with rescue and overexpression strains. Because these latter strains do not affect the endogenous zyxin regulatory region, they add far more credibility to the interpretation, than alternative mutants in the azyx-1/zyx-1 locus would.

      REV#2. The narrative flow of the introduction could be improved by the judicious use of paragraphs. Line 12, for instance is a clear paragraph break, as is line 24.

      Done

      REV #3: Specific thoughts for consideration:<br /> 3) Could more be said about overlapping genes/regulation in humans? Again, not critical but this is such a great piece of work that it would be useful to guide human subjects researchers as to how to best further your work.

      It is unclear whether the reviewer would like to see an extended introduction and/or discussion. We tried to meet this request without drifting too much from the focus of our current communication by adding the following to the introduction (lines 41-47 of the current draft): “From a more human-centred future perspective, uORFs are a rather unexplored niche for translational research: with a predicted prevalence in over 50% of human genes and first examples regulating translation of disease-associated genes already emerging (Lee et al. 2021; Schulz et al. 2018), the field is bound to not only lead to more fundamental, but also application-oriented insights. Keeping this broader context in mind, we here focus on more fundamental principles of uORFs in a model organism context.”.

      4. Description of analyses that authors prefer not to carry out

      REV#1: Does azyx-1 have zyx-1-independent functions or other regulatory targets?

      This is an interesting question that is not yet addressed. While this is possible, it is beyond the scope of our current communication. Since the reviewer does not request anything concrete, we would prefer to leave this for follow-up research. While this notion is included in the manuscript, we are happy to more explicitly address this question in the discussion as well.

      REV#1: Do the burrowing assay results reflect a neuronal or a muscle function for AZYX-1? Or both?

      Our manuscript indeed does not yet delve into tissue-specific actions of this newly discovered ORF. While interesting, and in line with reviewer #3’s remark, this would be valuable for follow-up research, but is beyond the scope of our current communication. We will make sure the concept is clearly mentioned in the discussion of our findings.

      REV #3: Specific thoughts for consideration:

      Could more be done/said about neruo vs, muscular effects of azyx-1 and zyx-1. I appreciate this is beyond the scope of the present manuscript and therefore does not require response if you don't have data or it makes telling the story you want to tell more difficult.

      We agree with the reviewer that spatially resolving some of these observations would be a next interesting step, which indeed is beyond the scope of our current communication.

      REV#1: Fig. 2A very faint, increase brightness/contrast?

      We did not adjust brightness or contrast for any of the figures, an no such requests were made by other reviewers. We greatly prefer presenting the data as unedited as possible, and would like to request the journal’s preference for action here.

      5. Remaining reviewer comments & responses not highlighted above

      CROSS-CONSULTATION COMMENTS<br /> _The following is a conversation among the three referees:<br /> _REFEREE #2: I appear to be the dissenting voice in terms of concern about the details of the 27 bp deletion and the "overexpression" constructs. I would be interested to know your opinions regarding my comments on these issues.<br /> REFEREE #1: I think adding the details of the 27 bp deletion is a reasonable request. It is probably not possible to disambiguate entirely the two effects of the deletion, and changing the start codon may result in an alternate start with other downstream effects. I think just explaining it more fully in the methods would satisfy my concerns.<br /> REFEREE #2: What about the issues with the overexpresssion? In my experience, presence of multicopy transgenes on an extrachromosomal arrays might not lead to over expression of the gene involved? This needs to be verified in some way.<br /> REFEREE #1: You are right about that. If the construct is tagged in some way they could try a western. I would recommend they integrate the transgenes, or just show results from several lines as you suggest.<br /> REFEREE #3: I agree the 27 bp deletion and over expression are reasonable technical issues. However, I view this a techical details vs. critical details for the novel regulatory mechanism. The point about ability to judge conservation is also reasonable but until the theory is firmly out there it is hard to test the conservation and broader applicability to other genes/proteins. Thus, while asking for additional information on these issues is reasonable I do not see the inability to address beyond highlighting as limitation in the text as critical to the overall validity of the work.<br /> REFEREE #2: I disagree with Reviewer #3, without knowing the details of 27 bp deletion the most reasonable interpretation of the data is simply that it is a loss-of-function allele of zyx-1. This goes beyond "technical" - at present there is no unequivocal evidence that azyx-1 has any functional significance beyond that it is expressed as a peptide.<br /> REFEREE #3: I've been back through the manuscript. They have sequenced the deletion and therefore should be able to provide that information to satisfy the issue(s). For the over expression, short of silencing my experience is that they do over express and when you have multiple lines some express more than others (and some silence more than others). If you want evidence that the peptide is over expressed ask them to quantify via mass spec if it isn't tagged and they can't do a Western. Clearly they have work leading expertise in quantitative mass spec proteomics in C. elegans and should be able to do that. Generally speaking, rescue of a deletion is a pretty good sign that the expression is working though (and is an accepted standard).

      We apologize if this was not clear from the manuscript, and will clearly include the details in the Methods section: the deletion is 182-155bp (27bp) upstream of the zyx-1a start site, at AT|G+26|TTC. This was confirmed by sequencing; the oligos used for this are listed in table S3 of the manuscript. We address the confusion of rescue and overexpression above, in response to reviewer #2 (who echoes this confusion here).

      Reviewer #1 (Evidence, reproducibility and clarity):

      **This is a very interesting paper about a gene regulatory mechanism in a type of poly-cistronic mRNA in which alternate starts/open reading frames lead to production of two different proteins from the same locus. AZYX-1 is a predicted 166 aa protein, translated from the 5'UTR of zyx-1. Two isoforms are expressed from the 5' UTR and coding region of zyx-1. The presence of overlapping transcripts with zyxin orthologs appears to be conserved in other animals. The authors provide spectroscopic evidence AZYX-1 is indeed translated, and show AZYX-1 can regulate zyx-1 expression. Intriguingly, it seems azyx-1 inhibits zyx-1 expression in cis (deletion of azyx-1 increases ZYX-1 peptides), but AZYX-1 promotes zyx-1 expression in trans (overexpression of AZYX-1 increases ZYX-1 expression).

      Reviewer #1 (Significance):

      Nature and significance of the advance: This is a very interesting paper about a gene regulatory mechanism in a type of poly-cistronic mRNA encoding azyx-1 and zyx-1. Intriguingly, it seems azyx-1 inhibits zyx-1 expression in cis (deletion of azyx-1 increases ZYX-1 peptides), but AZYX-1 promotes zyx-1 expression in trans (overexpression of AZYX-1 increases ZYX-1 expression).

      Compare to existing published knowledge: This is the first study of its type on zyx-1.

      Audience: Those interested in gene regulatory mechanisms and in zyxin.

      My expertise: C. elegans cytoskeleton, cell migration, acto-myosin contractility.

      Reviewer #2 (Evidence, reproducibility and clarity):

      **Summary:<br /> The authors build on previous work defining upstream and upstream-overlapping open reading frames (uORF and uoORFs, respectively) by focussing on a specific locus azyx-1, which the authors propose influences the expression of the gene encoding the sole zyxin family in C. elegans, zyx-1. They present evidence suggestive of u/uoORFs being a common feature of zyxin family genes in other animals, hinting that perhaps this is a conserved mechanism of gene expression regulation for these genes. In which case, studies in C. elegans would be valuable to elucidate the mechanism involved.<br /> Using a fluorescent reporter strategy, they show that azyx-1 is expressed in the same tissues as zyx-1, which is to be expected since their share the same transcriptional control elements.<br /> They also characterise the peptide steady state levels of both ZYX-1 and AZYX-1 isoforms, suggesting that while overall ZYX-1 levels decline with age, those for AZYX-1 are generally maintained. The significance of these observations was not immediately obvious to me - a priori it is difficult to assess what relative wild type steady-state levels one might expect if AZYX-1 translation impacted ZYX-1 expression.<br /> The authors propose that expression of AZYX-1 leads to inhibition of ZYX-1 translation through the standard model by which u/uoORFs impact translation of downstream ORFs. To test this, they generated a 27 bp deletion "at the beginning of the azyx-1 ORF". This deletion clearly correlated with a reduction in ZYX-1 expression.<br /> Finally, the authors generated lines designed to overexpress AZYX-1, testing the hypothesis that AZYX-1 might influence ZYX-1 in trans. Though here, it is not obvious by what mechanism this might operate, and the effect-sizes involved are modest.

      Reviewer #2 (Significance):

      The authors propose an interesting interaction between an important regulator of cellular behaviour (zyxin) and the u/uoORF that potentially regulates its expression - if validated by further experimentation, this would add to the growing evidence for the importance of the 5' UTR as a source of gene regulatory activity. Such regulation is well described in yeast, but there are fewer examples in animals, particularly in genetically tractable systems such as C. elegans. The work would primarily be of interest to researchers interested in understanding the spectrum of such activity in C. elegans. My own area of expertise, RNA-splicing and the post-transcriptional regulation of C. elegans gene expression, is not directly related to the research presented in the manuscript, but I am familiar with the general concepts and developments involved.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary:<br /> The authors find that azyx-1 is a non-cononical gene with overlapping genomic localization to the gene zyx-1 in C. elegans. The authors also find preliminary evidence that similar genes with overlapping localization to zyxin genes exist in other species. The authors provide evidence for the tissue specific distribution of azyx-1 expression. The authors further provide evidence for azyx-1 and zyx-1 expression with age. Importantly, these data demonstrate differences in azyx-1 and zyx-1 protein products biological importance/relevance as they display differences with age. The authors provide evidence that azyx-1 expression influences zyx-1 expression in multiple ways. Lastly, the authors demonstrate that azyx-1 expression influences muscle structure and neuromuscular function. The authors use a combination of bioinformatic, protein biochemistry, genetic/transgenic, histologic, and physiologic methods to make these points. With regards to methods, the range/breadth is impressive and appropriate. In many ways the manuscript it is a tour de force in modern molecular biology with a focus on translational medicine. With regards to species, the in vivo experiments are solely C. elegans but the computational data include Fly, Bull, and Mouse.

      The key conclusions are convincing. There are no major claims that require qualification as preliminary or speculative. No additional experiments are essential to support the claims of the paper. The data and methods are presented in such a way that they can be reproduced. The experiments are adequately replicated and the statistical analysis is adequate.

      Prior studies are references appropriately. The text and figures are mostly clear and accurate.

      We would like to thank the reviewer for their appreciation of our efforts and research approach.

      Reviewer #3 (Significance):

      **Conceptually this is a massive/ground breaking piece of work. Essentially, the authors are demonstrating a novel mechanism of regulation of gene/protein expression that, really, hasn't been reported before. What is particularly notable is that it appears, unsurprisingly, as correctly stated by the authors, to be evolutionarily conserved and not well reported in the literature. As with many classical molecular biology papers, and the more recent (e.g. RNAi, lncRNA) genetic papers, this manuscript hold the promise of transforming biology/medicine. The range of methods employed and the linking of molecular biology to pathophysiology was impressive. The audience that will be interested in this work includes: geneticists, proteomics researchers, evolutionary researchers, molecular biologists, physiologists, ageing researchers, muscle researchers, and muscle disease researchers. Thus, the interested audience is broad. My field of expertise with regards to this manuscript is: C. elegans, Mass Spec, Proteomics, genomic regulation, genetics, transgenics, histology, muscle, and physiology. There are no parts of this manuscript that I do not feel I have insufficient expertise to evaluate. I congratulate the authors on a highly significant, cross disciplinary, manuscript, that should impact multiple sub-areas of biology.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors build on previous work defining upstream and upstream-overlapping open reading frames (uORF and uoORFs, respectively) by focussing on a specific locus peu-1, which the authors propose influences the expression of the gene encoding the sole zyxin family in C. elegans, zyx-1. They present evidence suggestive of u/uoORFs being a common feature of zyxin family genes in other animals, hinting that perhaps this is a conserved mechanism of gene expression regulation for these genes. In which case, studies in C. elegans would be valuable to elucidate the mechanism involved.<br /> Using a fluorescent reporter strategy, they show that peu-1 is expressed in the same tissues as zyx-1, which is to be expected since their share the same transcriptional control elements.<br /> They also characterise the peptide steady state levels of both ZYX-1 and PEU-1 isoforms, suggesting that while overall ZYX-1 levels decline with age, those for PEU-1 are generally maintained. The significance of these observations was not immediately obvious to me - a priori it is difficult to assess what relative wild type steady-state levels one might expect if PEU-1 translation impacted ZYX-1 expression.<br /> The authors propose that expression of PEU-1 leads to inhibition of ZYX-1 translation through the standard model by which u/uoORFs impact translation of downstream ORFs. To test this, they generated a 27 bp deletion "at the beginning of the peu-1 ORF". This deletion clearly correlated with a reduction in ZYX-1 expression.<br /> Finally, the authors generated lines designed to overexpress PEU-1, testing the hypothesis that PEU-1 might influence ZYX-1 in trans. Though here, it is not obvious by what mechanism this might operate, and the effect-sizes involved are modest.

      Major comments:

      1. I'm not sure how to interpret the significance of the u/ouORFs across short and large phylogenetic distances. One would presume that there might not be primary amino acid conservation if the regulation simply takes by interference with ribosome scanning and translocation. Here some statistical analysis would help with assessing the significance of these observations. How unusual is it to find u/uoORFs in the 5' UTRs of gene encoding zyxin family members versus in general for the species analysed?
      2. The authors state that there is evidence for synteny and coding region conservation. The data supporting this assertion is not well presented. Presentation and analysis of multiple sequence alignments of the putative homologues involved would strengthen the assertion of synteny considerably.
      3. The authors are oddly coy about the molecular details of the 27 bp deletion used to study the loss of peu-1 function. In the absence of these details, it is not possible to assess the validity of these experiments. We need to be given the full molecular details of the allele - precisely which nucleotides are deleted? And how do they affect the coding regions of zyx-1 and peu-1?<br /> I am also confused about why the authors made a deletion allele rather than mutating the AUG of PEU-1? This would be a cleaner experiment to interpret. Based on the data presented, there are two possible interpretations in addition to the one suggested by the authors: 1) the 27 bp deletion impacts zyx-1 expression due to its impact on the zyx-1 coding region (the coding regions of peu-1 and zyx-1 overlap); 2) the deletion mutation deletes critical transcriptional control elements. A simpler mutation of the peu-1 AUG via CRISPR might allow them to rule out the possibility that they have simply compromised a transcriptional control element or damaged the coding region of ZYX-1.
      4. I am not convinced by the "overexpression" experiments. These are not well controlled, since no evidence is presented that PEU-1 is being overexpressed in these lines. Also, since we know that extrachromosomal transgenic lines are highly variable, one would need to test the effect of several independent lines to ensure that the effects that the authors observe are indeed associated with PEU-1 overexpression and not simply an idiosyncratic effect of the genetic background of a given strain. Finally, there does not seem to be an obvious mechanism by which overexpression of PEU-1 can impact ZYX-1 function. That doesn't rule out an effect, but based on the data as it is, it is premature to propose such a mechanism. The authors need to show that multiple overexpression lines do reproducibly overexpress PEU-1 and that this results in reproducible effects of zyx-1 phenotypes.
      5. I am not convinced by the data presented in Figure 5. There does not seem to be much to distinguish the five genotypes, but I concede that I am not used to looking at this type of data. But why was the muscle phenotype not also examined in the peu-1 rescue lines?

      Minor comments:

      1. The narrative flow of the introduction could be improved by the judicious use of paragraphs. Line 12, for instance is a clear paragraph break, as is line 24.
      2. The data presented in Figure 4F needs to be quantified using the same format as was presented in Figure 4B.
      3. I am not clear what features are being used to characterise the myofibril structures into the three categories. Can the authors annotate the images to indicate the diagnostic features?
      4. What is the difference between the overexpression transgenic lines and the "rescuing" transgenic lines? In the Materials and Methods, the same concentration of plasmid was used in injections - so these likely give the same approximate level of transgenic expression.

      Referees cross-commenting

      The following is a conversation among the three referees:

      REFEREE #2: I appear to be the dissenting voice in terms of concern about the details of the 27 bp deletion and the "overexpression" constructs. I would be interested to know your opinions regarding my comments on these issues.

      REFEREE #1: I think adding the details of the 27 bp deletion is a reasonable request. It is probably not possible to disambiguate entirely the two effects of the deletion, and changing the start codon may result in an alternate start with other downstream effects. I think just explaining it more fully in the methods would satisfy my concerns.

      REFEREE #2: What about the issues with the overexpresssion? In my experience, presence of multicopy transgenes on an extrachromosomal arrays might not lead to over expression of the gene involved? This needs to be verified in some way.

      REFEREE #1: You are right about that. If the construct is tagged in some way they could try a western. I would recommend they integrate the transgenes, or just show results from several lines as you suggest.

      REFEREE #3: I agree the 27 bp deletion and over expression are reasonable technical issues. However, I view this a techical details vs. critical details for the novel regulatory mechanism. The point about ability to judge conservation is also reasonable but until the theory is firmly out there it is hard to test the conservation and broader applicability to other genes/proteins. Thus, while asking for additional information on these issues is reasonable I do not see the inability to address beyond highlighting as limitation in the text as critical to the overall validity of the work.

      REFEREE #2: I disagree with Reviewer #3, without knowing the details of 27 bp deletion the most reasonable interpretation of the data is simply that it is a loss-of-function allele of zyx-1. This goes beyond "technical" - at present there is no unequivocal evidence that peu-1 has any functional significance beyond that it is expressed as a peptide.

      REFEREE #3: I've been back through the manuscript. They have sequenced the deletion and therefore should be able to provide that information to satisfy the issue(s). For the over expression, short of silencing my experience is that they do over express and when you have multiple lines some express more than others (and some silence more than others). If you want evidence that the peptide is over expressed ask them to quantify via mass spec if it isn't tagged and they can't do a Western. Clearly they have work leading expertise in quantitative mass spec proteomics in C. elegans and should be able to do that. Generally speaking, rescue of a deletion is a pretty good sign that the expression is working though (and is an accepted standard).

      Significance

      The authors propose an interesting interaction between an important regulator of cellular behaviour (zyxin) and the u/uoORF that potentially regulates its expression - if validated by further experimentation, this would add to the growing evidence for the importance of the 5' UTR as a source of gene regulatory activity. Such regulation is well described in yeast, but there are fewer examples in animals, particularly in genetically tractable systems such as C. elegans. The work would primarily be of interest to researchers interested in understanding the spectrum of such activity in C. elegans. My own area of expertise, RNA-splicing and the post-transcriptional regulation of C. elegans gene expression, is not directly related to the research presented in the manuscript, but I am familiar with the general concepts and developments involved.

    1. Author Response

      Reviewer #1 (Public Review):

      This fMRI study investigated how memories are updated after reinterpreting past events. Participants watched a movie and subsequently recalled individual scenes from that movie. Importantly, the movie ends with a twist that changes the interpretation of earlier scenes in the movie. One group of participants watched the movie with the twist at the end, one group did not get to see the twist, and a third group was already informed about this twist before watching the movie. Analyses compared the similarity of activity patterns to (encoded or recalled) events across participants within regions of the default mode network (DMN). The design allowed for multiple relevant comparisons, confirming the prediction that activity patterns in DMN regions reflect the (re)interpretation of the movie (during movie viewing and/or during recall).

      The study is well-designed and executed. The inclusion of multiple analyses involving distinct comparisons strengthens the evidence for the role of the DMN in memory updating.

      The following points may be relevant to consider:

      1) The cross-participant pattern analysis method used here is not standard, with such analyses typically done within participants (or across participants, but after aligning representational spaces). Considering individual variability in functional organization, the method is likely only sensitive to coarse-scale patterns (e.g., anterior vs posterior parts of an ROI). This is not necessarily a weakness but is relevant when interpreting the results.

      We agree with the reviewer that functional misalignment might have played against us here. We designed this study as a natural successor of our previous work in which we captured reliable and multimodal scene-specific cross-participant pattern similarity during encoding and recall in standard space. In this revised version, we provide further evidence on how scene content is captured and influences our results. Nonetheless, we agree with your comment and add the following section to the discussion to encourage considering this point while interpreting the results.

      "Moreover, our current method relies on averaging spatially-coarse activity patterns across subjects (and time points within an event). Future extensions of this work may benefit from using functional alignment methods (Haxby et al 2020, Chen et al 2015) to capture more fine-grained event representations which are shared across participants."

      2) Unlike previous work, analyses are not testing for scene-specific information. Rather, each scene is treated separately to establish between-group differences, and results are averaged across scenes. This raises the question of whether the patterns reflect scene-specific information or generic group differences. For example, knowing the twist may increase overall engagement, both when viewing the movie (spoiled group) and when recalling it (spoiled group + twist group). The DMN may be particularly sensitive to such differences in overall engagement.

      You have brought up great points. We addressed them in two ways: (1) We ran a univariate analysis in each DMN ROI to look at the role of overall regional-average response magnitude in our results. We did not observe a significant effect of group or an interaction between group and condition. (2) We ran a scene-specificity analysis in a new Results section entitled “The role of scene content” (Figure 4). This section is focused on comparing interaction index (Figure 2C), as an indicator of memory updating, under different manipulations. Interaction index reflects the reversal of neural similarity during encoding and recall. Our results suggest that we don’t see the same effects if we shuffle the scene labels and recompute the pattern similarity analyses. Please see added text and figures below:

      "To test whether our reported results were mainly driven by the similarities and differences in multivariate spatial patterns of neural representations, as opposed to by univariate regional-average response magnitudes, we ran a univariate analysis in each ROI. This analysis revealed no significant effect of group (“spoiled”, “twist”, “no-twist”) or interaction between group and condition (movie, recall) (Table 1, see Methods for details).

      Next, to determine whether scene-specific neural event representations—as opposed to coarser differences in general mental state across all scenes with similar interpretations—drive our observed pISC differences, we shuffled the labels of critical scenes within each group before calculating and comparing pISC across groups. By repeating this procedure 1000 times and recalculating the interaction index at each iteration, we constructed a null distribution of interaction indices for shuffled critical scenes (light magenta distributions in Figure 4B). In 12 out of 24 DMN regions, interaction indices were statistically significant based on the shuffled-scene distribution (p < .025, FDR controlled at q < .05). All of these 12 regions were among the ROIs that showed meaningful effects in our original analysis (Figure 2C). Regions with significant scene-specific interaction effects are marked as blue dots with black borders in Figure 4B. Overall, the findings from this analysis confirm that our results are driven by changes to scene-specific representations."

      3) The study does not reveal what the DMN represents about the movie, such that its activity changes after knowing the twist. The Discussion briefly mentions that it may reflect the state of the observer, related to the belief about the identity of the doctor. This suggests a link to the theory of mind/mentalizing, but this is not made explicit. Alternatively, the DMN may be involved in the conflict (or switching) between the two interpretations.

      Great points. We added to the discussion about the role of mentalizing network and in the particular temporo-parietal cortex. About your last point, we think our whole brain findings outside DMN (ACC and dlPFC) might relate to that point. We discussed these further in the paper.

      "We performed two targeted analyses to look for evidence of memory updating across encoding and recall: the interaction analysis (Figure 2C) and the encoding-recall analysis (Figure 3). We hypothesized that a shift in direction of pISC difference would occur when neural representations during recall in the “twist” group start to reflect the Ghost interpretation. The interaction analysis probed this shift indirectly by taking into account the effects of both encoding-encoding and recall-recall analyses. Unlike the interaction analysis, in the encoding-recall analysis, we directly compared neural event representations during encoding and recall. Interestingly, all regions exhibiting an effect across the two encoding-recall analyses, excluding left anterior temporal cortex, were present in the interaction results. Among these regions, the left angular gyrus/TPJ exhibited an effect across all three analyses. As a core hub in the mentalizing network, temporo-parietal cortex has been implicated in theory of mind through perspective-taking, rationalizing the mental state of someone else, and modeling the attentional state of others (Frith and Frith 2006, Guterstam et. al 2021, Saxe and Kanwisher 2003). The motivations behind some actions of the main character in the movie heavily depend on whether the viewer perceives them as a Doctor or a Ghost, and participants may focus on this during both encoding and recall. We speculate that neural event representations in AG/TPJ in the current experiment may be related to mentalizing about the main character’s actions. Under this interpretation, the updated event representations during recall following the twist would be more closely aligned to the “spoiled” encoding representations, as a consequence of memory updating in the “twist” group.

      In our whole brain analysis, these regions did not have significant interaction effects, which suggests that the effects were isolated to encoding. In the whole-brain analysis, we also observed a significant encoding-encoding and interaction effects in anterior cingulate cortex, as well as recall-recall and interaction effects in dlPFC. These results suggest that both the "spoiled" manipulation and the "twist" may recruit top-down control and conflict monitoring processes during naturalistic viewing and recall."

      4) The design has many naturalistic aspects, but it is also different from real life in that the critical twist involves a ghost. Furthermore, all results are based on one movie with a specific plot twist. It is thus not clear whether similar results would be obtained with other and more naturalistic plot twists.

      We added this as a limitation of the study.

      "Our findings provide further insight into the functional role of the DMN. However, these results have been obtained using only one movie. While naturalistic paradigms better capture the complexity of real life and provide greater ecological generalizability than highly-controlled experimental stimuli and tasks (Nastase et al., 2020), they are still limited by the properties of the particular naturalistic stimulus used. For example, this movie—including the twist itself—hinges on suspension of disbelief about the existence of ghosts. Future work is needed to extend our findings about updating event memories to a broader class of naturalistic stimuli: for example, movies with different kinds of (non-supernatural) plot twists, spoken stories with twist endings, or using autobiographical real-life situations where new information (e.g. discovering a longtime friend has lied about something important) triggers re-evaluation of the past (e.g. reinterpreting their friend’s previous actions)."

      5) Only 7 scenes (out of 18) were included in the analysis. It is not clear if/how the results depend on the selection of these 7 scenes.

      Thank you for bringing this up. These scenes were pre-selected for the analyses, as they are the only scenes that are rated high by our independent raters (not study participants) on “twist influence”, meaning that knowing the twist could dramatically change their interpretation. So, we had a priori reasons to hypothesize that the effect will be strong in these scenes. To address your point, we report results by including all 18 scenes in a new Results section entitled “The role of scene content” and in Figure 4A. While the effect was weaker for all scenes it was still apparent in this conservative analysis. As expected, however, including 7 critical scenes produces stronger results than including all scenes or the uncritical scenes (all minus critical scenes). Please see the “The role of scene content” in Results and in Figure 4 for more detailed information.

      "The role of scene content In the prior analyses, we focused on “critical scenes”, selected based on ratings from four raters who quantified the influence of the twist on the interpretation of each scene (see Methods). An independent post-experiment analysis of the verbal recall behavior of the fMRI participants yielded “twist scores” that were also highest for these scenes; that is, the expected and perceived effect of twist information on recall behavior were found to match. In our next analysis, we asked whether the neural event representations reflect these differences in the twist-related content of the scenes. In other words, are the “critical scenes” with highly twist-dependent interpretations truly critical for our observed effects?

      To answer this question, we re-ran our main encoding-encoding and recall-recall pISC analysis in each DMN ROI (Figure 2-3). We calculated interaction indices (Figure 2C) first by including all scenes, and second by including only the 11 non-critical scenes. To better compare the effect of including different subsets of scenes to our original results, in Figure 4 we show the results in 15 ROIs that exhibited meaningful effects in our main analyses (Figure 2C). Figure 4A demonstrates that “critical scenes” yielded higher interaction indices compared to all scenes or non-critical scenes across all ROIs. The interaction score across all DMN ROIs was significantly higher in “critical scenes” than all scenes (t(23) = 7.19, p = 2.53 x 10-7) and non-critical scenes (t(23) = 7.3, p = 1.95 x 10-7). These results show that critical scenes are indeed responsible for the observed pISC differences across groups."

      Reviewer #2 (Public Review):

      In this manuscript titled "Here's the twist: How the brain updates the representations of naturalistic events as our understanding of the past changes", the authors reported a study that examined how new information (manipulated as a twist at the end of a movie) changes the neural representations in the default mode network (DMN) during the recall of prior knowledge. Three groups of participants were compared - one group experienced the twist at the end, one group never experienced the twist, and one group received a spoiler at the beginning. At retrieval, participants received snippets of 18 scenes of the movie as cues and were asked to freely describe the events of each scene and to provide the most accurate interpretation of the scene, given the information they gathered throughout watching.

      All three groups were highly accurate in the recall of content. The groups that experienced the twist at the end as well as at the beginning as a spoiler showed a higher twist score (the extent to which twist information was incorporated into the recall), while seemingly also keeping the interpretation without the twist ("Doctor representation") intact. Neurally, several regions in the DMN showed significant interaction effects in their neural similarity patterns (based on intersubject pattern correlation), indicating a change in interpretation between encoding and recall in the twist group uniquely, presumably reflecting memory updating.

      Several points that I think should be addressed to strengthen the manuscript:

      1) The results from encoding-retrieval similarity analysis (particularly the one depicted in Figure 3B) don't match the results from encoding/retrieval interaction (particularly those shown in Figure 2C). While they were certainly based on different comparisons, I would think that both analyses were set up to test for memory updating. Can the authors comment on this divergence in results?

      Thank you for your comment. Except for one ROI, the other two regions in Figure 2C are present in the interaction analysis. The ROI at the frontal pole might be hard to see from this angle but in fact it holds a high effect size in interaction analysis. So we do not see a big divergence between these two results. But taking into account the recall-recall results, we agree that there seems to be inhomogeneity. We discussed these further in the discussion.

      "We performed two targeted analyses to look for evidence of memory updating across encoding and recall: the interaction analysis (Figure 2C) and the encoding-recall analysis (Figure 3). We hypothesized that a shift in direction of pISC difference would occur when neural representations during recall in the “twist” group start to reflect the Ghost interpretation. The interaction analysis probed this shift indirectly by taking into account the effects of both encoding-encoding and recall-recall analyses. Unlike the interaction analysis, in the encoding-recall analysis, we directly compared neural event representations during encoding and recall. Interestingly, all regions exhibiting an effect across the two encoding-recall analyses, excluding left anterior temporal cortex, were present in the interaction results. Among these regions, the left angular gyrus/TPJ exhibited an effect across all three analyses. As a core hub in the mentalizing network, temporo-parietal cortex has been implicated in theory of mind through perspective-taking, rationalizing the mental state of someone else, and modeling the attentional state of others (Frith and Frith 2006, Guterstam et. al 2021, Saxe and Kanwisher 2003). The motivations behind some actions of the main character in the movie heavily depend on whether the viewer perceives them as a Doctor or a Ghost, and participants may focus on this during both encoding and recall. We speculate that neural event representations in AG/TPJ in the current experiment may be related to mentalizing about the main character’s actions. Under this interpretation, the updated event representations during recall following the twist would be more closely aligned to the “spoiled” encoding representations, as a consequence of memory updating in the “twist” group.

      Our findings are consistent with the view that DMN synthesizes incoming information with one’s prior beliefs and memories (Yeshurun et al 2021). We add to this framework by providing evidence for the involvement of DMN regions in updating prior beliefs in light of new knowledge. Across our different encoding and recall analyses, we observe memory updating effects in a varied subset of DMN regions that do not cleanly map onto a specific subsystem of DMN (Robin and Moscovitch 2017, Ranganath and Ritchey 2012, Ritchey and Cooper 2020). Rather than being divergent, these results might be reflecting inherent differences between the processes of encoding and recall of naturalistic events. It has been proposed that neural representations corresponding to encoding of events are systematically transformed during recall of those events (Chen et al 2017, Favila et al 2020, Musz and Chen 2022). While we provide evidence for reinstatement of memories in DMN, our findings also support a transformation of neural representation during recall, as encoding-recall results were weaker in some areas than recall-recall findings. This transformation could affect how different regions and sub-systems of DMN represent memories, and suggests that the concerted activity of multiple subsystems and neural mechanisms might be at play during encoding, recall and successful updating of naturalistic event memories."

      2) The recall task was self-paced. Can reaction time information be provided on how long participants needed to recall? Did this differ across groups? Presumably in the twist group and spoiled group participants might have needed a longer time to incorporate both the original and twist interpretation.

      This is an interesting idea. Unfortunately, we could not measure this accurately because our recall cues were snippets from the beginning of each scene with different length (selected based on content). And updating could begin from the beginning of those snippets (but we wouldn’t know when). We will consider this point in the future related designs.

      How was the length difference across events taken into consideration in the beta estimates?

      They were used as event durations in the GLM model.

      Also, is there an order effect, such that one type of interpretation tended to be recalled first?

      This is hard to measure as this only occurs in a subset of scenes. But we assume it happens in other people’s brains as well

      This is indeed hard to measure as you mentioned. We will provide the transcripts when sharing the data and hopefully this will facilitate future text-analysis work on this dataset to answer interesting questions like this.

      3) The correlation analysis between neural pattern change and behavioral twist score is based on a small sample size and does not seem to be well suited to test the postulation of the authors, namely that some participants may hold both interpretations in their memory. Interestingly, the twist score of the spoiled group was similar to the twist group, indicating participants in this group might have held both interpretations as well. Could this observation be leveraged, for example by combining both groups (hence better powered with larger sample size), in order to relate individual differences in neural similarity patterns and behavioral tendency to hold both interpretations?

      Even though both groups showed signs of holding both interpretations in mind, the process happening in their brain during the recall is different. In particular, we do not expect to see any updating effect in the spoiled group. So it wouldn’t seem accurate to combine these groups to test the effect of incomplete updating.

      4) Several regions within the DMN were significant across the analysis steps, specifically the angular gyrus, middle temporal cortex, and medial PFC. Can the authors provide more insights on how these widely distributed regions may act together to enable memory updating? The discussion on the main findings is largely at a rather superficial level about DMN, or focuses specifically on vmPFC, but neglects the distributed regions that presumably function interactively

      Thanks for bringing this up. We added text to discussion to respond to this very valid point. Please see the added text in our response to your first point. One more snippet added to the discussion about this:

      "In addition to mPFC, right precuneus and parts of temporal cortex exhibited significantly higher pattern similarity in the “twist” and “spoiled” groups who recalled the movie with the same interpretation. Precuneus is a core region in the posterior medial network, which is hypothesized to be involved in constructing and applying situation models (Ranganath and Ritchey 2012). Our findings support a role for precuneus in deploying interpretation-specific situation models when retrieving event memories. In particular, we suggest that the posterior medial network may encode a shift in the situation model of the “twist” group in order to accommodate the new Ghost interpretation.

      We performed two targeted analyses to look for evidence of memory updating across encoding and recall: the interaction analysis (Figure 2C) and the encoding-recall analysis (Figure 3). We hypothesized that a shift in direction of pISC difference would occur when neural representations during recall in the “twist” group start to reflect the Ghost interpretation. The interaction analysis probed this shift indirectly by taking into account the effects of both encoding-encoding and recall-recall analyses. Unlike the interaction analysis, in the encoding-recall analysis, we directly compared neural event representations during encoding and recall. Interestingly, all regions exhibiting an effect across the two encoding-recall analyses, excluding left anterior temporal cortex, were present in the interaction results. Among these regions, the left angular gyrus/TPJ exhibited an effect across all three analyses. As a core hub in the mentalizing network, temporo-parietal cortex has been implicated in theory of mind through perspective-taking, rationalizing the mental state of someone else, and modeling the attentional state of others (Frith and Frith 2006, Guterstam et. al 2021, Saxe and Kanwisher 2003). The motivations behind some actions of the main character in the movie heavily depend on whether the viewer perceives them as a Doctor or a Ghost, and participants may focus on this during both encoding and recall. We speculate that neural event representations in AG/TPJ in the current experiment may be related to mentalizing about the main character’s actions. Under this interpretation, the updated event representations during recall following the twist would be more closely aligned to the “spoiled” encoding representations, as a consequence of memory updating in the “twist” group.

      Our findings are consistent with the view that DMN synthesizes incoming information with one’s prior beliefs and memories (Yeshurun et al 2021). We add to this framework by providing evidence for the involvement of DMN regions in updating prior beliefs in light of new knowledge. Across our different encoding and recall analyses, we observe memory updating effects in a varied subset of DMN regions that do not cleanly map onto a specific subsystem of DMN (Robin and Moscovitch 2017, Ranganath and Ritchey 2012, Ritchey and Cooper 2020). Rather than being divergent, these results might be reflecting inherent differences between the processes of encoding and recall of naturalistic events. It has been proposed that neural representations corresponding to encoding of events are systematically transformed during recall of those events (Chen et al 2017, Favila et al 2020, Musz and Chen 2022). While we provide evidence for reinstatement of memories in DMN, our findings also support a transformation of neural representation during recall, as encoding-recall results were weaker in some areas than recall-recall findings. This transformation could affect how different regions and sub-systems of DMN represent memories, and suggests that the concerted activity of multiple subsystems and neural mechanisms might be at play during encoding, recall and successful updating of naturalistic event memories."

      Reviewer #3 (Public Review):

      Zadbood and colleagues investigated the way key information used to update interpretations of events alter patterns of activity in the brain. This was cleverly done by the use of "The Sixth Sense," a film featuring a famous "twist ending," which fundamentally alters the way the events in the film are understood. Participants were assigned to three groups: (1) a Spoiled group, in which the twist was revealed at the outset, (2) a Twist group, who experienced the film as normal, and (3) a No-Twist group, in which the twist was removed. Participants were scanned while watching the movie and while performing cued recall of specific scenes. Verbal recall was scored based on recall success, and evidence for descriptive bias toward two ways of understanding the events (specifically, whether a particular character was or was not a ghost). Importantly, this allowed the authors to show that the Twist group updated their interpretation. The authors focused on regions of the Default Mode Network (DMN) based on prior studies showing responsiveness to naturalistic memory paradigms in these areas and analyzed the fMRI data using intersubject pattern similarity analysis. Regions of the DMN carried patterns indicative of story interpretation. That is, encoding similarity was greater between the Twist and No-Twist groups than in the Spoiled group, and retrieval similarity was greater between the Twist and Spoiled groups than in the No-Twist group. The Spoiled group also showed greater pattern similarity with the Twist group's recall than the No-Twist group's recall. The authors also report a weaker effect of greater pattern similarity between the Spoiled group's encoding and the Twist group's recall than between the Twist group's own encoding and recall. Together, the data all converge on the point that one's interpretation of an event is an important determinant of the way it is represented in the brain.

      This is a really nice experiment, with straightforward predictions and analyses that support the claims being made. The results build directly on a prior study by this research group showing how interpretational differences in a narrative drive distinct neural representations (Yeshurun et al., 2017), but extend an understanding of how these interpretational differences might work retrospectively. I do not have any serious concerns or problems with the manuscript, the data, or the analyses. However I have a few points to raise that, if addressed, would make for a stronger paper in my opinion.

      1) My most substantive comment is that I did not find the interpretive framework to be very clear with respect to the brain regions involved. The basic effects the authors report strongly support their claims, but the particular contributions to the field might be stronger if the interpretations could be made more strongly or more specifically. In other words: the DMN is involved in updating interpretations, but how should we now think about the role of the DMN and its constituent regions as a result of this study? There are a number of ideas briefly presented about what the DMN might be doing, but it just did not feel very coherent at times. I will break this down into a few more specific points:

      While many of us would agree that the DMN is likely to be involved in the phenomena at hand, I did not find that the paper communicated the logic for singularly focusing on this subset of regions very compellingly. The authors note a few studies whose main results are found in DMN regions, but I think that this could stand to be unpacked in a more theoretically interesting way in the Introduction.

      Relatedly, I found the summary/description of regional effects in the Discussion to be a bit unsatisfying. The various pattern similarity comparisons yielded results that were actually quite nonoverlapping among DMN regions, which was not really unpacked. To be clear, it is not a 'problem' that the regional effects varied from comparison to comparison, but I do think that a more theoretical exploration of what this could mean would strengthen the paper. To the authors' credit, they describe mPFC effects through the lens of schemas, but this stands in contrast to many other regions which do not receive much consideration.

      Finally, although there is evidence that regions of the DMN act in a coordinated way under some circumstances, there is also ample evidence for distinct regional contributions to cognitive processes, memory being just one of them (e.g., Cooper & Ritchey, 2020; Robin & Moscovitch, 2017; Ranganath & Ritchey, 2012). The authors themselves introduce the idea of temporal receptive windows in a cortical hierarchy, and while DMN regions do appear to show slower temporal drift than sensory areas, those studies show regional differences in pattern stability across time even within DMN regions. Simply put, it is worth considering whether it is ideal to treat the DMN as a singular unit.

      Thank you for your helpful comments. We added text to the introduction and discussion to address your point:

      "Introduction:

      The brain’s default mode network (DMN)—comprising the posterior medial cortex, medial prefrontal cortex, temporoparietal junction, and parts of anterior temporal cortex—was originally described as an intrinsic or “task-negative” network, activated when participants are not engaged with external stimuli (Raichle et al. 2001, Buckner et al 2008). This observation led to a large body of work showing that the DMN is an important hub for supporting internally driven tasks such as memory retrieval, imagination, future planning, theory of mind, and creating and updating situation models (Svoboda et al. 2006; Addis et al. 2007; Hassabis and Maguire 2007, 2009; Schacter et al. 2007; Szpunar et al. 2007; Spreng et al. 2009, Koster-Hale & Saxe, 2013 2013, Ranganath and Ritchey 2012). However, it is not fully understood how this network contributes to these varying functions, and in particular—the focus of the present study—memory processes. Activation of this network during “offline” periods has been proposed to play a role in the consolidation of memories through replay (Kaefer et al 2022). Interestingly, prior work has also shown that the DMN is reliably engaged during “online” processing (encoding) of continuous rich dynamic stimuli such as movies and audio stories (Stephens et al 2013, Hasson et al 2008). Regions in this network have been shown to have long “temporal receptive windows” (Hasson et al 2008; Lerner et al., 2011; Chang et al., 2022), meaning that they integrate and retain high-level information that accumulates over the course of extended timescales (e.g. scenes in movies, paragraphs in text) to support comprehension. This combination of processing characteristics suggests that the DMN integrates past and new knowledge, as regions in this network have access to incoming sensory input, recent active memories, and remote long-term memories or semantic knowledge (Yeshurun et al 2021, Hasson et al 2015). These integration processes feature in many of the “constructive” processes attributed to DMN such as imagination, future planning, mentalizing, and updating situation models (Schacter and Addis 2007, Ranganath and Ritchey 2012). Notably, constructive processes are highly relevant to real-world memory updating, which involves selecting and combining the relevant parts of old and new memories. Recent work has shown that neural patterns during encoding and recall of naturalistic stimuli (movies) are reliably similar across participants in this network (Chen et al. 2017; Oedekoven et al., 2017; Zadbood et al., 2017; see Bird 2020 for a review of recent naturalistic studies on memory), and the DMN displays distinct neural activity when listening to the same story with different perspectives (Yeshurun et al 2017). Building on this foundation of prior work on the DMN, we asked whether we could find neural evidence for the retroactive influence of new knowledge on past memories."

      "Discussion :

      In addition to mPFC, right precuneus and parts of temporal cortex exhibited significantly higher pattern similarity in the “twist” and “spoiled” groups who recalled the movie with the same interpretation. Precuneus is a core region in the posterior medial network, which is hypothesized to be involved in constructing and applying situation models (Ranganath and Ritchey 2012). Our findings support a role for precuneus in deploying interpretation-specific situation models when retrieving event memories. In particular, we suggest that the posterior medial network may encode a shift in the situation model of the “twist” group in order to accommodate the new Ghost interpretation.

      We performed two targeted analyses to look for evidence of memory updating across encoding and recall: the interaction analysis (Figure 2C) and the encoding-recall analysis (Figure 3). We hypothesized that a shift in direction of pISC difference would occur when neural representations during recall in the “twist” group start to reflect the Ghost interpretation. The interaction analysis probed this shift indirectly by taking into account the effects of both encoding-encoding and recall-recall analyses. Unlike the interaction analysis, in the encoding-recall analysis, we directly compared neural event representations during encoding and recall. Interestingly, all regions exhibiting an effect across the two encoding-recall analyses, excluding left anterior temporal cortex, were present in the interaction results. Among these regions, the left angular gyrus/TPJ exhibited an effect across all three analyses. As a core hub in the mentalizing network, temporo-parietal cortex has been implicated in theory of mind through perspective-taking, rationalizing the mental state of someone else, and modeling the attentional state of others (Frith and Frith 2006, Guterstam et. al 2021, Saxe and Kanwisher 2003). The motivations behind some actions of the main character in the movie heavily depend on whether the viewer perceives them as a Doctor or a Ghost, and participants may focus on this during both encoding and recall. We speculate that neural event representations in AG/TPJ in the current experiment may be related to mentalizing about the main character’s actions. Under this interpretation, the updated event representations during recall following the twist would be more closely aligned to the “spoiled” encoding representations, as a consequence of memory updating in the “twist” group.

      Our findings are consistent with the view that DMN synthesizes incoming information with one’s prior beliefs and memories (Yeshurun et al 2021). We add to this framework by providing evidence for the involvement of DMN regions in updating prior beliefs in light of new knowledge. Across our different encoding and recall analyses, we observe memory updating effects in a varied subset of DMN regions that do not cleanly map onto a specific subsystem of DMN (Robin and Moscovitch 2017, Ranganath and Ritchey 2012, Ritchey and Cooper 2020). Rather than being divergent, these results might be reflecting inherent differences between the processes of encoding and recall of naturalistic events. It has been proposed that neural representations corresponding to encoding of events are systematically transformed during recall of those events (Chen et al 2017, Favila et al 2020, Musz and Chen 2022). While we provide evidence for reinstatement of memories in DMN, our findings also support a transformation of neural representation during recall, as encoding-recall results were weaker in some areas than recall-recall findings. This transformation could affect how different regions and sub-systems of DMN represent memories, and suggests that the concerted activity of multiple subsystems and neural mechanisms might be at play during encoding, recall and successful updating of naturalistic event memories."

      2) I think that some direct comparison to regions outside the DMN would speak to whether the DMN is truly unique in carrying the key representations being discussed here. I was reluctant to suggest this because I think that the authors are justified in expecting that DMN regions would show the effects in question. However, there really is no "null" comparison here wherein a set of regions not expected to show these effects (e.g., a somatosensory network, or the frontoparietal network) in fact do not show them. There are not really controls or key differences being hypothesized across different conditions or regions. Rather, we have a set of regions that may or may not show pattern similarity differences to varying degrees, which feels very exploratory. The inclusion of some principled control comparisons, etc. would bolster these findings. The authors do include a whole-brain analysis in Supplementary Figure 1, which indeed produced many DMN regions. However, notably, regions outside the DMN such as the primary visual cortex and mid-cingulate cortex appear to show significant effects (which, based on the color bar, might actually be stronger than effects seen in the DMN). Given the specificity of the language in the paper in terms of the DMN, I think that some direct regional or network-level comparison is needed.

      In the original submission, we included additional analyses for visual and somatosensory networks, which we hypothesized would serve as control networks. Following your comment, in the revision, we added a separate section (included below) more thoroughly examining these analyses. We also added text to the results and discussion to explain our interpretation of these findings.

      "Changes in neural representations beyond DMN We focused our core analyses on regions of the default mode network. Prior work has shown that multimodal neural representations of naturalistic events (e.g. movie scenes) are similar across encoding (movie-watching or story-listening) and verbal recall of the same events in the DMN (Chen et al., 2017; Zadbood et al., 2017). Therefore, in the current work we hypothesized that retrospective changes in the neural representations of events as the narrative interpretation shifts would be observed in the DMN. We did not, for example, expect to observe such effects in lower-level sensory regions, where neural activity differs dramatically for movie-viewing and verbal recall. To be thorough, we ran the same set of analyses we performed in the DMN (Figure 2-3) in regions of the visual and somatomotor networks extracted from the same atlas parcellation (Schaefer et al., 2018). Our results revealed larger overall differences in DMN than in visual and somatosensory networks for the key comparisons discussed previously (Figure S2). In particular, the only regions showing significant differences in pISC in recall-recall and encoding-recall comparisons (p < 0.01, uncorrected) were located in the DMN. We did not observe a notable difference between DMN and the two other networks when comparing recall “twist” to movie “spoiled” and recall “twist” to movie “twist” (RG – MG > RG – MD) which is consistent with the weak effect in the original comparison (Figure 3B). In the encoding-encoding comparison, several ROIs from the visual and somatomotor networks showed relatively strong effects as well (see Discussion).

      In addition, we qualitatively reproduced our results by performing an ROI-based whole brain analysis (Figure S3, p < 0.01 uncorrected). This analysis confirmed the importance of DMN regions for updating neural event representations. However, strong differences in pISC in the hypothesized direction were also observed in a handful of other non-DMN regions, including ROIs partly overlapping with anterior cingulate cortex and dorsolateral prefrontal cortex (see Discussion)."

      "Discussion: While our main goal in this paper was to examine how neural representations of naturalistic events change in the DMN, we also examined visual and somatosensory networks. Aside from the encoding-encoding analysis in which some visual and somatosensory regions showed stronger similarity between two groups with the same interpretation of the movie, we did not find any regions with significant effects in these two networks in the other analyses. Unlike the recall phase where each participant has their unique utterance with their own choice of words and concepts to describe the movie, the encoding (move-watching) stimulus is identical across all groups. Therefore, the effects observed during encoding-encoding analysis in sensory regions could reflect similarity in perception of the movie guided by similar attentional state while watching scenes with the same interpretation (e.g. similarity in gaze location, paying attention to certain dialogues, or small body movements while watching the movie with the same Doctor or Ghost interpretations). In our whole brain analysis, these regions did not have significant interaction effects, which suggests that the effects were isolated to encoding. In the whole-brain analysis, we also observed a significant encoding-encoding and interaction effects in anterior cingulate cortex, as well as recall-recall and interaction effects in dlPFC. These results suggest that both the "spoiled" manipulation and the "twist" may recruit top-down control and conflict monitoring processes during naturalistic viewing and recall."

      3) If I understand correctly, the main analyses of the fMRI data were limited to across-group comparisons of "critical scenes" that were maximally affected by the twist at the end of the movie. In other words, the analyses focused on the scenes whose interpretation hinged on the "doctor" versus "ghost" interpretation. I would be interested in seeing a comparison of "critical" scenes directly against scenes where the interpretation did not change with the twist. This "critical" versus "non-critical" contrast would be a strong confirmatory analysis that could further bolster the authors' claims, but on the other hand, it would be interesting to know whether the overall story interpretation led to any differences in neural patterns assigned to scenes that would not be expected to depend on differences in interpretation. (As a final note, such a comparison might provide additional analytical leverage for exploring the effect described in Figure 3B, which did not survive correction for multiple comparisons.)

      This is a helpful suggestion, and we’ve added an analysis addressing your comment. We found that the interaction index capturing the difference between the three groups was stronger for the critical scenes than for the non-critical scenes for almost all DMN ROIs.

      "The role of scene content In the prior analyses, we focused on “critical scenes”, selected based on ratings from four raters who quantified the influence of the twist on the interpretation of each scene (see Methods). An independent post-experiment analysis of the verbal recall behavior of the fMRI participants yielded “twist scores” that were also highest for these scenes; that is, the expected and perceived effect of twist information on recall behavior were found to match. In our next analysis, we asked whether the neural event representations reflect these differences in the twist-related content of the scenes. In other words, are the “critical scenes” with highly twist-dependent interpretations truly critical for our observed effects?

      To answer this question, we re-ran our main encoding-encoding and recall-recall pISC analysis in each DMN ROI (Figure 2-3). We calculated interaction indices (Figure 2C) first by including all scenes, and second by including only the 11 non-critical scenes. To better compare the effect of including different subsets of scenes to our original results, in Figure 4 we show the results in 15 ROIs that exhibited meaningful effects in our main analyses (Figure 2C). Figure 4A demonstrates that “critical scenes” yielded higher interaction indices compared to all scenes or non-critical scenes across all ROIs. The interaction score across all DMN ROIs was significantly higher in “critical scenes” than all scenes (t(23) = 7.19, p = 2.53 x 10-7) and non-critical scenes (t(23) = 7.3, p = 1.95 x 10-7). These results show that critical scenes are indeed responsible for the observed pISC differences across groups."

      4) I appreciate the code being made available and that the neuroimaging data will be made available soon. I would also appreciate it if the authors made the movie stimulus and behavioral data available. The movie stimulus itself is of interest because it was edited down, and it would be nice for readers to be able to see which scenes were included.

      Unfortunately due to copyright, we cannot share the movie stimulus outright. However, we will share the timing of the cuts used, as well as the time-stamped transcripts of verbal recall.

      To sum up, I think that this is a great experiment with a lot of strengths. The design is fairly clean (especially for a movie stimulus), the analyses are well reasoned, and the data are clear. The only weaknesses I would suggest addressing are with regards to how the DMN is being described and evaluated, and the communication of how this work informs the field on a theoretical level.

    1. Author Response

      Reviewer #1 (Public Review):

      In a very interesting and technically advanced study, the authors measured the force production of curved protofilaments at depolymerizing mammalian microtubule ends using an optical trap assay that they developed previously for yeast microtubules. They found that the magnesium concentration affects this force production, which they argue based on a theoretical model is due to affecting the length of the protofilament curls, as observed previously by electron microscopy. Comparing with their previous force measurements, they conclude that mammalian microtubules produce smaller force pulses than yeast microtubules due to shorter protofilament curls. This work provides new mechanistic insight into how shrinking microtubules exert forces on cargoes such as for example kinetochores during cell division. The experiments are sophisticated and appear to be of high quality, conclusions are well supported by the data, and language is appropriate when conclusions are drawn from more indirect evidence. Given that the experimental setup differs from the previous optical trap assay (antibody plus tubulin attached to bead versus only antibody attached to bead), a control experiment could be useful with yeast microtubules using the same protocol used in the new variant of the assay, or at least a discussion regarding this issue. One open question may be whether the authors can be sure that measured forces are only due to single depolymerizing protofilaments instead of two or more protofilaments staying laterally attached for a while. How would this affect the interpretation of the data?

      This work will be of interest to cell biologists and biophysicists interested in spindle mechanics or generally in filament mechanics.

      Thank you for your careful reading of our manuscript, your kind remarks, and your favorable review.

      Reviewers #1 and #2 both mentioned a concern about potential differences between our previous setup with yeast microtubules, versus our new setup with predominantly bovine microtubules, and whether such differences might underlie the different pulse amplitudes we measured. We think this concern comes mainly from a misunderstanding of how the beads in both setups were tethered to the sides of the microtubules, and we apologize for not making this aspect clearer in our original submission.

      It is true that our new setup requires one additional step, pre-decoration of the anti-His beads with His6-tagged yeast tubulin. However, in both cases, the anti-His antibodies were kept very sparse on the beads to ensure that most beads, if they became tethered to a microtubule, were attached by a single antibody. (~30 pM beads were mixed with 30 pM of anti-His antibody, for a molar ratio of 1:1.) And even though the anti-His beads in our previous work did not undergo a separate incubation step for pre-decoration with tubulin, they undoubtedly were decorated immediately after being mixed into the microtubule growth mix, which in that case included ~1 µM of unpolymerized His6-tagged yeast tubulin dimers. Thus, the arrangement with beads tethered laterally to the sides of microtubules via single antibodies was created in both cases by essentially the same three-step process: First, beads decorated very sparsely with anti-His antibodies were bound to unpolymerized His6-tagged yeast tubulin. Second, a bead-tethered His6-tagged yeast tubulin was incorporated into the growing tip of a microtubule (which could be assembling from either yeast or bovine tubulin, depending on the experiment). Third, the tip grew past the bead to create a large extension. Because the beads in both scenarios were tethered by a single antibody to the same C-terminal tail of yeast β-tubulin, the differences in pulse amplitude cannot be explained by differences in the tethering. In our revised manuscript, we now mention explicitly in Results that the beads were tethered by single antibodies (lines 95 to 100). In Methods we significantly expanded the section about preparation of beads and how they became tethered (lines 365 to 393). [We refer here, and below, to line numbers when the document is viewed with “All Markup” shown.]

      You also raise an interesting, open question: Do protofilaments curl outward entirely independently of their lateral neighbors? Or under some conditions might they tend to stay laterally associated during the curling process, perhaps curling outward in pairs rather than as individual protofilaments? We cannot formally rule out the possibility that such lateral associations sometimes persist during protofilament curling. However, changes in lateral association seem unlikely to explain the magnesium- and species-dependent differences we measured in pulse amplitude, for several reasons: First, there is good evidence for lengthening of protofilament curls at disassembling tips (e.g., Mandelkow 1991, Tran & Salmon 1997), but we are not aware of convincing evidence for magnesium or species-dependent increases in the propensity of curling protofilaments to remain laterally associated. Second, an increase in lateral association should increase the effective flexural rigidity of the curls, but under all the conditions we examined, pulse enlargement was associated with a steepening of the amplitude-vs-force relation – i.e., with softening, not stiffening. Our model indicates that this softening can be fully explained by an increase in protofilament contour length, without any change in the intrinsic flexural rigidity of the protofilament curls.

      Reviewer #2 (Public Review):

      Microtubules are regarded as dynamic tracks for kinesin and dynein motors that generate force for moving cargoes through cells, but microtubules also act as motors themselves by generating force from outward splaying protofilaments at depolymerizing ends. Force from depolymerization has been demonstrated in vitro and is thought to contribute to chromosome movement and other contexts in cells. Although this model has been in the field for many years, key questions have remained unanswered, including the mechanism of force generation, how force generated might be regulated in cells, and how this system might be tuned across cellular contexts or organisms. The barrier is that we lack an understanding of experimental conditions that can be used to control protofilament shape and energetics. This study by Murray and colleagues makes an important advance towards overcoming that barrier.

      This study builds on previous work from the authors where they developed a system to directly measure forces generated by outward curling protofilaments at depolymerizing microtubule ends. That study showed for the first time that protofilaments act like elastic springs and related the generated force to the estimated energy contained in the microtubule lattice. Furthermore, they showed that slowing polymerization rate did not diminish force generation. That study used recombinant yeast tubulin, including a 6x histidine tag on beta tubulin that created attachment points for the bead on the microtubule lattice. The current study extends that system to show that work output is related to the length of protofilament curls.

      We are grateful for your very thoughtful and thorough review, which has helped us improve our manuscript.

      Murray and colleagues show this by manipulating curls in two ways - using bovine brain tubulin instead of yeast tubulin and altering magnesium concentration. Previous EM studies indicated that protofilaments on depolymerizing bovine microtubules have similar curvature but are shorter. The authors here use a blend of bovine brain tubulin and bead-linked recombinant yeast tubulin with the 6x histidine tag in their in vitro system and find smaller deflections of the laser-trapped bead than previously observed with pure yeast tubulin. A concern with comparing this heterogeneous bovine/yeast system to the previous work with homogeneous yeast tubulin is that density of 6x histidine-tagged tubulin subunits is likely to be different between the two systems. Also, the rate of incorporation of 6x histidine yeast tubulin into bovine microtubules in the current study may be different from the rate of incorporation into yeast microtubules in the previous study. These differences could lead to changes in the strength of bead attachment to the microtubule lattice and alter the compliance of the bead to deflection by curling protofilaments. These possibilities and lattice attachment strength are not explored in this study, raising concerns about comparing the two systems.

      Reviewers #1 and #2 both mentioned a concern about potential differences between our previous setup with yeast microtubules, versus our new setup with predominantly bovine microtubules, and whether such differences might underlie the different pulse amplitudes we measured. As detailed in our response to Reviewer #1 above, we think this concern comes mainly from a misunderstanding of how the beads in both setups were tethered to the sides of the microtubules, and we apologize for not making this aspect clearer in our original submission. For both our yeast and bovine microtubule experiments, the anti-His antibodies were kept very sparse on the beads to ensure that most beads, if they became tethered to a microtubule, were attached by a single antibody. Because the beads in both scenarios were tethered by a single antibody to the same C-terminal tail of yeast β-tubulin, the differences in pulse amplitude cannot be explained by differences in the tethering. In our revised manuscript, we now mention explicitly in Results that the beads were tethered by single antibodies (lines 95 to 100). In Methods we significantly expanded the section about preparation of beads and how they became tethered (lines 365 to 393).

      The authors go on to show that magnesium increases bead deflection and work output from the system. The use of magnesium was motivated by earlier studies which showed that increasing magnesium speeds up depolymerization and increases the lengths of protofilament curls. The use of magnesium here provides the first evidence that work output can be tuned biochemically. This is an important finding. The authors then go on to show that the effect of magnesium on bead deflection can be separated from its effect on depolymerization speed. They do this by proteolytically removing the beta tubulin tail domain, which previous studies had shown to be necessary to mediate the magnesium effect on depolymerization rate. The authors arrive at a conclusion that magnesium must promote protofilament work output by increasing their lengths. How magnesium might do this remains unanswered. The mechanistic insight from the magnesium experiments ends there, but the authors discuss possible roles for magnesium in strengthening longitudinal interactions within protofilaments or perhaps complexing with the GDP nucleotide at the exchangeable site, although that seems less likely at the concentrations in these experiments.

      The major conclusion of the study is the finding that work output from curling protofilaments is a tunable system. The examples here demonstrate tuning by tubulin composition and by divalent cations. Whether these examples relate to tuning in biological systems will be an important next question and could expand our appreciation for the versatility of depolymerizing microtubules as a motor.

      We fully agree that two very important next questions are whether work output from curling protofilaments is truly harnessed in vivo, and whether protofilament properties in vivo might be actively regulated for this purpose. Based on your recommendations, and as detailed below (under Major point 2), we have expanded our discussion of these possibilities in our revised manuscript.

      Reviewer #3 (Public Review):

      The authors used a previously established optical tweezers-based assay to measure the regulation of the working stroke of curled protofilaments of bovine microtubules by magnesium. To do so, the authors improved the assay by attaching bovine microtubules to trapping beads through an incorporated tagged yeast tubulin.

      The assay is state-of-the-art and provides a direct measurement of the stroke size of protofilaments and its dependence on magnesium.

      The authors have achieved all their goals and the manuscript is well written.

      The reported findings will be of high interest for the cell biology community.

      Thank you for reading and evaluating our manuscript. We are grateful for your positive comments.

    1. I don't mean shock as in bad news or brutal murder or horrific catastrophe or embarrassing scandal

      i took this explanation as art can ignite us. it can show something different to us. it sparks new ideas, visions, or perspectives. it can instill us with feelings that are so different that what we might’ve expected. art can be so raw that the way we absorb it may not be what people would normally think they would experience from art

    1. But here are three ways that we should think about addressing this issue:Start with parent training. Parents need to be made aware of the negative impact of the video games they may be letting their children play. I get that sometimes we need to occupy our kids, and it’s very tempting to hand them a phone. But we need to be better gatekeepers.It’s hard to change a behavior if you can’t first measure it. Use tools, such as Apple’s Screen Time or Google’s Digital Wellbeing, to create awareness of just how much time you or your children are spending on games — you’ll be surprised.Finally, strike a balance. Games can be fun, of course; we just need to find moderation. When I was growing up, my parents pushed me to eat more vegetables and fruits. With technology so integral to our lives, we need to treat digital wellness like physical wellness and make sure we encourage behavior that’s good for us.

      In these paragraphs pathos is used and more specifically this would be part of scare tactics when it comes to the reader realizing that they should take action to lessen addition to gaming after the reader list some ways on how to prevent gaming addiction or lessen addiction.

    1. But it was also a cultural moment, reflecting style and attitude as much as ideology or policy positions. Both Magaziner and his audience knew that as a government bureaucrat, he was acting against type. We do not expect government officials to so willingly “get out of the room,” and generally when an official does not keep an eye on things, it is interpreted as an abdication of responsibility, not a heroic move.

      I think it may be even harder to abandon control than to come up with a new policy sometimes. Introducing regulation for the Internet was not the same as the regulation of other media, and it definitely required extensive discussion and research. However, the fact that Maganizer was able to leave the conference after initiating the discussion demonstrates an important trend of more flexible regulations.

    1. The moral issue I notice is that the overall group which is made up of middle-class caucasian students is having an issue with having a person of color on the board. The reason I came to that decision is because historically they mention that they at times are lenient on the guidelines of having someone override the process to select new members for the team. Everyone seemed to in agreeance that Reuben was a great candidate to make the decision on his own. Reuben selected a candidate that he found social interests in diversity by expressing their involvement in other programs related to diversity. The coalition members also gave Reuben some options but those were not the people he had choose. His choice was Jameela because he was interested in the fact that she was a generational first to go to school, she had interest in women's studies and the LGBTQ committee, and she stated that she wasn't afraid of challenges. I feel like that there may be some closeted angst against diversity even though they are seated on the board. I feel like maybe they feel like having a person of color on the board may be uncomfortable to them as the majority are middle class white kids. For me I think that Reuben was taking part in deontological ethics. This means, "believe that we ought to base our choices on our duty to follow universal truths that we discover through our intuition or reason." (Hackman and Johnson). I feel that Reuben felt that he was doing the right thing based on what his firm beliefs are for their group. He followed through thinking it was the right decision.

    1. Author Response

      Reviewer #1 (Public Review):

      Auwerx et al. have taken a new approach to mine large existing datasets of intermediary molecular data between GWAS and phenotype, with the aim of uncovering novel insight into the molecular mechanisms which lead a GWAS hit to have a phenotypic effect. The authors show that you can get additional insight by integrating multiple omics layers rather than analyzing only a single molecular type, including a handful of specific examples, e.g. that the effect of SNPs in ANKH on calcium are mediated by citrate. Such additional data is necessary because, as the authors' point out, while we have thousands of SNPs with significant impact on phenotypes of interest, we often don't know at all the mechanism, given that the majority of significant SNPs found through GWAS are in non-coding (and often intergenic) regions.

      This paper shows how one can mine large existing datasets to better estimate the cellular mechanism of significant, causal SNPs, and the authors have proven that by providing insight into the links between a couple of genes (e.g. FADS2, TMEM258) and metabolite QTLs and consequent phenotypes. There is definitely a need and utility for this, given how few significant SNPs (and even fewer recently-discovered ones) hit parts of the DNA where the causal mechanism is immediately obvious and easily testable through traditional molecular approaches.

      I find the paper interesting and it provides useful insight into a still relatively new approach. However, I would be interested in knowing how well this approach scales to the general genetics community: would this method work with a much smaller N (e.g. n = 500)? Being able to make new insights using cohorts of nearly 10,000 patients is great, but the vast majority of molecular studies are at least an order of magnitude smaller. While sequencing and mass spectrometry are becoming exponentially cheaper, the issue of sample size is likely to remain for the foreseeable future due to the challenges and expenses of the initial sample collection.

      We thank the reviewer for his assessment and have now addressed – in the revised version of the manuscript, as well as in the below point-by-point reply – his specific comments/questions.

      Reviewer #2 (Public Review):

      Auwerx et al. present a framework for the integration of results from expression quantitative trait loci (eQTL), metabolite QTL (mQTL) and genome-wide association (GWA) studies based on the use of summary statistics and Mendelian Randomization (MR). The aim of their study is to provide the field with a method that allows for the detection of causal relationships between transcript levels and phenotypes by integrating information about the effect of transcripts on metabolites and the downstream effect of these metabolites on phenotypes reported by GWA studies. The method requires the mapping of identical SNPs in disconnected mQTL and eQTL studies, which allows MRbased inference of a causal effect from a transcript to a metabolite. The effect of both transcripts and metabolites on phenotypes is evaluated in the same MR-based manner by overlaying eQTL and mQTL SNPs with SNPs present in phenotypic GWA studies.

      The aim of the presented approach is two-fold: (1) to allow identification of additional causal relationships between transcript levels and phenotypes as compared to an approach limited to the evaluation of transcript-to-phenotype associations (transcriptome-wide MR, TWMR) and (2) to provide information about the mechanism of effects originating from causally linked transcripts via the metabolite layer to a phenotype.

      The study is presented in a very clear and concise way. In the part based on empirical study results, the approach leads to the identification of a set of potential causal triplets between transcripts, metabolites and phenotypes. Several examples of such causal links are presented, which are in agreement with literature but also contain testable hypotheses about novel functional relationships. The simulation study is well documented and addresses an important question pertaining to the approach taken: Does the integration of mQTL data at the level of a mediator allow for higher power to detect causal transcript to phenotype associations?

      We thank the reviewer for his/her assessment and have now addressed – in the revised version of the manuscript, as well as in the below point-by-point reply – his/her specific comments/questions.

      Major Concerns

      1) Our most salient concern regarding the presented approach is the presence of multiple testing problems. In the analysis of empirical datasets (p. 4), the rational for setting FDR thresholds is not clearly stated. While this appears to be a Bonferroni-type correction (p-value threshold divided by number of transcripts or metabolites tested), the thresholds do not reflect the actual number of tests performed (7883 transcripts times 453 metabolites for transcript-metabolite associations, 87 metabolites or 10435 transcripts times 28 complex phenotypes). The correct and more stringent thresholds certainly decrease the overlap between causal relationships and thus reduce the identifiable number of causal triplets. Furthermore, we believe that multiple testing has to be considered for correct interpretation of the power analysis. The study compares the power of a TWMR-only approach to the power of mediation-based MR by comparing "power(TP)" against "power(TM) * power(MP)" (p. 12). This comparison is useful in a hypothetical situation given data on a single transcript affecting a single phenotype, and with potential mediation via a single metabolite. However, in an actual empirical situation, the number of non-causal transcript-metabolite-phenotype triplets will exceed the number of non-causal transcript-phenotype associations due to the multiplication with the number of metabolites that have to be evaluated. This creates a tremendous burden of multiple testing, which will very likely outweigh the increase in power afforded by the mediation-based approach in the hypothetical "single transcript-metabolite-phenotype" situation described here. Thus, for explorative detection of causal transcript-phenotype relationships, the TWMR-only method might even outperform the mediation-based method described by the authors, simply because the former requires a smaller number of hypotheses to be tested compared to the latter. The presented simulation would only hold in cases where a single path of causality with a known potential mediator is to be tested.

      We thank the reviewer for pointing out the multiple testing issue. Based on this comment, we have revised our approach by mainly implementing two major modifications to our approach.

      First, we reduce the number of assessed metabolites to 242 compounds for which we were able to identify a Human Metabolome Database (HMDB) identifier through manual curation. This was triggered by the suggestion of reviewer #1 to facilitate the database/literature-based follow-up of our discoveries. The motivation is to only test metabolites that if found to be significantly associated would yield interpretable results, thereby reducing the number of tests to be performed. This modification is described in the revised manuscript:

      Results: “Summary statistics for cis-eQTLs stem from the eQTLGen Consortium metaanalysis of 19,942 transcripts in 31,684 individuals [3], while summary statistics for mQTLs originate from a meta-analysis of 453 metabolites in 7,824 individuals from two independent European cohorts: TwinsUK (N = 6,056) and KORA (N = 1,768) [6]. After selecting SNPs included in both the eQTL and mQTL studies, our analysis was restricted to 7,884 transcripts with ≥ 3 instrumental variables (IVs) (see Methods, Supplemental Figure 1) and 242 metabolites with an identifier in The Human Metabolome Database (HMDB) [28] (see Methods, Supplemental Table 1).”

      Methods: “mQTL data originate from Shin et al. [6], which used ultra-high performance liquid chromatography-tandem mass spectrometry (UPLC-MS/MS) to measure 486 whole blood metabolites in 7,824 European individuals. Association analyses were carried out on ~2.1 million SNPs and are available for 453 metabolites at the Metabolomics GWAS Server (http://metabolomics.helmholtz-muenchen.de/gwas/). Among these metabolites, 242 were manually annotated with Human Metabolome Database (HMDB) identifiers (Supplemental Table 1) and used in this study.”

      Second, to account for all remaining tests, we now select significant causal effects based on FDR < 5% in all performed univariable MR analyses. With 5% FDR on both the transcript-to-metabolite and metabolite-to-phenotype effects, the FDR for triplets is slightly inflated to 9.75% (= 1-0.952), a consideration that we now explicitly describe. Note that selecting triplets based on transcript-tometabolite and metabolite-to-phenotype effects FDR < 2.5%, result in a FDR < 5% (1-0.9752) for the triplets. This more stringent threshold identifies 135 causal triplets, 39 of which would be missed by TWMR. Overall, Results and Supplemental Tables have been updated and now read as follow:

      “Mapping the transcriptome onto the metabolome […] By testing each gene for association with the 242 metabolites, we detected 96 genes whose transcript levels causally impacted 75 metabolites, resulting in 133 unique transcriptmetabolite associations (FDR 5% considering all 1,907,690 instrumentable gene-metabolite pairs Supplemental Table 2) […].

      Mapping the metabolome onto complex phenotypes […] Overall, 34 metabolites were associated with at least one phenotype (FDR 5% considering all 1,344 metabolite-phenotype pairs), resulting in 132 unique metabolitephenotype associations (Supplemental Table 4).

      Mapping the transcriptome onto complex phenotypes […] In total, 5,140 transcripts associated with at least one phenotype (FDR 5% considering all 292,170 gene-phenotype pairs) resulting in 13,141 unique transcript-phenotype associations (Supplemental Table 5).

      Mapping metabolome-mediated effects of the transcriptome onto complex phenotypes […] We combined the 133 transcript-metabolite (FDR ≤ 5%) and 132 metabolite-trait (FDR ≤ 5%) associations to pinpoint 216 transcript-metabolite-phenotype causal triplets (FDR = 1-0.952 = 9.75%) (Supplemental Table 6).”

      In the simulations performed for the power analysis, we used a Bonferroni correction. We ran each simulation for 500 transcripts, measuring 80 metabolites at each run and performed TWMR and MWMR. The power of TWMR was calculated by counting how many times we obtain p-values ≤ 0.05/500. The power of the mediation analysis was calculated as 𝑝𝑜𝑤𝑒𝑟"$ ∗ 𝑝𝑜𝑤𝑒𝑟$#, where 𝑝𝑜𝑤𝑒𝑟"$ was calculated by counting how many times we obtain p-values ≤ 0.05/(500*80), and 𝑝𝑜𝑤𝑒𝑟$# was calculated by counting how many times we obtain p-values ≤ 0.05/80. In the revised manuscript, we additionally repeated each simulated scenario 10 times to increase robustness of results. This has been clarified in both the Methods and Results sections of the revised manuscript:

      Methods: “Ranging 𝜌 and 𝜎 from -2 to 2 and from 0.1 and 10, respectively, we run each simulation for 500 transcripts measuring 80 metabolites at each run and performed TWMR and MWMR starting from above-described 𝛽7<"=, 𝛽4<"= and 𝛽>?,(. For each MR analysis we calculated the power to detect a significant association as well as the difference in power between TWMR and the mediation analyses (i.e., 𝑝𝑜𝑤𝑒𝑟"# − 𝑝𝑜𝑤𝑒𝑟"$ ∗ 𝑝𝑜𝑤𝑒𝑟$#). Each specific scenario was repeated 10 times and the average difference in power across simulation was plotted as a heatmap.”

      Results: “To characterize the parameter regime where the power to detect indirect effects is larger than it is for total effects, we performed simulations using different settings for the mediated effect. In each scenario we evaluated 500 transcripts and 80 metabolites and varied two parameters characterizing the mediation: a. the proportion (𝜌) of direct (𝛼!) to total (𝛼"#) effect (i.e., effect not mediated by the metabolite) from -2 to 2 to cover the cases where direct and mediated effect have opposite directions (51 values); b. the ratio (𝜎) between the transcript-to-metabolite (𝛼"$) and the metabolite-to-phenotype (𝛼$#) effects, exploring the range from 0.1 to 10 (51 values).<br /> Transcripts were simulated with 6% heritability (i.e., median ℎ@ in the eQTLGen data) and a causal effect of 0.035 (i.e., ~65% of power in TWMR at a = 0.05) on a phenotype. Each scenario was simulated 10 times and results were averaged to assess the mean difference in power (see Methods).”

      2) A second concern regards the interpretation of the results based on the empirical datasets. For the identified 206 transcript-metabolite-phenotype causal triplets, the authors show a comparison between TWMR-based total effect of transcripts on phenotypes and the calculated direct effect based on a multivariable MR (MVMR) test (Figure 2B), which corrects for the indirect effect mediated by the metabolite in the causal triplet. The comparison shows a strong correlation between direct and total effect. A thorough discussion of the potential reasons for deviation (in both negative and positive directions) from the identity line is missing.

      Deviation from the identity line, as observed in Figure 2B, indicates that while there is a strong correlation between direct and total effect, it is not perfect, and part of the total effect is due to an indirect effect mediated by metabolites. This is explained and discussed in the Results and Discussion section:

      Results: “Regressing direct effects (𝛼!) on total effects (𝛼"#) on (Figure 2A), we estimated that for our 216 mediated associations, 77% [95% CI: 70%-85%] of the transcript effect on the phenotype was direct and thus not mediated by the metabolites (Figure 2B).”

      Discussion: “The observation that 77% of the transcript’s effect on the phenotype is not mediated by metabolites suggests that either true direct effects are frequent or that other unassessed metabolites or molecular layers (e.g., proteins, post-translational modifications, etc.) play a crucial role in such mediation. It is to note that in the presence of unmeasured mediators or measured mediators without genetic instruments, our mediation estimates are lower bounds of the total existing mediation. […] Thanks to the flexibility of the proposed framework, we expect that in the future and upon availability of ever larger and more diverse datasets, our method could be applied to estimate the relative contribution of currently unassessed mediators in translating genotypic cascades.”

      Furthermore, no test of significance for potential cases of mediation is presented. Due to the issues of multiple testing discussed above, the significance of the inferred cases of mediation is drawn into question. The examples presented for causal triplets (involving the ANKH and SLC6A12 transcripts) feature transcripts with low total effects and a small ratio between direct and total effect, in line with the power analysis. However, in these examples, the total effects are also quite low. Its significance has to be tested with an appropriate statistical test, incorporating multiple testing correction.

      Following the reviewer’s suggestion, we have modified our criteria to call significant associations to account for multiple testing (see extensive reply to major concern #1). With 5% FDR on both the transcript-to-metabolite and metabolite-to-phenotype effects, the FDR for triplets is slightly inflated to 9.75% (= 1-0.952). We mention this limitation in the revised manuscript:

      “We combined the 133 transcript-metabolite (FDR ≤ 5%) and 132 metabolite-trait (FDR ≤ 5%) associations to pinpoint 216 transcript-metabolite-phenotype causal triplets (FDR = 1-0.952 = 9.75%) (Supplemental Table 6).”

      All examples presented in the original manuscript remained significant. The fact that the total effect in these examples is low makes them particularly interesting as it highlights how our approach can detect biologically plausible associations between a transcript and a phenotype that only show mild evidence through TWMR but are strongly supported when accounting for metabolites that mediate the transcript-phenotype relation, showcasing situations in which our method can provide a true advantage over classical approaches such as TWMR. Such examples may emerge due to opposite signed direct and indirect effects, which cancel each other out when it comes to testing total effects. What is key that we do not claim the total and the mediated effects to be different (as we would have very limited power to do so), but simply point out that under certain settings we are better powered to detect mediated effects than total ones. In the ANKH example (more details below), the total ANKH-calcium effect is almost exactly the same as the product of the 𝛼,-.%→056157 and 𝛼056157→0120*34 effects, simply the latter ones are detectable, while the total effect is not.

      In the revised manuscript the case for our selected examples is made even stronger thanks to an analysis proposed by Reviewer #1 that aimed at estimating the proportion of previously reported associations through automated literature review. For instance, while our literature review found previously reported evidence of the ANKH-calcium link and of the ANKH-citrate link, we did not identify any publication mentioning all 3 terms in combination in the abstract and/or title, illustrating how our approach can establish bridges between knowledge gaps. We revised the Results section describing the ANKH example accordingly:

      “The 126 triplets that were not identified through TWMR due to power issues represent putative new causal relations. This is well illustrated by a proof-of concept example involving ANKH [MIM: 605145] and calcium levels, for which 48 publications were identified through automated literature review (Supplemental Table 6). While the TWMR effect of ANKH expression on calcium levels was not significant (𝛼,-.%→012034 = −0.02; 𝑃 = 0.03), we observed that ANKH expression decreased citrate levels (𝛼,-.%→056157 = −0.30; 𝑃 = 2.2 × 1089:), which itself increased serum calcium levels (𝛼056157→012034 = 0.07; 𝑃 = 6.5 × 108;9). Mutations in ANKH have been associated with several rare mineralization disorders [MIM: 123000, 118600] [32] due to the gene encoding a transmembrane protein that channels inorganic pyrophosphate to the extracellular matrix, where at low concentrations it inhibits mineralization [33]. Recently, a study proposed that ANKH instead exports ATP to the extracellular space (which is then rapidly converted to inorganic pyrophosphate), along with citrate [34]. Citrate has a high binding affinity for calcium and influences its bioavailability by complexing calcium-phosphate during extracellular matrix mineralization and releasing calcium during bone resorption [35]. Together, our data support the role of ANKH in calcium homeostasis through regulation of citrate levels, connecting previously established independent links into a causal triad.”

      Furthermore, the analysis of the empirical data indicates that the ratio between direct and indirect effect of a transcript on a phenotype is in most cases close to identity, except for triplets with low total effects. This fact should be considered in the power analysis, which assigned the highest gain in power by the mediation analysis to cases of low direct to total effect ratio. The empirical data indicate that these cases might be rare or of minor relevance for the tested phenotypes.

      As our previous power analyses did not fully reflect scenarios observed from empirical data, we extended the range of covered 𝜌 (i.e., the ratio between direct and total effect), so that it mimics more closely the observed range of 𝜌. In the revised manuscript, 𝜌 varies from -2 to 2, so that we also consider configurations where direct and total effects have opposite direction. To provide the readers with a rough idea how frequent the different parameter combinations occur in real data, we now provide another heatmap indicating the density of detected associations in those parameter regimes as Supplemental Figure 4.

      This map can be brought in perspective of Figure 4A that illustrates the power of TWMR vs. mediation analysis over the same range of parameter settings.

      It becomes apparent from Supplemental Figure 4 that in real data, 𝜎 is always larger than 1 and often exceeds 10. Note, however, that this heatmap must be interpreted with care, since the “detected” density will be low in regions where both methods have low power.

      3) Related to the interpretation of causal links: horizontal pleiotropy needs to be considered. The authors report the identification of causal links between TMEM258, FADS1 and FADS2, arachidonic acid-derived lipids and complex phenotypes. However, they also mention the high degree of pleiotropy due to linkage disequilibrium at the underlying eQTL and mQTL region as well as the network of over 50 complex lipids known to be associated with the expression of the above transcripts. Thus, it seems possible that the levels of undetected lipid species may be more important for the phenotypic effect of variation in these transcripts and that the reported "mediators" are rather covariates. Such horizontal pleiotropy would violate a basic assumption of the MR approach. While we think that this does not invalidate the approach altogether, it does affect the interpretation of specific metabolites as mediators. This is aggravated by the fact that metabolic networks are more tightly interconnected than macromolecular interaction networks (assortative nature of metabolic networks) and that single point-measurements of metabolites may not be generally informative about the flux through a specific metabolic pathway.

      This is a valid point and we discuss this limitation in the revised Discussion:

      “It is to note that in the presence of unmeasured mediators or measured mediators without genetic instruments, our mediation estimates are lower bounds of the total existing mediation. In addition, unmeasured mediators sharing genetic instruments with the measured ones, can modify result interpretation as some of the observed mediators may simply be correlates of the true underlying mediators. While this is a limitation of all MR methods, metabolic networks may harbor particularly large number of genetically correlated metabolite species.”

    2. Reviewer #2 (Public Review):

      Auwerx et al. present a framework for the integration of results from expression quantitative trait loci (eQTL), metabolite QTL (mQTL) and genome-wide association (GWA) studies based on the use of summary statistics and Mendelian Randomization (MR). The aim of their study is to provide the field with a method that allows for the detection of causal relationships between transcript levels and phenotypes by integrating information about the effect of transcripts on metabolites and the downstream effect of these metabolites on phenotypes reported by GWA studies. The method requires the mapping of identical SNPs in disconnected mQTL and eQTL studies, which allows MR-based inference of a causal effect from a transcript to a metabolite. The effect of both transcripts and metabolites on phenotypes is evaluated in the same MR-based manner by overlaying eQTL and mQTL SNPs with SNPs present in phenotypic GWA studies.

      The aim of the presented approach is two-fold: (1) to allow identification of additional causal relationships between transcript levels and phenotypes as compared to an approach limited to the evaluation of transcript-to-phenotype associations (transcriptome-wide MR, TWMR) and (2) to provide information about the mechanism of effects originating from causally linked transcripts via the metabolite layer to a phenotype.

      The study is presented in a very clear and concise way. In the part based on empirical study results, the approach leads to the identification of a set of potential causal triplets between transcripts, metabolites and phenotypes. Several examples of such causal links are presented, which are in agreement with literature but also contain testable hypotheses about novel functional relationships. The simulation study is well documented and addresses an important question pertaining to the approach taken: Does the integration of mQTL data at the level of a mediator allow for higher power to detect causal transcript to phenotype associations?

      Major Concerns<br /> 1. Our most salient concern regarding the presented approach is the presence of multiple testing problems. In the analysis of empirical datasets (p. 4), the rational for setting FDR thresholds is not clearly stated. While this appears to be a Bonferroni-type correction (p-value threshold divided by number of transcripts or metabolites tested), the thresholds do not reflect the actual number of tests performed (7883 transcripts times 453 metabolites for transcript-metabolite associations, 87 metabolites or 10435 transcripts times 28 complex phenotypes). The correct and more stringent thresholds certainly decrease the overlap between causal relationships and thus reduce the identifiable number of causal triplets. Furthermore, we believe that multiple testing has to be considered for correct interpretation of the power analysis. The study compares the power of a TWMR-only approach to the power of mediation-based MR by comparing "power(TP)" against "power(TM) * power(MP)" (p. 12). This comparison is useful in a hypothetical situation given data on a single transcript affecting a single phenotype, and with potential mediation via a single metabolite. However, in an actual empirical situation, the number of non-causal transcript-metabolite-phenotype triplets will exceed the number of non-causal transcript-phenotype associations due to the multiplication with the number of metabolites that have to be evaluated. This creates a tremendous burden of multiple testing, which will very likely outweigh the increase in power afforded by the mediation-based approach in the hypothetical "single transcript-metabolite-phenotype" situation described here. Thus, for explorative detection of causal transcript-phenotype relationships, the TWMR-only method might even outperform the mediation-based method described by the authors, simply because the former requires a smaller number of hypotheses to be tested compared to the latter. The presented simulation would only hold in cases where a single path of causality with a known potential mediator is to be tested.

      2. A second concern regards the interpretation of the results based on the empirical datasets. For the identified 206 transcript-metabolite-phenotype causal triplets, the authors show a comparison between TWMR-based total effect of transcripts on phenotypes and the calculated direct effect based on a multivariable MR (MVMR) test (Figure 2B), which corrects for the indirect effect mediated by the metabolite in the causal triplet. The comparison shows a strong correlation between direct and total effect. A thorough discussion of the potential reasons for deviation (in both negative and positive directions) from the identity line is missing. Furthermore, no test of significance for potential cases of mediation is presented. Due to the issues of multiple testing discussed above, the significance of the inferred cases of mediation is drawn into question. The examples presented for causal triplets (involving the ANKH and SLC6A12 transcripts) feature transcripts with low total effects and a small ratio between direct and total effect, in line with the power analysis. However, in these examples, the total effects are also quite low. Its significance has to be tested with an appropriate statistical test, incorporating multiple testing correction. Furthermore, the analysis of the empirical data indicates that the ratio between direct and indirect effect of a transcript on a phenotype is in most cases close to identity, except for triplets with low total effects. This fact should be considered in the power analysis, which assigned the highest gain in power by the mediation analysis to cases of low direct to total effect ratio. The empirical data indicate that these cases might be rare or of minor relevance for the tested phenotypes.

      3. Related to the interpretation of causal links: horizontal pleiotropy needs to be considered. The authors report the identification of causal links between TMEM258, FADS1 and FADS2, arachidonic acid-derived lipids and complex phenotypes. However, they also mention the high degree of pleiotropy due to linkage disequilibrium at the underlying eQTL and mQTL region as well as the network of over 50 complex lipids known to be associated with the expression of the above transcripts. Thus, it seems possible that the levels of undetected lipid species may be more important for the phenotypic effect of variation in these transcripts and that the reported "mediators" are rather covariates. Such horizontal pleiotropy would violate a basic assumption of the MR approach. While we think that this does not invalidate the approach altogether, it does affect the interpretation of specific metabolites as mediators. This is aggravated by the fact that metabolic networks are more tightly interconnected than macromolecular interaction networks (assortative nature of metabolic networks) and that single point-measurements of metabolites may not be generally informative about the flux through a specific metabolic pathway.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary<br /> The authors have set out to study the Drosophila immune response against the fungus Aspergillus fumigatus. They found that Aspergillus fumigatus kills Drosophila Toll pathway mutants. The fungus does this without invasion because its dissemination is blocked by melanization. They suggest that there is a role for Toll in host defense distinct from resistance. The findings are interesting, and looks like the mycotoxins play a role. It also seems that there is some role of the Bomanins here, but I find that in particular Figure4 experiments are not convincing enough to provide a mechanistic insight as to what is going on. I think the authors need to think through what their results mean, and also, explain better (especially regarding Fig 4) their ideas and how the data fits them.

      We thank the reviewer for scrutinizing our manuscript as well as for suggestions to improve it.

      The role of mycotoxins is demonstrated:

      i) the fungus does not proliferate nor disseminate, also in Toll pathway mutant flies: thus, it must kill through diffusible substances, in as much as these immuno-deficient flies exhibit tremors toward the end of the infection;

      ii) a fungal strain devoid of the capacity to produce secondary metabolites is no longer virulent, even in Toll pathway mutant flies.

      The role of Bomanins is also demonstrated: the finding of a susceptibility of Bom__D__55C deletion flies to A. fumigatus and to mycotoxin challenges clearly shows that at least one or several Bomanin genes are required in the host defense against these challenges. The observation that this susceptibility can be rescued by the genetic overexpression of specific Bomanins indicates which ones are likely to mediate protection. The novel data we have included with the protection from mycotoxin action in neurons point clearly to BomS6 being the major mediator of protection against verruculogen action since it is the only one of two Bom genes to be induced in the head and with a proven potential for rescue of the Bom__D__55C phenotype.

      As regards the concept of the article, it is simple: we show that the Toll pathway does not control A. fumigatus infection by directly attacking the fungus but does so by neutralizing the effects of secreted virulence factors such as restrictocin and verruculogen. We further identify some of the relevant effectors such as Bomanins by using a genetic complementation strategy. To make our point clearer, we have now included additional data in which we show that BomS6 and BomS4 are the only Bomanins induced in the head of flies upon the injection of these two toxins. We next determine that BomS6 and not BomS4 expression in the nervous system dominantly protects the flies from the deleterious effects of verruculogen injection, both in terms of recovery from tremors and survival. Mechanistically, the Toll pathway protects the host from the action of verruculogen by expressing and likely secreting BomS6 from neurons.

      Major comments:<br /> Page 5: .."the fungal burden did not increase much in MyD88 flies challenged with 50 conidia (Fig. 1B)" - What do you mean did not increase much? There is a clear increase in Myd88 mutants compared to controls; would you expect a bigger increase (e.g. log scale induction)? Explain.

      When the injected dose is higher than 50 injected colonies, the fungal burden remains very close to that of the injected inoculum (Fig. EV1_F, J_). As for other pathogens regulated by the Toll pathway, it has been published that the microbial burden increases by log factors for filamentous fungi (Huang et al.., in revision), pathogenic yeasts (e.g., work from our laboratory Quintin et al. Journal of Immunology, 2013), bacteria (e. g., Duneau et al., eLife 2017; Huang et al., in revision). The pathogens usually proliferate exponentially in immuno-deficient hosts, which is clearly not the case of A. fumigatus, the first example we know of.

      Page 6: "the SPZ/Toll/MyD88 cassette is required for host defense against A. fumigatus infections, even though this pathogen only mildly stimulates the Toll pathway." - Should you rather say that A. fumigatus only mildly induces the Toll pathway target gene Drosomycin?

      The answer is negative. Fig. EV1_C_ clearly shows that BomS1 is also modestly induced as compared to an infection with E. faecalis. The promoter of BomS1 contains a canonical Dif-response element (Busse et al., EMBO J., 2007_)_. For a more thorough discussion of this point, please, see reply to Reviewer 2, Major Comment 2.

      Page 6: "...we tested Hayan mutant flies defective for this arm of innate immunity (Nam et al., 2012)." - elaborate this, which arm/which pathway?

      The title of the paragraph is “Drosophila melanization curbs A. fumigatus invasion”. The full first sentence of the paragraph actually read: “As melanization is a host defense of insects effective against fungal infections, we tested Hayan mutant flies defective for this arm of innate immunity”.

      This has not been introduced in the introduction. Explain.

      We have now added a couple of lines (82-83) to introduce melanization for the nonspecialist reader.

      Can you really draw this conclusion: "We conclude that melanization limits the proliferation and the dissemination of A. fumigatus injected into wild-type flies yet does not eradicate it at the injection site, where a melanization plug forms." Maybe you can based on the function/importance of the pathway to melanization, but you need to explain.

      Melanization is mediated by the Hayan protease and three phenol oxidases (two in adults) that catalyze the enzymatic reactions leading to melanin production (for Drosophila, please see Nam et al. EMBO J. (2012), Bingelli et al., PLoS Pathogen (2014), Dudzic et al., BMC Biology (2015), Cell Reports, 2019). Thus, finding that there is an increased proliferation and dissemination in null Hayan mutants is a strong indication for a role of melanization. The identification of a similar phenotype for PPO2 and PO1-PPO2 mutants demonstrates that melanization is curbing A. fumigatus. Our sentence is therefore fully justified.

      Page 10: "The cleavage of the 18S RNA was however much less pronounced in wild-type flies as compared to MyD88" - I am not sure what this means. Do you mean 28S?

      We thank the reviewer for pointing out this mistake that has now been corrected.

      And that the 28S peak is lower? Is this a quantitative method?

      The technique is liquid electrophoresis on a microchip. It is both a qualitative and quantitative technique that replaces traditional agarose or polyacrylamide gels.

      Fig. legend: "Arrows show the position of the 28S RNA sarcin fragment" - there are three arrows in both Fig 4E and F; specify which arrows point what.

      The thick arrow is now indicated in the figure legend to correspond to the much smaller sarcin fragment whereas the thin arrows on the graph clearly specify the position of the 28S RNA peaks.

      Based on the results, I am not convinced about the conclusion, that "restrictocin is able to inhibit translation to a detectable degree in vivo, likely through the cleavage of the ribosomal 28S a-sarcin/ricin loop as described in vitro." <- Do you draw this conclusion before doing the actual in vitro experiment, which is described next in the text (The rabbit reticulocute assay, S2 cells)?

      The existing literature (line 259 for a few selected references) has largely proven that restrictocin cleaves 28S RNA in vitro. We are demonstrating that this also happens in vivo in flies based on the generation of the alpha-sarcin fragment as well as the decreased 28S peaks. Our transgenic approach also indicates that restrictocin blocks translation in vivo. The in vitro approach has been implemented so that we could test the effect of synthetic BomS1 and BomS3 in cell culture. As to our knowledge, no one had demonstrated that restrictocin blocks translation in Drosophila cultured cells. It was therefore important to demonstrate it in cell culture using well-characterized in vitro techniques mastered by AT and FM.

      4H: Not sure what should be seen here, is it the darkest band at 0 uM that disappears?

      We have improved the figure and added an arrow to point out to the relevant band on the gel.

      HI & J need more explanation than what is now included in the text or Figure legend, is the conclusion that there is no difference? Write the stats above the Figs 4I & J (n.s.?).

      We have added NS on the figures and made our conclusion clearer (lines 295-298).

      Minor comments:

      It would have helped commenting if the manuscript contained line numbers

      We apologize for having initially provided a version in which lines were not numbered. At the prompting of Review Commons we immediately provided such a version, that was actually used by Reviewer 2.

      Why do you have the title "Hayan" on top of Fig 1F; you don't have this marking system in the other survival curves

      This point has now been addressed and the survival experiments checked for consistency.

      Fig 2A: Can you speculate why MyD88 flies die rapidly at day 10 if you inject PBST (your control)? What would happen to uninjected controls in otherwise the same conditions? (you could include an uninjected control here?)

      We suspect that this is linked to the trauma induced by the injection. Trauma has been shown to impact the homeostasis of the midgut epithelium (Lee & Miura, Current Topics Developmental Biology 2014, Chakrabarti et al., PLoS Genetics (2016)), and we suspect that it may lead to a leakiness of the gut allowing the passage of some bacteria from the gut microbiota that can proliferate in the hemocoel. Hence, we checked axenic and antibiotics-treated MyD88 flies to exclude that the limited sensitivity to trauma was not significantly contributing to the phenotypes we describe. It is also linked to the thickness of the needle and the problem is alleviated by using thinner needles.

      The uninjected control is now shown in Fig. EV8_E_.

      Please, see also the answer to Reviewer 2 Major comment 1.

      Fig 2E: Not sure what would be the best way of presenting the curves - different colors, dotted lines or something? Now if there are too many lines, they are hard to tell apart. because the symbols are not that visible. Like in 2E if you want to compare the light red/orange colored lines.

      We agree with the reviewer that the lines are hard to tell apart. This is however not a significant issue since the glip mutants display curves similar to that of the wt A. fumigatus control strain.

      For consistency add the caption also to Fig 3D (I assume it is the same as 3C)

      The caption was present in our version and is present in the revised version.

      For consistency, should you add Verruculogen on top of Fig 3F?

      Same reply as for the previous comment.

      Chronologically, how it is explained in the text, Figs 4A and B are in the wrong order.

      We fully agree with the reviewer. This problem has been addressed in the revised version.

      The quality of Fig 4 is not great, the text is hard to read (too small) and becomes blurry upon magnification.

      We fully agree with the reviewer. This problem has been addressed in the revised version.

      Page 12; "These data then suggest that a process akin to the immune surveillance of core cellular processes first described in C. elegans may also exist in Drosophila" - I think this sentence belongs to the discussion, this is not directly drawn from the results.

      We have followed the reviewer suggestion and have now developed our Discussion paragraph now entitled “Induction of the expression of specific Bomanin genes upon mycotoxin challenge”

      Referees cross-commenting

      I think we share many thoughts among all the reviewers.

      The main problem is that the manuscript language is quite strong; from the results many times it is not ok to make such strong statements. Some experiments need further analysis and clarification.

      I think in most cases, this could be achieved by softening the statements and adding more discussion, and not by making new experiments (some may be needed).

      We respectfully disagree with the reviewer on this point. There were obviously some misunderstandings that might be traced to the short format of the initial version. We have now developed the Discussion to clarify our conclusions as suggested by the reviewer.

      Minor things are that experiments are not advancing in a logical order between the text and the figures and there are problems with resolution in some figures.

      Statistics in some figures needs to be added.

      Please, see above.

      Reviewer #1 (Significance):

      The nature of the work is conceptual for the field, to understand the role of the Toll pathway and Bomanins in particular, in this fungal infection model. The work is interesting to a somewhat limited audience, mainly immunologists and in particular, people interested in the Drosophila model for immunity. The work may be interesting conceptually in understanding fungal infections.

      We are not certain that immunologists represent a limited audience. We agree that work on fungal infections is insufficiently funded with respect to the medical importance of these infections, as highlighted in our introduction and Perspective section of the Discussion.

      My expertise: I am a Drosophila immunity researcher with nearly 20 years of experience in working with fly immunity, in particular the Toll and the Imd pathways.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary:

      Xu et al. describe how A. fumigatus kills Toll-deficient fruit flies not by hyperproliferation, but more likely by virulence factors. Melanization is important for suppressing fungal spread. The Bomanin genes have an unknown function, and here the data suggest a reasonably convincing role for Toll in resilience. Overall the manuscript is thorough and presents a diversity of approaches that show Toll and the Bomanins in particular contribute to this resilience effect. The idea that Toll effectors are essential for resilience is interesting as other fly stress response pathways like JAK-STAT are better known for helping the fly cope with damages, while Toll is better known as an antifungal response.

      I believe the study, with some careful considerations added, would add a valuable series of observations to understanding how the host immune system promotes survival after infection. Overall I am quite positive about the results, and the authors have made a significant effort.

      We thank the reviewer for the positive evaluation of our work that actually spans many years of research on the Aspergillus fumigatus Drosophila infection model that is a major topic of our work at the Sino-French Hoffmann Institute of Guangzhou Medical University.

      Any experiment suggestions I make are strictly to improve the confidence in the interpretations of the results, but the language could alternately be softened to address those concerns. My major critique is that the authors repeatedly extend beyond what is shown, and occasionally in defiance of what is shown (if I understand the results correctly).

      We have chosen to perform additional experiments when needed. We have also clarified points where there were obvious misunderstandings by expanding our text that had been written under a very concise format.

      It is not thoroughly clear what the reviewer has in mind when using the word defiance. We suppose it refers to the work of Scott Lindsay with whom we are in contact. He actually attempted to monitor the C. glabrata burden but did not pursue this line of investigations as he already saw a difference after one hour and he thought that the Toll pathway cannot be induced so rapidly. Actually, David Duneau mentions a time of two to three hours for the Toll pathway to control E. faecalis infections (eLife, 2017) and Sandrine Uttenweiler-Joseph already saw by MALDI-TOF MS an induction of Bomanins and other DIMs at the earliest point tested, six hours (PhD thesis). There is absolutely no critique of the work of the Wasserman laboratory who has greatly contributed to our understanding of Bomanin functions. Some of our unpublished data clearly point out to an AMP role for at least one Bomanin gene against E. faecalis and we certainly do not exclude an AMP role for BomS against C. glabrata. This however does not dismiss the possibility that Bomanins may also have other roles in dealing with microbial toxins. We have been studying Candida infections in Drosophila for many years and have documented the host defense against C. glabrata (Quintin et al., JI, 2013). We do suspect that C. glabrata likely secretes virulence factors that have not been identified so far. We mention this as a possibility and certainly not as a truth. One should remember that investigators were unaware for a long period of the role of Candidalysin, a pore-forming toxin, in C. albicans infections.

      Finally, a dual role as AMP and protecting from secreted toxins has been clearly shown in the case of alpha-mammalian Defensins that we now are describing in our Revised Discussion (Kudryashova,Immunity, 2014).

      Comments below.

      Major comments:

      1) The language is too strong. Specifically the use of the phrase "anti-toxin" is too generalist, especially as the authors show that their candidate Bomanin does not bind to the toxin directly.

      We have checked all of the submitted documents: the term anti-toxin was never used (just found “anti” in antimicrobial, antifungal, antibiotics..), in this manuscript as well as in the companion article. and we have never excluded an indirect effect, quite to the contrary because of the in vitro experiment with restrictocin mentioned by the reviewer and other observations now included (see further below). We use the terms “protection” or “counteract”, which have not such a meaning. It is burdensome for the reader to read each time “counteract or protect from the actions of the toxins or the effects of the toxin.

      Instead, Toll mutants seem susceptible to damage/stress caused by injury/toxins. MyD88 even show general susceptibility to vehicle controls in Fig3C-D.

      The effects of stress related to the infection conditions and injury are clearly distinct from the much stronger ones exerted by the toxins themselves. As requested by the reviewer further below, we have submitted wild-type and immuno-deficient flies to several stresses such as heat or the injection of hydrogen peroxide or salt solution (Fig. EV8_B-E_). While the latter did not reveal any difference, MyD88 flies succumbed slightly faster to a strong 37°C stress; in contrast, they survived better to a 29°C exposure, the temperature at which we perform most experiments. However, the difference started to be visible only after some 15 days whereas the time frame in which flies succumb to A. fumigatus or toxin challenges is definitely much shorter by some 10 days. We also note that Bom__D__55C mutant flies behave like the isogenized wild-type controls in these assays, further excluding a potential role for general stress sensitivity as a contributor to the effect of toxins.

      As regards DMSO, there is indeed a general mild sensitivity of flies to DMSO, but not specifically affecting MyD88 mutant (Rebuttal Fig. 1J). We find that this effect is lessened when using thinner needles. Thus, the problem has become minor as we became more experienced. We had checked axenics- and antibiotics-treated flies to exclude a contribution from the microbiota. Finally, to uncouple the effects of verruculogen from those of DMSO, we have also challenged flies directly by introducing the powder, using a technique similar to that of the septic injury. While it is quantitatively less accurate, it clearly proves that verruculogen produces the reported effects (Fig. 3C) and was useful to measure Bom and Drosomycin expression by digital PCR in the heads of challenged flies, e.g., Fig. EV6_J-K_ and Figs EV_11&12_.

      Toll is important for development, so it may be expected that Toll flies could have development defects impacting resilience even if/when Toll flies can survive to adulthood. I don't say this to be too negative on the findings, which are quite convincing. But I am not sure that the phrase "anti-toxin" is right for what is shown.

      We fully agree with the reviewer on this point. We have failed to find RNAi lines that are efficient enough to mimic the Toll pathway phenotype when expressed ubiquitously at the adult stage. However, Bom__D__55C mutants do not seem to display a developmental phenotype and display a phenotype similar to that of MyD88 flies. Furthermore, our rescue experiments of the Bom__D__55C sensitivity phenotype to mycotoxin challenge is achieved by the overexpression of specific Boms that are induced only at the adult stage, making it unlikely that this sensitivity phenotype reflects a developmental problem, as had been shown to be the case for 18-wheeler that had initially been proposed to encode the IMD pathway receptor.

      A very interesting recent study shows Dif has a role in the synapse of neurons to protect from alcohol sensitivity. Could secreted Bomanins participate? This emphasizes a mechanism through which Toll mutants likely have defective neural development, which could make them stress response defective, especially to things like neurotoxins. See: https://pubmed.ncbi.nlm.nih.gov/35273084/

      We are aware of this study first presented at the 2019 Fly Meeting in Dallas and this author did discuss with the authors of the study. However, we have found that Dif (and Dorsal) mutants are not sensitive to A. fumigatus infections nor to injected mycotoxins, as was the case already for C. glabrata (Quintin et al., JI, 2013).

      Lin et al. (2019) also showed lack of Bomanin secretion from the fat body in Bombardier mutants causes loss of tolerance (resilience?). So does Bomanin disruption increase susceptibility to stresses more generally, rather than specifically fungal toxins? And is this a development role, rather than an immune response role?

      The authors could try to use other stresses (NaCl, oxygen, heat, alcohol) to test the contribution of Bomanins to this resilience, which may reflect defective neural development rather than a role for secreted systemic immune-response peptides.

      Please, see replies above.

      2) The authors present a paradox. On the one hand, A. fumigatus hardly induces Drs/Bomanins (Fig. S1). Yet on the other, they propose that inducible Bomanins protect the fly from mycotoxins. Why do the authors say Toll is hardly induced by A. fumigatus at the start of the study (Fig S1), but later use the same data to argue that Bomanin induction underlies the resilience phenotype (Fig5).

      The reviewer raises an interesting point. Of note, we have added new data in Fig. EV2_B_ that document that all 55C Bomanin genes, BomS4-_excepted, are induced by a systemic infection. There is indeed somewhat of a paradox. The _Bom__D__55C deletion phenotype clearly establishes that Bomanins play a major role in the protection against mycotoxins and A. fumigatus. The rescue experiments rely on ectopic expression and therefore establish that specific Bomanins can mediate the protective effect. Our data on verruculogen suggest that there might be local inductions, e. g., in the head of BomS6 and BomS4. The brain represents a compartment that is separated from the hemocoel by the blood-brain-barrier. We have not been able to generate BomS6 null mutants so far. In this case, the relevant response may not be systemic. We only detect a weak signal for BomS peptides in the hemolymph of unchallenged flies, making it unlikely that a basal expression is important, at least as regards a systemic infection. We cannot however exclude local inductions at the level of tissues. This would not rely on hemocytes as “hemoless” flies are not susceptible to A. fumigatus or toxin challenges. This topic definitely warrants further investigations.

      In Fig 5, it looks like DMSO is nearly identical to A. fumigatus, so can the authors really suggest that equal induction to DMSO is relevant?

      We had stated that an induction of the Bomanins by the injection of DMSO alone precluded us from analyzing the effects of verruculogen on Bom gene expression. We have now bypassed this difficulty through direct challenges by the undissolved powder (Fig. 6_J-K,_ Fig. EV11).

      The authors' discussion of these points would benefit from considering Vaz et al. (2019; Cell Rep) to frame how much PAMP is injected given equal numbers of fungal cells vs. bacterial cells. To me the lower induction by injecting a few fungal cells with much lower surface area to volume ratio means equal microbe mass has exponentially less PAMP in fungal conidia cell walls (2-3um diameter) vs. equal mass of bacteria (0.5-1um diameter).

      We fully agree with the reviewer and now mention that C. glabrata also led to a milder induction of the Toll-mediated humoral response (Quintin et al. JI, 2013). In addition, it has been shown previously that ß-(1-3)-glucans, which are sensed by GNBP3 in Drosophila (Gottar et al., Cell, 2006), are concealed by the cell wall (germinating conidia) or hydrophobins (Wheeler et al., PLoS Pathogens, 2006; Aimanianda et al., Nature, 2009) . In the case of yeasts, these glucans are accessible only at the budding scar (Gantner et al., EMBO J., 2003).

      Fig S1O is not convincing that Boms alone are present. There is significant noise near Drs in FigS1 infected, which likely saturates the detector before Drs can fly to it. I say this because DIM4 (Daisho) indicates that Toll is strongly induced. The authors should show a larger mass range on the x-axis including peaks of other Toll-induced peptides like the BaramicinA DIM10, DIM12 and DIM13 peptides of their companion paper and DIM14 (Daisho), which are closer in mass to the Bomanins and less likely disrupted by the noise at 4300 m/z. The maldi-tof calibration to correct ranges is critical for arguments of quantification.

      We provide the primary data in the Rebuttal figures at the end of this document. These are the results obtained from three single flies (Files A29683PBUG22, A29684PBUG23 and A29684PBUG24). The first three spectra correspond to the full scale based on the major peaks observed (DIM4/BomS5) in two out of three spectra. At this scale, no signal is visible for Drosomycin at 4891 and the “noise” at 4278 is modest. Next, the multi-spectra report allows to put all three samples on the same sheet, this time zooming on the peaks of interests in the region 4300 (“noise”) and 4891 (Drosomycin). Finally, the next two pages zoom in on the BomS peptide signals and the next page keeps the same scale to document the 4300-5000 region. On the last page, it is obvious that the signal around 4300 is very modest and too distant to influence the Drosomycin ion, thereby excluding any effect of suppression. Of note, in the systemic immune response, Drosomycin is the most induced AMP with a concentration estimated to be around 0.3µM, an order of magnitude higher than other AMPs. Finally, these experiments have been performed by PB who initially developed the technique (Uttenweiler-Joseph, PNAS, 1998) and has been using and developing it ever since.

      Combined with comments in Major Concern 1, I am not convinced that the -inducible- Bomanin response mediates the resilience phenotype.

      Besides our replies above, we do hope that the new data we have included in Fig. 6 that document an induction of only two BomS genes in the heads of Drosophila upon verruculogen and the finding that BomS6 expression in the nervous system protects the fly from the effects of verruculogen will convince this reviewer.

      3) The author's language is very strong to disregard a possible antimicrobial activity.

      As noted above, this is a misunderstanding that we hope is dispelled in the revised discussion (see also above and replies to Reviewer 1).

      Previous studies showed increased Candida growth and decreased hemolymph killing activity in Bom55C flies (Lindsay et al. 2018 and Hanson et al. 2019).

      Please, see reply above. Factually, Lindsay et al. did not study the C. glabrata titer in vivo but using collected hemolymph. The killing activity likely requires a cofactor regulated by the Toll pathway. Hanson investigated the burden of the dimorphic C. albicans pathogen that in flies is filamentous and not C. glabrata.

      Also see minor concern (i).<br /> I grant that the data are consistent with a resilience role. However the authors found no binding of Bomanin to restrictocin, countering their idea of a -direct- anti-toxin effect.

      We are surprised by this comment. We certainly did not favor this idea nor developed it in the original manuscript, even though we cannot formally exclude it at this stage. Future experiments will focus on BomS6 potential interactions with these two mycotoxins.

      At present the authors cannot rule out a direct antimicrobial role, or even the possibility of two different roles for the same peptides (ex: one in resilience, one antimicrobial). For instance, it is difficult to explain the loss of killing activity of Bom-deficient hemolymph ex vivo from Lindsay et al. if Bomanins are strictly anti-toxins. Surely they must also do something generalist?

      Please, see our replies above and the paragraph dedicated to this topic in the Discussion.

      4) In most figures, the authors do not compare flies with shared genetic backgrounds.

      The MyD88 allele we are using is a transposon insertion from the Exelixis collection and we are using the wA5001 strain that was used to generate the collection of insertion (Thibault et al., Nat. Genetics 2004). We thank the reviewer for this comment as we realized we had forgotten to mention the Bom__D__55C strain. Lines 603-604 state that the deficiency line has been isogenized in the wA5001 background.

      The phenotypes are usually strong so I am not concerned.

      However the rescue effect of Bom transgenes in Fig 5C-D is based on smaller differences. Were these genetic backgrounds controlled?

      Yes, as much as we reasonably could. The fact that most BomS transgenes did not rescue gives further confidence in the data.

      Were transgenes inserted at the same site?

      We used the strategy for overexpression developed by the Basler laboratory (Bishof et al., Development 2013, Nat. Protocols 2014) that relies on insertions at the same site.

      The authors seemingly used a heat shock to express transgenes.

      Heat-shocks are usually a short exposure to higher temperatures, usually 37°C. Here, we have used the inducible Gal4-Gal80ts system developed by McGuire and Davis (Trends in Genetics, 2004). The Gal80 repressor inhibits Gal4 function at the permissive temperature (18°C) and becomes inactive at the restrictive temperature (29°C). Thus, we use a temperature shift and not a bona fide heat shock.

      Given a resilience effect is being studied, this heat stress approach is sub-optimal. Earlier experiments showing effect/no effect of Bomanin on heat shock resilience would improve confidence here. I would recommend assaying temperatures that can kill wild-type in order to confirm that Bom do not succumb earlier (ex. up to 37'C).

      The results have been discussed above and show that 29°C is not a concern for Bom__D__55C and not much of a significant problem as regards MyD88.

      In Fig5C the time resolution is poor, and the effect inconsistent across Bomanins. What are the differences in the Bomanins that the authors suspect could cause this? And how consistent are the experiments?

      We provide all the primary survival data in Rebuttal Fig.1 A-H. The partial protection effects of BomBc1, BomS3 and BomS6 against restrictocin are consistent in the three independent experiments (Fig. 5D and Rebuttal Fig. 1 A-B). As regards the seven independent experiments performed with verruculogen, we observed a strong protection conferred by BomS6 expression in six experiments whereas we detected a milder protection conferred by BomS1 in four out of seven experiments and no protection in the three other ones. The effects were always there after 24 hours, in keeping with our novel data showing that BomS6 expression allows a faster recovery, around 10 hours, from verruculogen-induced tremors (Fig. 6E-F).

      Since the effect is finished by 24h, perhaps a boxplot of percent survival at this time would better show the consistency across experiments.

      Given the argument presented just above and considering that this rebuttal letter will be published alongside the article, this may not be needed.

      Minor concerns:

      i) The authors say the fungal burden of Bom55C flies remains low in Fig 5B, but they never measure flies that are near death when fungal load is greatest, or FLUD like in other figures. Given low mortality at the following time points, it seems likely that A. fumigatus would grow beyond initial loads in those individuals and kill them. I grant that these loads are less than what is seen in Hayan mutants. I just might suggest a more careful consideration of the time points used and what can be said about the trends shown here.

      This is certainly a relevant point. The FLUD data are now presented in Fig. EV8_A_ and do not reveal any additional growth.

      ii) Could the authors comment somewhere about the levels of toxin they were required to inject to get a phenotype vs. the level of toxins the authors expect are found in the fly during infection? I appreciate that toxin injection likely requires much higher doses, but it would be good to know just how far the authors have pushed their experimental system beyond its natural range.

      This a question that is difficult to answer accurately as we are not sure the techniques exist to measure toxin levels in these small flies. We have tested a range of concentrations. It is clear that we push the system and likely use concentrations that are higher than those actually secreted by A. fumigatus during infection. Indeed, the mutant strains defective for the production of verruculogen or restrictocin display only a mildly reduced virulence in MyD88 flies. This makes it even more remarkable that wild-type flies are able to withstand these high, unphysiological concentrations, an argument for an indirect effect independent of the dose as pointed out now in the Discussion. How fungal pathogens balance the expression of the hundreds of secreted virulence factors, proteins and secondary metabolites, is a major frontier for future investigations be them plant or animal pathogenic fungi/

      Again regarding toxins vs. general stresses, one could manage to inject salt into the hemolymph and show a stress-sensitized fly would succumb at lower doses than wild-type, emphasizing the relevance of defining concentrations.

      We feel that just monitoring the survival of flies after a challenge that produces an effect is sufficient (Fig. EV8_C_).

      The authors could also write toxin concentrations clearly in the figure/legend per experiment.

      Corrected.

      iii) Throughout the manuscript, the order that figures/panels are cited is inconsistent. Perhaps the text could be re-written so the reader can follow the figures more intuitively while going through the text?

      Corrected.

      iv) There are a few points where run-on sentences, involving many commas, make it hard to follow the logic. I might suggest a careful reading to break up long sentences into two sentences to ensure clarity.

      We hope to have addressed this concern.

      v) Line 279-281: this is the first and only mention of the immune surveillance hypothesis in nematodes. This is strange, given the authors are effectively describing an analogous idea exists in flies? Perhaps this could be added somewhere in the introduction or discussion.

      We have followed the advice of the reviewers and now discuss this point more fully in the Discussion under its own subheading.

      Small points

      • What timepoints are the gene expression data from? Could the authors indicate this in figures/legends?

      Done

      • Line 133-135: "We conclude that MyD88 flies succumb to a low A. fumigatus burden..." - could the authors cite a figure panel here to emphasize what evidence they're referring to.

      Done

      • Line 151-152: Dudzic et al. (2019- Cell Reports Figure 3) showed that PPO2 was regulated by Hayan, while PPO1 by Sp7. This relevant study should be cited here or in the introduction/discussion.

      Excellent suggestion, this was indeed an important study. Done

      • Line 179-180: could the authors define the gliotoxin mutant strain here in the text for clarity?

      Done

      • Line 196: Fig. 4A-B should be Fig. **S4 A-B?

      Corrected.

      • Fig4A: perhaps the authors could reduce the x-axis to focus on the early time points? If I understand correctly, aspf1 has slightly delayed killing compared to akuB (˜50% difference at 2 days), but both kill 100% by 3 days.

      Done

      • Fig4G: can the authors define the GFP transgene on pg10? Not clear what this is, or what this means. Brain? Fat body? The legend of Fig4G and the key in the top left... it's not easy to quickly understand what is shown in Fig4G.

      Done

      • Line 247: I would drop the "at the intracellular level" part. I'm not sure this is robustly shown given the use of an in vitro model where there is no closed extracellular environment. The data are convincing of the effect, this is just a semantic point.

      We agree that there is no closed extracellular environment and that therefore any signal emitted by the cells might get too diluted. However, the addition of EGF will activate the Toll intracellular through the chimeric EGFR-Toll receptor. As restrictocin is known to act intracellularly, one might have though that there might be some intracellular effectors mediating the Toll-dependent protection against restrictocin. Our sentence excludes this possibility.

      • Line 257-258: Cohen et al. (2020- Front Imm) never used Bomanin mutants. Did the authors mean to cite Hanson et al. (2019) here, which seems to fit their described citation re: Bom55C vs. Toll mutant flies (Fig. 2)? Given Hanson et al. infected Toll mutant and Bom55C flies with many bacteria/fungi including A. fumigatus, it's strange this study is not discussed currently.

      The reviewer is correct. Cohen et al. did use A. fumigatus, but on Daisho mutants and MyD88 and not Bom__D__55C as a control. We are now citing Hanson et al., 2019 in lines 443-449 (Discussion).

      • Fig5C-D: the labeling is difficult to follow.

      This is difficult to address unless multiplying EV figures. We feel this is not needed: the important curves are in color and each such curve is seen on the graphs.

      • Line 318: a -possible- AMP role of Bomanins was proposed because of the aforementioned killing activity of wt but not Bom mutant hemolymph, alongside rescue by single Bom genes. To say this was based only on survival experiments is incorrect.

      The paragraph has been rewritten and expanded to dispel any misunderstanding.

      • Line 324-328: could the authors cite appropriate references after "inhibition of calcium-activated K+ channels" ?

      Done

      • comment re-Line 334: Toll10b flies have melanotic tumors and are in general in a stressed state. Might their rescue be due to increased stress tolerance by pre-activated stress responses?

      This is a developmental effect occurring during larval stages, also observed for Cactus mutants. Here, we use a UAS-Tl10B transgene that is induced only at the adult stage using the Gal4-Gal80ts system. Thus, any stress is minimized as much as possible. Furthermore, we can phenocopy this phenotype to a large extent using a UAS-BomS6 driver, even though the phenotypes are subtly different as regards the protection against verruculogen-induced tremors.

      Referees cross-commenting

      Yes I agree that the data themselves are not the issue, nor even the direction of the results. But there are many overly-strong statements that go so far as to refute ideas which are supported by other studies, and for which the authors here do not provide any contradictory evidence.

      We hope that this revised, extended version has clarified any misunderstanding in the initial version.

      As per my review, I would be happy with a re-write that softened the language overall. I genuinely wonder if these Bomanin mutants simply have poor development, and so they are susceptible to neurotoxins/stress because their nervous system/development leaves them less resilient in general. Experiments testing their resilience to different stresses would greatly elevate the ability to make confident insights in the present manuscript. Currently the authors have only investigated one type of phenotype and interpreted it as if that is evidence of the evolved purpose of the peptides. This approach does not account for many other possible (and reasonable) explanations.

      We have performed the experiments suggested by the reviewer. While we see a modest effect of heat on MyD88, it is not found in Bom__D55C flies, which display essentially the same phenotype as MyD88 with regards to the sensitivity to A. fumigatus or some of its secreted mycotoxins_._

      Reviewer #2 (Significance):

      This paper should be of broad interest to the study of immunology, where roles for effectors are typically thought of as cytokines. In fruit flies and other invertebrates that lack adaptive immunity, immune effectors are more thought of as direct actors likely with antimicrobial properties. The finding that Toll might mediate resilience is interesting, and implicating well known Toll effectors provides an important step forward towards a mechanistic basis behind this resilience effect.

      We thank the reviewer for his appraisal of the significance of our work.

      My expertise is in insect and innate immunity.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors have set out to study the Drosophila immune response against the fungus Aspergillus fumigatus. They found that Aspergillus fumigatus kills Drosophila Toll pathway mutants. The fungus does this without invasion because its dissemination is blocked by melanization. They suggest that there is a role for Toll in host defense distinct from resistance. The findings are interesting, and looks like the mycotoxins play a role. It also seems that there is some role of the Bomanins here, but I find that in particular Figure4 experiments are not convincing enough to provide a mechanistic insight as to what is going on. I think the authors need to think through what their results mean, and also, explain better (especially regarding Fig 4) their ideas and how the data fits them.

      Major comments:

      Page 5: .."the fungal burden did not increase much in MyD88 flies challenged with 50 conidia (Fig. 1B)" - What do you mean did not increase much? There is a clear increase in Myd88 mutants compared to controls; would you expect a bigger increase (e.g. log scale induction)? Explain.

      Page 6: "the SPZ/Toll/MyD88 cassette is required for host defense against A. fumigatus infections, even though this pathogen only mildly stimulates the Toll pathway." - Should you rather say that A. fumigatus only mildly induces the Toll pathway target gene Drosomycin?

      Page 6: "...we tested Hayan mutant flies defective for this arm of innate immunity (Nam et al., 2012)." - elaborate this, which arm/which pathway? This has not been introduced in the introduction. Explain. Can you really draw this conclusion: "We conclude that melanization limits the proliferation and the dissemination of A. fumigatus injected into wild-type flies yet does not eradicate it at the injection site, where a melanization plug forms." Maybe you can based on the function/importance of the pathway to melanization, but you need to explain.

      Page 10: "The cleavage of the 18S RNA was however much less pronounced in wild-type flies as compared to MyD88" - I am not sure what this means. Do you mean 28S? And that the 28S peak is lower? Is this a quantitative method? Fig. legend: "Arrows show the position of the 28S RNA sarcin fragment" - there are three arrows in both Fig 4E and F; specify which arrows point what.<br /> Based on the results, I am not convinced about the conclusion, that "restrictocin is able to inhibit translation to a detectable degree in vivo, likely through the cleavage of the ribosomal 28S a-sarcin/ricin loop as described in vitro." <- Do you draw this conclusion before doing the actual in vitro experiment, which is described next in the text (The rabbit reticulocute assay, S2 cells)?

      4H: Not sure what should be seen here, is it the darkest band at 0 uM that disappears? HI & J need more explanation than what is now included in the text or Figure legend, is the conclusion that there is no difference? Write the stats above the Figs 4I & J (n.s.?).

      Minor comments:

      It would have helped commenting if the manuscript contained line numbers

      Why do you have the title "Hayan" on top of Fig 1F; you don't have this marking system in the other survival curves

      Fig 2A: Can you speculate why MyD88 flies die rapidly at day 10 if you inject PBST (your control)? What would happen to uninjected controls in otherwise the same conditions? (you could include an uninjected control here?)

      Fig 2E: Not sure what would be the best way of presenting the curves - different colors, dotted lines or something? Now if there are too many lines, they are hard to tell apart. because the symbols are not that visible. Like in 2E if you want to compare the light red/orange colored lines.

      For consistency add the caption also to Fig 3D (I assume it is the same as 3C)

      For consistency, should you add Verruculogen on top of Fig 3F?

      Chronologically, how it is explained in the text, Figs 4A and B are in the wrong order.

      The quality of Fig 4 is not great, the text is hard to read (too small) and becomes blurry upon magnification.

      Page 12; "These data then suggest that a process akin to the immune surveillance of core cellular processes first described in C. elegans may also exist in Drosophila" - I think this sentence belongs to the discussion, this is not directly drawn from the results.

      Referees cross-commenting

      I think we share many thoughts among all the reviewers. The main problem is that the manuscript language is quite strong; from the results many times it is not ok to make such strong statements. Some experiments need further analysis and clarification. I think in most cases, this could be achieved by softening the statements and adding more discussion, and not by making new experiments (some may be needed).

      Minor things are that experiments are not advancing in a logical order between the text and the figures and there are problems with resolution in some figures. Statistics in some figures needs to be added.

      Significance

      The nature of the work is conceptual for the field, to understand the role of the Toll pathway and Bomanins in particular, in this fungal infection model. The work is interesting to a somewhat limited audience, mainly immunologists and in particular, people interested in the Drosophila model for immunity. The work may be interesting conceptually in understanding fungal infections.

      My expertise: I am a Drosophila immunity researcher with nearly 20 years of experience in working with fly immunity, in particular the Toll and the Imd pathways.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their time in evaluating of our manuscript and for the useful feedback. We are grateful that reviewers acknowledged that our study is important because it “sheds much needed light on this less documented early stage of cancer development”. The reviewers were overall positive in their assessment and, as reviewer #3 noted, our study “advances this field conceptually by highlighting the importance of targeting the cell signaling and chromatin regulation together”. The common criticism of all reviewers relates to writing style, some textual interpretation and ensuring that the number of replicates, statistical analysis, and cell culture type were appropriately mentioned. We felt these were valid points and have taken onboard all these comments. A shared concern between two of the reviewers was related to the logic behind the timepoints we chose to analyse cells in the different assays. We are confident that we have addressed this, and all other comments as detailed below.<br /> Please find below a point-by-point reply to the reviewer’s comments.

      Reviewer #1 (Evidence, reproducibility and clarity):

      This study aimed to identify events that happens early in malignant transformation of breast cancer (BC) cells that are driven by HER2 oncogene. Constructing a 3D inducible model to study impact of HER2 protein level on BC cell and assessment of gross morphological changes, protein phosphorylation and chromatin accessibility at different time points of HER2 activation.

      Using a controllable in vitro model is a good approach although it is not novel. Also the method used to assess HER2 protein positivity is not standardized nor clinically relevant. Positivity of HER2 in clinical practice is assessed either through immunohistochemistry (IHC 3+ or 2+ with gene amplification), however the author did not mention any control for positivity except western blot which is not used in clinical practice.

      We agree with the reviewer that we should have included our comparison of HER2 protein levels for our cells with a positive control. We have tested this, and the data will be included in the revised version of the manuscript. Briefly, both western blot (WB) and IHC are very useful methods with different benefits: WB is less cost effective but more quantitative, while IHC gives a better overview of tissue heterogeneity. Indeed, due to higher sample processing costs, WB is not used in clinical practice to assess HER2 but it has been shown that there is a high concordance (in 95% of over 300 tumours analysed) between the two methods as both techniques showed prognostic significance R. Molina et al., 1992 (PMID: 1363511). We performed comparison of HER2 protein expression levels of our subpopulations (low, medium, and high HER2 expressing cells) versus two patients’ samples that were already known to be HER2 positive using IHC 3+ or 2+. We were able to demonstrate that HER2 protein levels as measured by western blotting showed that the low HER2 expressing cells expressed less HER2 protein compared to IHC 3+ or 2+ and may be comparable to patients with IHC 1+, which are considered HER2 negative and do not qualify for anti-HER2 therapies such as Trastuzumab.

      There is difference between early HER2 positive BC and HER2 low BC. As the earlier is driven by HER2 oncogenic signalling pathway, but the latter is not.<br /> Identification of molecular changes that occur at HER2 low BC seems very important and clinically relevant, however HER 2 low is not fully characterized, yet. And the only definition available is either HER2 1+ or 2+ without gene amplification. The author was not very clear about threshold he followed to call the model HER2 low. Is it positive with lower limit of positivity or just small amount of protein). He also concluded that BC with sub-threshold of HER2 protein behave more aggressive than HER2 positive BC. What is the threshold and was it correlated with IHC or gene amplification level to be reliable?

      The HER2 positive population in our in vitro inducible system was determined by flow cytometry, we separated the overall (bulk) HER2 positive cells into three different subpopulations and selected the bottom 20% of HER2 expressing cells as the “low HER2” and the top 20% of HER2 expressing cells as “high HER2”. We show in figure 4C the different thresholds for low, med, high HER2 protein expression by flow cytometry. We have modified the figure and the figure legend (figure 4C) to better indicate the different subpopulations. Through western blotting we compared these population of cells with patients’ samples that had IHC 3+ or 2+ and showed that low HER2 population expressed less protein than IHC 2+, whereas the high HER2 was relatively comparable to IHC 3+ sample.

      The status of oestrogen and progesterone receptors were not highlighted. Triple negative breast cancer, for instance, is more aggressive than HER2 positive BC, this may be the reason for the worse behaviour.

      We have modified our main text in the manuscript, line 68-69, to better reflect the fact the MCF10A cells are both oestrogen (ER) and progesterone (PR) negative, this has been already characterised by Qu, Y et al., 2015 (PMID: 26147507). However, importantly, we do not think that ER and PR status is the reason these cells are relatively more aggressive, as normal MCF10A cells without HER2 expression did not display any transformative characteristics in our molecular analysis and/or in vitro functional assays, despite being ER and PR negative.

      At line 130, "The low levels of HER2 protein activation at early time point may closely mimic at least partially the signalling changes occurring in HER2 positive BC patients". This claim is not quite true, as low levels of HER2 protein activation doesn't activate HER2 oncogenic signalling pathway as HER2 positive does.

      We thank you for this insightful comment, we have modified our main text to better reflect our view (line 132-133). However, we were not sure which published data the reviewer was referring to in this case. In particular, if low HER2 levels can still form dimerisation with its family members and induce signalling via its family partners such as HER1, HER3 or HER4.

      The author aimed to study the signalling changes accompanying low levels of HER2 induction by lowering significance threshold to log2fold > 0.5. Lowering the threshold for significance will increase the total number of phosphorylated protein (both at low HER2 levels and high levels). So, studying the whole significant proteins at whole time points will not be exclusive for low HER2 levels and this was evident through activation of MAPK cascade which is one of downstream signalling pathway of HER2 positive BC.

      We agree that a log2fold change > 0.5 would increase the total number of significantly phosphorylated proteins. We first performed the analysis on a more stringent cut-off value of log2fold change > 1.5 p-value, <0.05 as shown in figure 2B. In the supplementary we also show the reduced stringency of log2fold change > 0.5, p-value <0.05, for the following reasons: when it comes to proteins, it is conceivable that a log2fold change > 0.5 is sufficient to induce molecular changes; secondly, our study investigates changes that occur just half an hour, and up to 7 hours, after HER2 protein induction. At such early time-points, proteins would be beginning to be phosphorylated and the extent of it may not be pronounced (especially in a small subset of the population); finally, we thought it is important to share this supplementary analysis with the scientific community to have access to this data so that they may further interrogate it from different perspectives.

      Combining HER2 protein level (both IHC and Western blot) to different time points will give better understanding of events associated with HER2 low, early positive or late positive.

      As above, IHC is routinely performed for clinical diagnosis because it is cost effective. Although, western blotting is laborious and expensive, it is more quantitative compared to IHC.

      Reviewer #1 (Significance):

      This work provides good evidence to changes that happen at early HER2 positive breast cancer transformation and introducing a chromatin opening and accessibility as a new target of treatment of HER2 positive breast cancer patients.

      We thank reviewer #1 for their thoughtful feedback and for their appreciation of our work.

      Reviewer #2 (Evidence, reproducibility and clarity):

      HER2 amplification is associated with poor prognosis of breast cancer. Despite it has been extensively studied, it deserves thorough study how HER2 amplification alters downstream signaling pathways, chromatin structure and gene expression, and how cells overcome the hurdles in order to transform. In this study, Hayat et al used doxycycline-induced HER2 expression in MCF10A cells to recapitulate the very early stage of HER2 expression and HER2-induced mammary epithelial cell transformation. The authors performed global phosphoproteomic, ATAC-seq and single-cell RNA-seq, and propose sub-threshold low level HER2 expression activates signaling pathways and increases chromatin accessibility required for cell transformation, while high HER2 expression level in early stages results in decreased chromatin accessibility.

      Major comments:<br /> 1. Although it is not clearly described, it seems that phosphoproteomic and single-cell RNA-seq were performed using 2D-cultured cells, while ATAC-seq was performed using 2D (FACS sorted cells based on HER2 expression levels) or 3D (time course)-cultured cells. Cells cultured on 2D and 3D are significantly different on cell signaling, chromatin structure and gene expression, and therefore cannot be compared.

      We agree that there are differences between 2D and 3D cell cultures, which may impact on the multi-omics experiments performed in this study. In an ideal world we would have preferred to be able to conduct all experiments in 3D cell cultures, including the phosphoproteomics experiments. However, this is not feasible because the phosphoproteomics experiment requires 500ug of total protein which corresponds to approximately 10 million cells for each condition and replicates in 3D matrices. 3D structures would have also presented with accessibility issues since doxycycline might not have reached all cells equally at the 30 minutes timepoint. Since we were analysing early timepoints for phosphoproteomics, homogeneity in induction was important. We performed ATAC-seq in 3D cell culture because it was feasible as it only required 25,000-50,000 cells to be grown in small 3D cell cultures and is indeed superior for physiological relevance. We therefore had to compromise and worked with the assumption that immediate signaling events will not be fundamentally different in 2D vs 3D. We have modified the main text to better reflect this and have indicated which experiments were performed in 2D vs 3D in the figure legends and the methods section.

      1. Phosphoproteomic (0.5, 4 and 7 hours), ATAC-seq (1, 4, 7, 24 and 48 hours) and single-cell RNA-seq (7, 24, 48 and 72 hours) were performed on cells at different time points after doxycycline treatment. The authors need to clearly explain the rationale why such time points were chosen for each experiment in the text.

      There are indeed differences in the time-points analysed between the different multi-omics analysis. However, as mentioned above, the reason for selecting such early time points for the phosphoproteomic experiment was that signalling changes are rapid and we were focused on characterising the early signalling dynamics. With regards to the ATAC-seq and scRNA-seq, there are several shared time-points such as the 7h, 24h, and the 48h. Additionally, as the chromatin changes would be slower acting as compared to signalling changes, two later time-points were selected including the 48h (ATAC-seq) and 72h (scRNA-seq) to capture some late changes during cellular transformation.

      1. Change on chromatin accessibility does not necessarily mean change on gene expression levels. RNA-seq needs to be performed and analyzed along with ATAC-seq data.

      We agree that chromatin accessibility does not necessarily correlates with gene expression changes and the need to perform RNA-seq to make such a conclusion. This is the reason we performed single cell RNA-seq, which looks at changes in high temporal and cellular resolution. This is particularly useful for the heterogenous cell population that we worked in to better understand the differences between cell types.

      1. Analyses on multi-omics data are quite preliminary. Clustering analysis on the time course of phosphoproteomic, ATAC-seq and single-cell RNA-seq will help characterize the dynamics of cell signaling and gene expression. Integrated analyses on multi-omics data and construction of regulatory network are necessary to identify the key signaling node and key epigenetic regulators/machinery that facilitate or prevent cell transformation. Integrated analyses, of course, need to be performed on data obtained from cells cultured in the same conditions.

      We think our study is an important work and provides a strong foundation for a comprehensive, integrative multi-omics study using primary human breast cells with parallel analysis performed on the same population of cells using the latest techniques such as scATAC and RNA-seq or scNMT-seq. We are indeed in the process to apply for funding in a larger analysis that involves in vivo work and clinical samples, using this study as a foundation.

      1. The authors picked up several genes from the analyses, and discussed the potential importance in cell transformation without functional validation. It is important to show data demonstrating altered expression of certain genes and/or altered activity of certain signaling pathway/epigenetic regulators is indeed important for cell transformation in low HER2-expressing condition or preventing cell transformation in high HER2-expressing condition.

      We agree that this is important. The scope of this study is to report on the result that low HER2 was unexpectedly more aggressive compared to high HER2, which was a highly reproducible observation, and identified a molecular explanation for this behaviour (dedifferentiation and predominant chromatin opening). In terms of cross validation, we found the MUC1 protein expression to be low in low HER2 expressing cells, indicating that they are more stem-like (figure 4B). We confirmed and validated this finding in our scRNA-seq data shown in figure 4F. The pathway analysis from phosphoproteomic study shows that MAPK pathway is highly activated upon HER2 protein overexpression. To validate this claim, we performed western blotting analysis that confirm this as the ERK protein was hyperphosphorylated in HER2 expressing cells compared to controls. Thus, our resource study provides many candidates that can be tested to further explore the biology.

      1. HER2 expression in MCF10A cells is insufficient in inducing tumor formation in vivo, although HER2 expression results in disrupted acini structure and colony formation in vitro (e.g. Alajati et al. 2013 Cancer Res, 73:5320-5327 cited in the manuscript). It is interesting to investigate whether this is due to the mechanisms identified in this study.

      MFC10A cells are generally difficult to transform in vivo. It is possible that mechanisms identified in our study might be responsible for lower tumourigenicity in vivo with WT HER2 compared to HER2 variants, since our study suggests activated checkpoints in high HER2 cells. It would be interesting to compare the differential impact on chromatin for the two HER2 variants too. In our system, we think the reasons why cells form abnormal morphological changes and grow colonies in vitro is a result of HER2 overexpression, which induces aberrant signalling, and this may be leading to loss of cell-to-cell contact and disruption of adhesion molecules. However, the objective of this study was to understand the early signaling to chromatin changes in in vitro cellular transformation, and changes in cell morphology are a consequential part of the process.

      1. In Figure 2C, two replicates are completely separated and replicates of each time points are not clustered together.

      We agree that the two replicates are separated into two separate groups, this was demonstrated by the PCA analysis (Supplementary Fig 1F). We grouped these samples into “early” (0h, 1h, 4h, and 7h time-points) or “late” (24h and 48h time-points) based on them clustering well into these two groups. The subsequent analysis were performed based on these groups that clustered together. However, we still showed each replicate in figure 2C to appreciate the dynamics of chromatin accessibility between each time-point, which shows clear differences in HER2 versus Control.

      Minor comments:<br /> 1. Essential experimental information, e.g. whether cells were cultured in 2D or 3D, needs to be clearly and accurately described in main text, figure legends and experimental procedures.

      The figure legends in the manuscript have now been modified to include information on cell culture type.

      1. Statistic methods are not provided. In Fig. 4D, HER2-med and HER2-high need to be compared to HER2-low group.

      Statistical analyses have been added to figure 4D and HER2-med and HER2-high have been compared to HER2-low group.

      Reviewer #2 (Significance):

      The authors propose sub-threshold low level HER2 expression activates signaling pathways and increases chromatin accessibility, which facilitates mammary epithelial cell transformation, while high HER2 expression in early stages results in decreased chromatin accessibility via unknown feedback mechanisms. It is interesting to identify which signaling and epigenetic regulators are essential to cell transformation, which feedback mechanisms prevent the transformation of HER2-amplified mammary epithelial cells, whether inactivation of such feedback mechanism indeed occurs in tumorigenesis of HER2-amplified breast cancer, and whether it is a potential therapeutic target for HER2-amplified breast cancer.

      Expertise of review: breast cancer, cell signaling, tumor microenvironment.

      We thank reviewer #2 for their time and for providing such useful feedback on our work.

      Reviewer #3 (Evidence, reproducibility and clarity):

      In this paper Hayat et.al study the early transformational events that follow the activation of the oncogenic HER2 signaling pathway and its crosstalk with chromatin opening. Using an inducible in vitro model of HER2+ breast cancer they have identified that the overexpression of HER2 transforms non-tumorigenic breast epithelial cells via chromatin regulation. The study also shows that the transformative potential of the cells is inversely related their HER2 expression where the low HER2 expressing cells obtain a stem-cell like signature and increased chromatin accessibility leading to an increased transformative potential.

      Major comments:

      While the key conclusions of the paper are convincing, here are the parts of the study that need further clarification or supporting data from the authors.

      1. In Figure 1C the authors show that MCF10AHER2 cells formed complex transformed masses when grown in 3 dimensional cultures. From the figure it is evident that that the transformative potential of the HER overexpression is far more pronounced at the Day 6 and Day 9 mark. Therefore, one wonders why these time points weren’t used as the “late timepoint” in any of the sequencing studies moving forward. Can the authors comment on this choice and perform additional experiments to address the molecular changes that lead to the dramatic transformations seen at this timepoint? Since the authors have a well-established protocol in place, looking at an additional time point could be potentially feasible, provided the cells/samples have been frozen down at this stage. If unable to do so, could the authors comment on the molecular changes they would expect to see at this time point.

      In our study we primarily focused on the early events upon HER2 overexpression because the changes appear to be much more dynamic, and we hypothesised that these events are the cause of the subsequent, more pronounced featured later on. The rationale behind employing an inducible system and capturing the early changes was to identify aberrant molecular events at the earliest time possible. Indeed, numerous studies have investigated the differences between normal versus cancer cells (many of which are at later time points, that have missed the foremost aberrant molecular changes). Based on our ATAC-seq analysis at late-timepoints, 24h and 48h time-point, the number of changes in chromatin accessibility become relatively more stable as compared to early time points (supplementary figure 2A).

      1. Fig 1D the authors conclude that the overexpression of HER2 causes increased cell invasion based on the results seen in a collagen coated plate. How to the authors explain the lack of any such significant change in a Matrigel coated plate?

      To test the invasiveness of the HER2 overexpressing cells, collagen is used to increase stiffness to Matrigel. Stiffness is relevant for the type of invasion seen in these 3D cultures because it activates pathways important for invasion. We added the references to the text for clarity (PMID: 15838603 and PMID: 16472698).

      1. In Supp Fig 1D the authors use the DAVID Bioinformatics tool to identify the various signaling pathways enriched in the HER2 induced system. In addition to the MAPK pathway this analysis also shows other common cancer-related pathways (eg. The Mtor pathway) being enriched to a similar or higher extent. Can authors address why only the MAPK pathways was pursed in detail?

      HER2 is major receptor that can signal through various signalling pathways. We highlighted the MAPK pathway because it has been previously shown that MAPK cascades can modify chromatin through transcription factors and chromatin regulators Clayton and Mahadevan., 2010 (PMID:19948258). We think that when HER2 is overexpressed, it primarily signals down the MAPK pathway, resulting in the activation of transcription factors and chromatin regulators that lead to a highly accessible chromatin and ultimately contributes to transformation. To confirm this result, we did perform western blotting control analysis and found that indeed, HER2 overexpression consistently activates the MAPK pathway that shows phosphorylation of ERK but does not influence AKT phosphorylation. We can include this data in the manuscript.

      1. Figure 4B and supplementary figure 3E only show that percentage of the cells have either MUC1-ve or EpCAMlow or CD24low expression. However, Figure 4A and the corresponding text indicates that that breast stem cells are defined by a combination of MUC1-ve, EpCAMlow, and CD24low expression. If this is the case, the authors need to show the percentage of the cells within each population have an overlap of all these expression signatures, to support the claim of low HER2 expressing cells showing a more de-differentiated stem-cell like property.

      Our results confirm that upon HER2 overexpression, cells become MUC1-ve, EpCAMlow and CD24-ve, acquiring the breast stem cell signature. We did not show the CD24 expression because all the cells that were MUC1-ve and EpCAMlow were also 100% CD24-ve. We have now modified figure 4B and the figure legend to reflect this change, additionally, we added another figure (supplementary figure 4) that shows how the analysis was performed systematically.

      1. The authors also state 'other biological effects being responsible for the lower capacity in anchorage-independent growth of high HER2 expressing cells' that is shown in fig 4d. While an experimental investigation of these effects may be out of the scope of this study, the authors may consider commenting (and referencing additional literature) on the other biological effects they think may result in this phenomenon.

      We have modified the manuscript (lines 294-296) and added further explanation as to what other biological effects may be responsible for lack of colony growth in high HER2 expressing cells in lines.

      1. The authors do a great job providing details about all statistical analyses performed, however the details regarding the experimental replicates are only provided for some experiments making it difficult to infer if the experiments have been adequately replicated before concluding results. Can the authors please add the n - value for all applicable experiments in the figure legend or the methods section?

      The number of replicates has now been added to the respective figure legends.

      1. What is the scope for validation of these findings in vivo and in human samples? Could the authors please comment on this in the discussion section of the manuscript.

      The primary goal of this study was to understand the early transformational events in a simple in vitro, yet a robust model that is highly accessible. We have analysed some human samples to compare the HER2 protein expression levels. However, the findings from this manuscript could be validated in more precious models such as primary human cells, human tumours samples and in vivo in animals. We have modified the end of discussion to address these points (lines 394-399).

      Minor comments:

      1. In figure 1B the authors show a western blot analysis for HER2 expression over time while using GAPDH as a loading control. However, GADPH control seems to be unequal, especially in the 1ug/ml Dox lane. This needs to be addressed.

      We agree that there is a slight difference in the GAPDH levels in this western blot. We have carried out densitometry analysis which could be added to the supplementary data if required, to show that even though the GAPDH appears to be slightly less in the 1ug/ml of dox (last lane), it shows that HER2 levels are even greater than what appears on the blot, thereby confirming the trend we have observed in the current western blot.

      1. In figure 1C, it is unclear if the images shown are representative of the exact same spot over a 9-day period or of different spots.

      In figure 1C, the morphological regions are representative of the whole well in which the cells were growing but not the exact same spot. This is because nearly all the cells (>90%) transformed from round, organised acini to the fibroblastic, invasive morphology by day 9. We have captured multiple images of different areas in the well using confocal microscopy, and this can be added in the supplementary data.

      1. In Supplementary figure 3E, labeling the y-axis on the figure as opposed to just in the legends would make it easy for the reader.

      The figure has now been appropriately labelled.

      1. With respect to presentation: In figures involving single cell RNA sequencing and phosphoproteome analyses, highlighting the specific genes that are focused in detail on the manuscript would aid the reading process. The current format makes it difficult for the reader to spot the specific genes that are the points of focus within each heat map.

      We modified the figures concerning the phosphoproteomic analysis and scRNA-seq and have highlighted important genes for readers’ ease.

      Reviewer #3 (Significance):

      I have close to a decade's experience in working on breast cancer. In the past I focused on studying intratumor genetic heterogeneity and cell signaling pathway interactions. I am currently working on identifying novel therapeutic targets for the treatment of ER+ breast cancer. My expertise lies in understanding molecular biology of the disease. While I have worked with and understand most techniques used in this study, I would like to indicate that I do not have sufficient expertise in ATAC seq and am unable to evaluate the intricacies of this technique.

      While molecular changes that occur in HER2+ breast cancer have been highly investigated, the changes that occur at an early pre-cancerous stage of the disease aren't as well documented. The work by Hayat et al., sheds much needed light on this less documented early stage of cancer development. The past decade has shown an increased focus on epigenetic therapy with more chromatin targeting drugs entering clinic (Siklos et al., 2022). There has also been increased clinical evidence underlining the efficiency of combining epigenetic therapy and with hormonal and other anticancer therapies in solid tumors (Jin et al., 2021). Phase II clinical trials combining HDAC inhibitors with aromatase inhibitor have shown to improve clinical outcomes in patients (Yardley et al., 2013). Similarly, pre-clinical studies have shown that combination therapy with BET inhibitors improved treatment efficacy and circumvented drug resistance in fulvestrant (Feng et al., 2014) and everolimus (Bihani et al., 2015) treatments. Conclusions from the work by Hayat et.al, although based on in vitro analyses, advances this field conceptually by highlighting the importance of targeting the cell signaling and chromatin regulation together. If validated in in vivo models and clinical samples, this may open up potential possibilities of combining anti-HER2 therapies with epigenetic therapies. Additionally, the study also makes an interesting observation that low HER2 expression could result in increased tumorigenicity of cells which is in contrary to current clinical norm of looking at increased HER2 expression as a sign of aggressive disease. These findings are of interest to the scientific and clinical community working on discovering novel therapeutic targets and biomarkers for treatment of HER2+ breast cancer.

      We thank reviewer #3 for his/her overall assessment and for appreciating this work. There is a significant focus regarding low HER2 positive breast cancers in the field. Approximately 50-60% of breast cancers have "low" HER2 expression and in many cases, this low HER2 is seen together with metastatic cancer. The FDA has very recently approved fam-trastuzumab deruxtecan-nxki aka Enhertu, which appears to target these cancers with low HER2 well and is shown to be relatively effective in a phase 3 clinical trial known as Destiny Breast-04. However, it is not yet clear how low HER2 expressing cells drive the metastatic spread of breast cancers or why they are so aggressive. Our work sheds a light that increased chromatin accessibility could be a route of transformation in low HER2 cancers. Therefore, providing an alternative platform to target these cancers and why it is crucial that this work reaches the clinical and scientific community as soon as possible.

    1. Author Response

      Reviewer #1 (Public Review):

      Kang et al. have performed whole exome sequencing of gall bladder carcinomas and associated metastases, including analysis of rapid autopsy specimens in selected cases. They have also attempted to delineate patterns of clonal and subclonal evolution across this cohort. In cases where BilIN was identified, the authors show that subclones within these precursor lesions can expand and diversify to populate the primary tumor and metastatic sites. They also demonstrate subclonal variation and branching evolution across metastatic sites within the same patient, with the suggestion that multiple subclonal populations may metastasize together to seed different sites. Lastly, they highlight ERBB2 amplification as a recurrent event observed in gall bladder carcinomas.

      While these data add to the literature and start to examine important questions related to clonal evolution in a relatively rare malignancy, the authors' findings are very descriptive and it is hard to draw many generalizable conclusions from their data. In addition, the presentation of their figures is somewhat confusing and difficult to interpret. For example, they do not separate their clonal analyses by disease site and by time in a readily interpretable manner, as in some instances of Figure 2 and Figure 3 the clone maps are from different sites collected at the same time point, while others show some samples at different time points. Depicting these hierarchies in a more organized and clearly understandable manner would help readers more easily interpret the authors' findings. In addition, the clinical implications of these clonal hierarchies and their heterogeneity are unclear, as the authors do not relate the observed evolution to intervening therapies and may not be powered to do so with this dataset.

      Thank you for the constructive and valuable comments about 1) figures and 2) clinical implications.

      1) We agree with your opinion that Figures 2 and 3 are confusing. Reflecting on your comment, Figures 2 and 3 have been modified. Now, the time point at which the tissue was obtained and the anatomical location of the tissue are readily visible in the redesigned figures.

      2) From a clinical point of view, we believe that our study highlights the importance of precise genomic analysis of multi-regional and longitudinal samples in individual cancer patients. In the current oncology clinics, cancer panel data of patients are being used to identify druggable mutations usually with a single tumor sample. However, we found that only a part of the mutations was clonal while a substantial proportion was subclonal, which is usually not an effective druggable target. For example, in the GB-S2 patient, after sequencing with GB tissue, ERBB2 targeting treatment would have been performed if a specific clinical trial is available because ERBB2 p.V777L is pathogenic. However, our clonal evolution analysis suggests that ERBB2 targeting strategy may not be effective in subclones without the ERBB2 p.V777L mutation, especially from regional metastasis. We have added the description for this part to the Discussion section (Page 13, Line 12-15).

      Additional areas that would require clarification include:

      1) There are very few details on how the authors performed their subclone analysis to identify major subclones, and what each of the clusters in Supplemental Figure 1 represents. In addition, they do not describe how they determined that the highlighted mutations in Table 2 were drivers for metastasis and subclonal expansion. Were these the only genes that exhibited increased allele frequencies in metastatic sites, or were other statistical criteria used?

      Thank you for the important comment about 1) clone analysis and 2) highlighted mutations in Table 2.

      1) Mutations were timed as clonal or subclonal through PyClone (Roth A et al., Nat Methods. 2014) clustering (Figure 1—figure supplement 1). Phylogenetic trees were constructed using the mutation clusters identified with PyClone as an input of CITUP (Malikic S et al., Bioinformatics. 2015) (Figures 2 and 3). We added the sentence "See Supplementary File 1 to check the matching information for the PyClone clusters and the CITUP clones." to the supplementary figure legend.

      2) A full list of mutations constituting a CITUP clone can be found in Supplementary File 1. Among the mutations, previously reported cancer-associated genes harboring them were selected manually and listed in Table 2. References for each gene are introduced in the 'Evolutionary trajectories and expansion of subclones during regional and distant metastasis' section.

      2) The authors do not discuss the relevance of variation in mutational signatures observed with disease progression/metastasis, e.g., is there any significance that signature 22 (aristolochic acid) and signature 24 (aflatoxin) are increased in metastases? In addition, when comparing their data to previously published reports in Figure 1B and Figure 4A, it would be helpful if the authors discussed possible reasons for some of the large differences in mutational or signature frequencies across datasets. For example, do the authors think the frequency of ERBB2 alterations is so much higher in their cohort than in prior reports due to methodological/data reasons or due to differences in patient population?

      Thank you for the constructive and valuable comments about 1) mutational signatures observed with disease progression/metastasis and 2) differences in mutational or signature frequencies across datasets.

      1) During the revision process, signatures 22 and 24 highlighted in the metastasis stage were validated by two additional tools, Signal (Degasperi A et al., Nat Cancer. 2020) and MuSiCa (Diaz-Gay M et al., BMC Bioinformatics. 2018) (Figure 4—figure supplement 3). Aristolochic acid is an ingredient of oriental herbal medicine (Debelle FD et al., Kidney Int. 2008, Hoang ML et al., Sci Transl Med. 2013). Given that all the patients in our cohort are Korean, and a recent study found that Korean cancer patients are frequently exposed to herbal medicines (Kwon JH et al., Cancer Res Treat 2019), one possible explanation is that some patients might have been exposed to herbal remedies containing aristolochic acid. On the other hand, aflatoxin is known to be contained in soybean paste and soy sauce, which are widely used in Korean food (Ok HE et al., J Food Prot. 2007). Considering that the signatures 22 and 24 are found not in early carcinogenesis but in late carcinogenesis and metastasis (Figure 4B and Figure 4—figure supplement 3), the two carcinogens appear to have little impact on the early stage of cancer development, but their impacts might be highlighted in overt cancer cells. Further investigation is required because it is difficult to determine the etiology of signatures 22 and 24 with this limited patient data. We updated this part in the Discussion section (Page 13, Line 4-7).

      2) In the two previous genomics studies on GBAC, the prevalence of ERBB2 alteration was 7.9% (Narayan RR et al., Cancer. 2019) and 9.4% (Li M et al., Nat Genet. 2014), respectively. Compared with these data, our data is characterized by relatively higher ERBB2 alterations (54.5%: amplification in 27.3% and SNV in 27.3%) (Figure 1B). A higher prevalence of ERBB2 alteration was also reported in other studies on GBAC, with corresponding rates of 28.6% (amplification and overexpression, Nam AR et al., Oncotarget. 2016) and 36.4% (amplification only, Lin J et al., Nat Commun. 2021). The variations in ethnicity and culture might have contributed to the differences. This part is described in the Discussion section (Page 11, Line 19-23). In addition, the discrepancy in Figure 4A might be attributed to the difference in analyzed samples: our study included precancerous and metastatic lesions while the other two studies uniformly analyzed primary tumors.

      Reference for reply 1)

      • Kwon JH, Lee SC, Lee MA, Kim YJ, Kang JH, Kim JY, et al. Behaviors and Attitudes toward the Use of Complementary and Alternative Medicine among Korean Cancer Patients. Cancer Res Treat. 2019;51(3):851-60.

      3) The authors try to describe and draw conclusions about the possibility of metastasis to metastasis spread in p.6, lines 6-10 "In our study, of 7 patients with 2 or more metastatic lesions, evidence of metastasis-to-metastasis spread was found in 2 patients (28.6%). In GB-A1 (Figure 2A), it appears that CBD, omentum 1-2, mesentery, and abdominal wall 2-4 lesions may originate from abdominal wall 1 (old) rather than from primary GBAC considering clone F." The authors conclude here that the spread arose from abdominal wall 1, but this lesion is only separated from the CBD lesion by 1 month. There is no history given about whether this timing difference is significant or if it was simply due to clinically-driven differences in when each lesion was sampled. Given the proximity of the CBD lesion to the original gall bladder cancer, it seems just as likely that all of these distant lesions were seeded from the CBD lesion. If this is the case, the author's conclusion about "metastasis to metastasis" spread does not seem strongly supported. It would be helpful if the authors could clarify this point and/or provide additional data to strengthen this conclusion.

      We appreciate your valuable comment. As addressed above, the manuscript has been modified to reflect your comments.

      Reviewer #2 (Public Review):

      Minsu Kang et al. analyzed 11 patients with gallbladder adenocarcinoma using multi-point sampling. Mutational analysis revealed evolutional patterns during progression where the authors found metastasis-to-metastasis spread and the migration of a cluster of tumor cells are common in gallbladder adenocarcinomas. The signature analysis detected signatures 22 (aristolochic acid) and 24 (aflatoxin) in metastatic tumors. Overall, the analyses are well-performed using established algorithms. However, the manuscript is highly descriptive. Therefore, it is very difficult to understand what the novel findings are.

      Major comments

      1) The sections "Evolutionary trajectories and expansion of subclones during regional and distant metastasis", "Polyclonal metastasis and intermetastatic heterogeneity", "Mutational signatures during clonal evolution", and "Discussion" are highly descriptive which makes it difficult to understand what the novel and/or important findings are. Those sections would profit from reorganization.

      Thank you for the important comment. We have reorganized the manuscript according to your comments.

      1) In the "Evolutionary trajectories and expansion of subclones during regional and distant metastasis" section, unnecessary sentences have been removed and Figures 2 and 3 have been changed to make it simpler to understand how subclones spread during metastasis.

      2) In the "Polyclonal metastasis and intermetastatic heterogeneity" section, after receiving feedback on statements that were conflicting (Reviewer #1's comment 4), we clarified the statements and removed any other extraneous sentences. Figures 2 and 3 have been changed to make it simpler to understand polyclonal metastasis and intermetastatic heterogeneity.

      3) In the "Mutational signatures during clonal evolution" section, after receiving comments that Figures 4B and 4C were confusing (Essential Revisions #6), we moved Figure 4B to Figure 4—figure supplement 2. Unnecessary sentences have been removed. We emphasized signatures 22 and 24 highlighted during metastasis. This result was validated by using two additional tools, Signal (Degasperi A et al., Nat Cancer. 2020) and MuSiCa (Diaz-Gay M et al., BMC Bioinformatics. 2018).

      4) In the Discussion section, duplicate descriptions and unnecessary extraneous explanations have been deleted. We emphasized that whereas aflatoxin and aristolochic acid had little impact on early cancer formation, their impacts could be more clearly seen in cancer cells that had already manifested (Page 13 Line 2-7). In addition, the limitations of the NGS test currently used in the clinical field were pointed out, and the clinical significance of this study was described (Page 13 Line 8-16).

      2) What would enhance this paper is more of a connection between the bioinformatics analysis and the biology. Although the authors analyzed multi-point sequencing data well, this paper lacks in-depth discussion. I understand that the results in the paper are "computationally" the most likely. However, the impact is lost by an incomplete connection to biology.

      As you commented, we analyzed the WES data obtained from patient samples by computational methods. In this study, we did not validate the various results using in vitro or in vivo models. However, we would like to emphasize the significance of our work because it is the first human study, covering the current theory of carcinogenesis from precancerous lesions to metastasis in GBAC. For example, polyclonal seeding has been previously confirmed in animal models (Cheung KJ et al., Science 2016). In humans, there have been reports in breast cancer (Ullah I et al., J Clin Invest. 2018) and colorectal cancer (Wei Q et al., Ann Oncol. 2017), but not in GBAC yet.

      3) In addition to the above concern, it is difficult to comprehend the cohort as the detailed information is lacking. I would suggest providing a brief table that contains the number of collected samples, frozen or FFPE, the clinical information, etc. by sample.

      Thank you for the constructive comment. Supplementary Table 1 was modified as you mentioned. It is now indicated from which organ, when, and by what method the tissue was obtained, what the tumor purity of the tissue was, and whether the tissue was fresh-frozen or FFPE. In addition, we updated the information about tissue acquisition sites in Figure 1A.

      4) The mutations with very low allele frequency (< 1%) are discussed in the manuscript. However, no validation data is provided. Please add a description of the accuracy of the mutation calling considering the following concerns.

      • FFPE samples are analyzed using the same method as frozen samples. FFPE contains much more artifacts. Is it adequate to use the same methods for both frozen and FFPE samples?

      Thank you for the valuable comment. We also considered the FFPE artifacts. However, we did not remove the possible artifacts. This part has been described above. Please see Essential Revisions #5.

      • How were those mutations with low allele frequency validated? Are those variants validated by other methods? Especially in FFPE.

      Thank you for the important comment. Firstly, we discarded any low-quality, unreliable reads and variants according to the pre-specified filtering criteria used in previous literature analyzed with the Genomon2 pipeline (Yokoyama A et al., Nature. 2019, Kakiuchi N et al., Nature. 2020, Ochi Y et al., Nat Commun. 2021). In the Method section, we have added an explanation for this part (Page 16 Line 5-12).

      As you commented, validation of low VAF mutation is required if the mutation is sample-specific. However, in this study, if a mutation in Supplementary File 1 has a low VAF in one sample, one of the other samples always has a higher VAF, which has passed our pre-specified filter. Therefore, validation is not required for that mutation. In addition, possible sequencing artifacts with low VAFs in FFPE tissues have been discussed above. Please see Essential Revisions #5.

      • Is the low variant allele frequency (0.2~1%) significantly higher than the background noise level?

      Thank you for the important comment. As you expected, FFPE samples had a higher number of sample-specific mutations than fresh-frozen ones in our study. However, we did not remove these mutations in the analysis of the FFPE samples. For a more detailed description, please see Essential Revisions #5.

      5) The authors compared mutational signatures divided by stages or timings. How are the signatures calculated although each sample has a distinct number of somatic mutations? Did the authors correct the difference?

      Thank you for the helpful comment. We classified all the mutations according to the specific criteria (Page 9 Line 9-18). For example, in Figure 4B (before revision, Figure 4C), mutations were classified by the timing of development during clonal evolution. After that, we could calculate the relative contributions of mutational signatures in each group using the three tools, Mutalisk (Lee J et al., Nucleic Acids Res. 2018), Signal (Degasperi A et al., Nat Cancer. 2020) and MuSiCa (Diaz-Gay M et al., BMC Bioinformatics. 2018). Although the number of mutations is different for each group, no additional correction was required because we compared the relative contributions among the groups.

      6) In distant metastasis tumors, signatures 22 and 24 are increased. Those two signatures are strongly associated with a specific carcinogen. Although the clinical information lacks, do the authors think that those patients were exposed to those chemicals after the diagnosis? Why do the authors think the two signatures increased in the metastatic tumors? Were those signatures validated by other methods?

      We appreciate your important and constructive comment.

      1) We think that the patients might have been exposed to aristolochic acid or aflatoxin before or after the cancer diagnosis. Aristolochic acid is an ingredient of oriental herbal medicine (Debelle FD et al., Kidney Int. 2008, Hoang ML et al., Sci Transl Med. 2013). Given that all the patients in our cohort are Korean, and a recent study found that Korean cancer patients are frequently exposed to herbal medicines (Kwon JH et al., Cancer Res Treat 2019), one possible explanation is that some patients might have been exposed to herbal remedies containing aristolochic acid. On the other hand, aflatoxin is known to be contained in soybean paste and soy sauce, which are widely used in Korean food (Ok HE et al., J Food Prot. 2007). Nevertheless, we believe that further investigation is required because it is difficult to determine the etiology of signatures 22 and 24 with this limited patient data.

      2) Summarizing the mutational signature results using the 3 different tools (Figure 4B and Figure 4—figure supplement 3), the signatures 22 and 24 are relatively rare in early carcinogenesis. However, the two signatures contributed more to late carcinogenesis and metastasis. Therefore, it is speculated that the two carcinogens appear to have little impact on the early stage of cancer development but might be highlighted in overt cancer cells. Further studies on this novel hypothesis are necessary.

      3) During the revision process, signatures 22 and 24 highlighted in the metastasis stage were validated by two additional tools, Signal (Degasperi A et al., Nat Cancer. 2020) and MuSiCa (Diaz-Gay M et al., BMC Bioinformatics. 2018) (Figure 4—figure supplement 3). We updated this part in the Result (Page 9 Line 18-21) and Discussion (Page 13 Line 2-7) sections.

      Reference for reply 1)

      • Kwon JH, Lee SC, Lee MA, Kim YJ, Kang JH, Kim JY, et al. Behaviors and Attitudes toward the Use of Complementary and Alternative Medicine among Korean Cancer Patients. Cancer Res Treat. 2019;51(3):851-60.

      7) Figures 2 are well-described. However, they are difficult for readers to fully understand. The colors for each clone are sometimes similar. The results of multi-time point and regional analyses in the cases with multiple sampling are not integrated. Driver mutations are separately described in the small phylogenetic trees. Evolutional patterns (linear or branching) are not described in the figures. Modifying the above concerns would improve the manuscript.

      We appreciate your important comment.

      1) In GB-S1, clones of similar colors were modified to be different colors.

      2) Figures 2 and 3 have been modified to make them easier to understand by separating time and space more clearly.

      3) Driver mutations are now indicated in both the phylogenetic tree and TimeScape result (Figures 2 and 3).

      4) Evolutional patterns (linear or branching) can be discovered by examining the phylogenetic tree in Figures 2 and 3. In addition, we described each patient's evolutionary pattern more clearly in the manuscript.

      8)"Among 6 patients having concurrent BilIN tissues, two patients were excluded from the further analysis because of low tumor purity in one patient and different mutational profiles between BilIN and primary GBAC in the other patient, suggesting different origins of the two tumors (Figure 1-figure supplement 2)." This seems cherry-picking. More explanation is necessary.

      • How is the tumor purity? Although the authors use 0.2% variant allele frequency as true mutation (for example Table 2), is the tumor purity lower than 0,2%?

      Thank you for the important comment. The calculated tumor purity of BilIN in the GB-S8 patient was 0.03 based on the WES data. We added this value to the manuscript (Page 6 Line 9) and Supplementary Table 1. Although variants were called in this case, the tumor purity was too low to estimate the allele-specific copy number, and thus sophisticated analysis as in other patients was not possible. In addition, the value of 0.2% in Table 2 is not the VAF, but cellular prevalence calculated by PyClone and CITUP. Although the value is low in the primary tumor, it is mentioned because it is high in metastatic lesions.

      • BilIN and GBAC of GB-S7 have some shared mutations. Why do the authors conclude that BilIN and GBAC have distinct origins? Do the authors think that those shared mutations are germline mosaic mutations?

      Thank you for the important comment.

      1) We think that the BilIN and GBAC of the GB-S7 patient are tumors of different origins because BilIN and GBAC of the GB-S7 patient have different truncal mutations (Figure 1—figure supplement 2C). This is a markedly different feature compared to BilIN and GBAC samples of other patients. We have added an explanation for this part to the Results section (Page 6 Line 9-11).

      2) We do not think that mosaicism occurred at the developmental stage. In addition, although some mutations were identified from both BilIN and GBAC, we cannot determine their importance because either one of the lesions had a very low VAF ranging from 0.001 to 0.04. If the mosaicism occurred only in the GB at the developmental stage, the VAF values of the shared mutations should be much higher than the current values, and the VAF values of the two BilIN and GBAC lesions should be similar.

      • Was the copy number profile compared between BilIN and GBAC?

      Thank you for the constructive comment. In this study, we obtained allele-specific copy numbers using Control-FREEC version 11.5 (Boeva V et al., Bioinformatics. 2012). The copy number of the mutations in the GB-S8 patient's BilIN could not be estimated by Control-FREEC due to low tumor purity (0.03). In the case of GB-S7, BilIN and GBAC were determined to be of a different tumor origin and thus disregarded from the analysis.

    1. Author Response

      Reviewer #1 (Public Review):

      It's here where my very mild (I truly liked this article - it is well done, well written, and creative) comments arise. The implications for stochastic strategies immediately emerge in the early results - bimodal strategies come about from the introduction of two variables. There is not enough credence given to the field of stochastic behavior in the introduction - the introduction focuses too much on previous models of predator-prey interaction, and in fact, Figure 1, which should set up the main arguments of the article, shows a model that is only slightly different (slight predator adjustment) that is eventually only addressed in the Appendix (see below). The question of "how and when do stochastic strategies emerge?" is a big deal. Figure 1 should set up a dichotomy: optimal strategies are available (i.e., those that minimize Tdiff) which would predict a single unimodal strategy. Many studies often advocate for Bayesian optimal behavior, but multimodal strategies are the reality in this study - why? Because if you consider the finite attack distance and inability of fish to evoke maximum velocity escapes while turning, it actually IS optimal. That's the main point I think of the article and why it's a broadly important piece of work. Further framing within the field of stochastic strategies (i.e., stochastic resonance) could be done in the introduction.

      We appreciate the comment provided by the reviewer. We changed the second paragraph of the introduction so as to focus more on the protean tactic (stochasticity). We added a new figure (Figure 1 in the new version) to conceptually show the escape trajectories (ETs) of a pure optimal tactic, a pure protean tactic, a combination of optimal and protean tactics, and an empirically observed multimodal pattern. We explained each tactic and described that the combination of the optimal and protean tactics still cannot explain the empirically observed multiple preferred ETs.

      The revised paragraph (L49-66) is as follows: Two different escape tactics (and their combination) have been proposed to enhance the success of predator evasion [16, 17]: the optimal tactic (deterministic), which maximizes the distance between the prey and the predator (Figure 1A) [4, 14, 15, 18], and the protean tactic (stochastic), which maximizes unpredictability to prevent predators from adjusting their strike trajectories accordingly (Figure 1B) [19-22]. Previous geometric models, which formulate optimal tactics, predict a single ET that depends on the relative speeds of the predator and the prey [4, 14, 15, 18], and additionally, predator’s turning radii and sensory-motor delay in situations where the predator can adjust its strike path [23-25]. The combination of the optimal tactic (formulated by previous geometric models), which predicts a specific single ET, and the protean tactic, which predicts variability, can explain the ET variability within a limited angular sector that includes the optimal ET (Figure 1C). However, the combination of the two tactics cannot explain the complex ET distributions reported in empirical studies on various taxa of invertebrates and lower vertebrates (reviewed in [26]). Whereas some animals exhibit unimodal ET patterns that satisfy the prediction of the combined tactics or optimal tactic with behavioral imprecision (e.g., [27]), many animal species show multimodal ETs within a limited angular sector (esp., 90–180°) (Figure 1D) (e.g., [4, 5, 28]). To explore the discrepancy between the predictions of the models and empirical data, some researchers have hypothesized mechanical/sensory constraints [17, 29]; however, the reasons why certain animal species prefer specific multiple ETs remain unclear.

      All experiments are well controlled (I especially liked the control where you varied the cutoff distance given that it is so critical to the model). Some of the figures require more labeling and the main marquee Figure 1 needs an overhaul because (1) the predator adjustment model that is only addressed in the Appendix shouldn't be central to the main introductory figure - it's the equivalent of the models/situations in Figure 6, and probably shouldn't take up too much space in the introductory text either (2) the drawing containing the model variables could be more clear and illustrative.

      (1) According to this comment and comment #11 from reviewer #2, we moved the two panels in the figure (Figure 1B and D in the original version) to Appendix-figure 1, and accordingly, we changed the first paragraph of the Model section so as to clearly describe that we focus on Domenici’s model in this study (L103-108).

      As for Figure 6 (Figure 7 in the new version) and related parts, we tempered our claims to clearly describe that our model has only the potential to explain the different patterns of escape trajectories observed in previous works. We would like to keep this figure in the main text because it is fundamental to explain the potential applicability of our model to other predator-prey systems.

      (2) To alleviate the burden for readers, we added the model variables to the figure and made them colored (Figure 2B in the new version).

      Finally, I think a major question could be posed in the article's future recommendations: Is there some threshold for predator learning that the fish's specific distribution of optimal vs. suboptimal choice prevents from happening? That is, the suboptimal choice is performed in proportion to its ability to differentiate Tdiff. This is "bimodal" in a sense, but a probabilistic description of the distribution (e.g., a bernoulli with p proportional to beta) would be really beneficial. Because prey capture is a zero-sum game, the predator will develop new strategies that sometimes allow it to win. It would be interesting if eventually the bernoulli description could be run via a sampler to an actual predator using a prey dummy; one could show that the predator eventually learns the pattern if the bernoulli for choosing optimal escape is set too high, and the prey has balanced its choice of optimal vs. suboptimal to circumvent predator learning.

      We thank the reviewer for this constructive comment. Actually, we are now developing a dummy prey system. We added the following sentence in the Discussion to mention future research.

      The added sentence (L496-499): Further research using a real predator and dummy prey (e.g., [48]) controlled to escape toward an optimal or suboptimal ET with specific probabilities would be beneficial to understand how the prey balances the optimal and suboptimal ETs to circumvent predator learning.

      Reviewer #2 (Public Review):

      First, it is unclear how the dummy predator is actuated. The description in the Methods section does not clearly address how rubber bands are used for this purpose.

      To clearly mention how the dummy predator was actuated by rubber bands, we added a figure (Figure 3-figure supplement 3B) and the following sentences.

      The added sentences (L608-611): The dummy predator was held in place by a metal pipe anchored to a four-wheel dolly, which is connected to a fixed metal frame via two plastic rubber bands (Figure 3—figure supplement 3B). The wheel dolly was drawn back to provide power for the dummy predator to strike toward the prey.

      Second, the predator's speed, which previous research has identified as a critical factor during predator-prey interactions, is not measured from the motion of the dummy predator in the experiments. Instead, it is estimated using an optimization algorithm that utilizes the mathematical model and the prey-specific parameters. It is unclear why the authors chose this method over measuring velocity from their experiments. Since the prey fish are responding to a dummy predator moving toward them at a particular speed during the interaction, it is important to measure the speed of the predator or clearly explain why estimating it using an optimization procedure is more appropriate.

      We chose this method (optimizing predator speed from the prey’s viewpoint) because there was no significant effect of predator speed on the escape trajectory in our experiment (L203-208). In other words, we considered that, at least in our case, the prey did not change the escape trajectory in response to the predator speed, and thus it would be more appropriate to use a specific predator speed estimated through an optimization algorithm from the prey’s point of view. It may be appropriate to use measured predator speed in other cases where the prey adjusts the escape trajectory in response to the change in predator speed. Therefore, we conducted a further analysis using actual predator speeds (both the predator speed at the onset of escape response, and the mean speed for the predator to cover the distance between the predator and prey). The results show that the model fit became worse when using measured predator speed per trial compared to the model using the fixed predator speed estimated through the optimization procedure (Table 3—source data 1; Figure 5—figure supplement 1). We added the above explanation in L219-226.

      One of the major claims of this article is that the model can explain escape trajectories observed in other predator-prey systems (presented in Figure 6). Figure 6 panels A-C show the escape responses of different prey in response to some threatening stimuli. Further, panels D-F suggest that the empirical data can be predicted with the model. But the modeling parameters used to produce the escape trajectories in D-F are derived from the authors' experiments with fish, instead of the experiments with the species shown in panels A-C.

      We thank the reviewer for this comment. We agree that this part in the previous version was an over-interpretation. Therefore, we tempered our statements to simply suggest that our approach has the potential to explain multiple ETs observed in other taxa. The revised sentences are as follows.

      Abstract (L27-30): By changing the parameters of the same model within a realistic range, we were able to produce various patterns of ETs empirically observed in other species (e.g., insects and frogs): a single preferred ET and multiple preferred ETs at small (20–50°) and large (150–180°) angles from the predator.

      Results (L395-407): Potential application of the model to other ET patterns. ...(sip)... To investigate whether our geometric model has the potential to explain these different ET patterns, we changed the values of model parameters (e.g., Upred, Dattack) within a realistic range, and explored whether such adjustments can produce the ET patterns observed in the original work. ...(sip)... These results indicate that our model has the potential to explain various patterns of observed animal escape trajectories.

      Discussion (L538-548): We show that our model has the potential to explain other empirically observed ET patterns (Figure 7). ...(sip)... Further research measuring the escape response in various species and applying the data to our geometric model is required to verify the applicability of our geometric model to various predator-prey systems.

    1. i think so like in social terms the conservatives would say well i like that it benefits from the wisdom of math already invented you're not 00:36:39 throwing anything away you're not you're not throwing it all away and starting over you're taking what we already have and you're you're using it that's great and a libertarian might say i really like that you're free to create as you see fit you can make anything you 00:36:52 want and you're working within this background framework that's minimally invasive it doesn't make a lot of rules for you but it is highly functional i like that it kind of keeps everyone in line while 00:37:03 like satisfying some formal contracts or something while still being uh i'm still free to create and a progressive might say i like about category that theory that everyone can contribute to 00:37:15 making their own world making it more rich adding new ideas uh making it more meaningful understanding connections between things a modern viewpoint would say i like that 00:37:26 it's completely rigorous that it's been used in proving well-known conjectures that people thought were important to prove but also that it's interesting it's useful in science and technology and a postmodern person might say i like 00:37:40 that um that no perspective is right that that there's just all sorts of different categories but that navigating between these perspectives lets you look at problems from all sides or a hippie might say i like that it's 00:37:53 all about relationship and connection or irrelevant i don't know what that means maybe a practical person might say that i like that it's that we can actually use it to organize and learn from big data in 00:38:06 today's world or to manage complexity of software projects that are that are very large and changing all the time i like that you can think about ai and other complex systems with this stuff i think it's relevant and 00:38:19 practical for right now so that's that's my uh tutorial or that's the the part i'm going to record and now i'm going to open it up for questions

      David Spivak discusses how category theory may appeal to different political ideologies for a variety of reasons.

    1. Kevin Flowers Nov 7th at 12:50 PM# Question about repliesForgive me a bit if this is the wrong place to ask, but is the feature of having Hypothes.is list replies somewhere on the roadmap?  I checked the github issues with "label:enhancement" but nothing matches what I'm wondering aboutI could be missing something obvious, but when I search my username in https://hypothes.is/users, none of the replies I've made on other people's public annotations show up# Use casesSometimes people have insightful observations and references they provide, so I tend to reply to those annotations with tags that I use to sort through (eg, tags like "to read", "how to", "tutorial", and so forth)I also tend to make comments on what the OP's annotation made me think of at the time of reading it which is exemplified in the attached screenshotimage.png 9 repliesMichael DiRoberts  7 days ago@Kevin Flowers You’re right, the Activity Page (https://hypothes.is) doesn’t show replies. The Notebook, which will be built out more with time, does.https://web.hypothes.is/help/how-to-preview-the-hypothesis-notebook/HypothesisHow to Preview the Hypothesis Notebook : HypothesisHypothesis has released an early preview of Notebook, which enables you to view, search for, and filter annotations. While this tool is available in both the LMS and web apps, it is designed to bring much-needed functionality to our LMS users. This initial release contains some basic features we have planned to include in the […]Est. reading time2 minutes1Michael DiRoberts  7 days agoI hope Notebook solves the issue for you! For now it’s going to work on private groups and not the Public group (due to it having a limit of 5,000 annotations), though that may change in the future.Michael DiRoberts  7 days agoIf you’re comfortable using APIs then you might check out our API as well: https://h.readthedocs.io/en/latest/api-reference/v1/.You can find replies by looking at rows that contain references.Kevin Flowers  7 days agoOh, the Notebook seems like a neat tool, I'll have to share that with some friendsKevin Flowers  7 days agoThe issue for my own PKM (personal knowledge management) stack is that I couple Hypothes.is with an Obsidian [1] plugin that imports my annotations into my local file system.  Atm, I think the plugin only references the Activity Page to import annotations, so it looks like I'll have to play around with the API you mentioned if I want to grab my replies (along with their parent replies & annotations)[1] Obsidian is a notetaking software similar to Roam & Logseq; it just adds a pretty GUI on top of .md files which are stored locallyMichael DiRoberts  7 days agoNote that the Obsidian plugin wasn’t made by us, so I’m not familiar with how it works. It’s a little weird to me that it would work over the activity page and not use our API, however.Brian Cordan Young  7 days ago@Kevin Flowers Do you have, or have you considered, blogging about your use of Hypothesis as a part of a PKM?I’m still not a regular user of Hypothesis because it doesn’t fit in to my current info consumption well enough. That said I love learning how others do fit it in.(Obsidian is really great too) (edited) Kevin Flowers  7 days ago@Michael DiRoberts ah, you're right, thanks for mentioning that.   Looks like it requires one to generate an API token in order to pull highlights, so it must be using the Hypothes.is API in some way.  Sadly, I'm not familiar enough with general software development design (or JavaScript/TypeScript), and the source code for obsidian-hypothesis-plugin doesn't have enough high level comments for me to parse what any given file does.  It'll probably be cumbersome and somewhat painful, but I'll probably learn more by just building something from scratch@Brian Cordan Young Huh, I hadn't considered that until you mentioned it.   Recently developed some interest in building something with JavaScript (probably with the Next.js framework), so a blog might be just the project I've been looking forGitHubobsidian-hypothesis-plugin/src at master · weichenw/obsidian-hypothesis-pluginAn Obsidian.md plugin that syncs highlights from Hypothesis. - obsidian-hypothesis-plugin/src at master · weichenw/obsidian-hypothesis-plugin (150 kB)https://github.com/weichenw/obsidian-hypothesis-plugin/tree/master/srcMichael DiRoberts  7 days agoJust in case, or for others in the future, you can generate a Hypothesis API token here: https://hypothes.is/account/developer1

      This is a post I made on the Slack public channel asking about whether or not Hypothes.is indexes replies. A tech support membered confirmed this is true.

      However, Obsidian's Hypothes.is plugin does pull replies. It should be noted that default settings don't capture updates to the annotations or tags.

    1. Educational policy has placed teachersin a precarious corner of needing to address the ongoing needs and ques-tions in their classrooms while also navigating worries that administrators,parents, and observers may see these efforts as indoctrination.

      This made me think about the meaning of hidden curriculum. How are we ensuring as educators that we are addressing state standards while also addressing the needs of our students social and emotionally. This also makes me think about how we can intertwine transformative healing practices into our everyday curriculum

    1. Author Response

      Reviewer #2 (Public Review):

      Grasses develop morphologically unique stomata for efficient gas exchange. A key feature of stomata is the subsidiary cell (SC), which laterally flanks the guard cell (GC). Although it has been shown that the lateral SC contributes to rapid stomatal opening and closing, little is known about how the SC is generated from the subsidiary mother cell (SMC) and how the SMC acquires its intracellular polarity. The authors identified BdPOLAR as a polarity factor that forms a polarity domain in the SMC in a BdPAN1-dependent manner. They concluded that BdPAN1 and BdPOLAR exhibit mutually exclusive localization patterns within SMCs and that formative SC division requires both. Further mutant analysis showed that BdPAN1 and BdPOLAR act in SMC nuclear migration and the proper placement of the cortical division site marker BdTANGLED1, respectively. This study reveals a unique developmental process of grass stomata, where two opposing polarity factors form domains in the SMC and ensure asymmetric cell division and SC generation.

      The findings of this study, if further validated, are novel and interesting. However, I feel that the data presented in the current manuscript do not fully support some crucial conclusions. The lack of dual-color images is the weakest point of this study. If it is technically impossible to add them, alternative analyses are needed to validate the main conclusions.

      1) Is BdPOLAR-mVenus functional? Although the authors interpret that weak BdPOLAR-mVenus expression partially rescued the bdpolar mutant phenotype in Fig. S4D, the localization pattern visualized by BdPOLAR-mVenus may not be completely reliable with this partial rescue activity.

      This is indeed a valid point. The partial complementation of weakly expressing translational reporters (Figure 3–figure supplement 1D) and the weak effect of BdPOLAR-mVenus overexpression lines (Figure 3–figure supplement 1J) at least suggest partial functionality which is strongly dependent on dosage. Yet the localization pattern and the temporal dynamics might indeed not fully reflect the spatiotemporal dynamics of the endogenous BdPOLAR. This criticism is, however, true for any transgenic reporter line–even when fully complementing–as the requirement for dosage, stability, and turnover likely varies strongly between different protein classes and functions.

      Nonetheless, we have added a sentence on p. 7, which mentions this potential caveat.

      2) Regardless of the functionality of the tagged protein, the authors need to provide more information on their localization. For example, is there a difference in polarity pattern depending on expression level? Does overexpressed BdPOLAR-mVenus invade the BdPAN1 zone? In such cases, might the loss of BdPOLAR polarity in the bdpan1 mutant be a side effect of overexpression, not PAN1 exclusion? Does BdPOLAR expression (no tag) show a dose-dependent effect, similar to the mVenus-tagged protein?

      The difference in polarity patterns in bdpan1 mutants and wild-type does not depend on expression level. BdPOLAR-mVenus was crossed into bdpan1 and mutant and wild-type siblings in the F2 generation were analyzed. This means that the data presented in Fig. 3E and F show exactly the same transgene insertion line in wt and bdpan1 and were imaged with the same setting for comparability. Therefore, the difference in localization is not due to different expression levels but indeed reflects a PAN1-dependent effect.

      To address if BdPOLAR without a tag is also sensitive to dosage, we have generated an untagged complementation line that includes the untagged, genomic locus of BdPOLAR including promoter (-3.1kb) and terminator (+1.1kb). Yet, even though this construct is much better at rescuing the mutant, we still see remaining defects in T0 lines (Figure 3–figure supplement 1K) suggesting that even without a tag we cannot fully recapitulate wild-type functionality. Yet, to actually measure protein levels of untagged BdPOLAR, we would need to raise an antibody against BdPOLAR, which we think is clearly out of the scope of this study.

      3) A major conclusion of this study was that the polarity domains of BdPOLAR and BdPAN1 are mutually exclusive. However, not all the cells in the figures were consistent with this statement. For example, the BdPOLAR signals at the GMC/SMC interphase appear to match BdPAN1 localization (compare 0:03 s in Video 1 and 0:20 s in Video 2 [top cell]). The 3D rendered image in Fig. 2F shows that BdPOLAR is excluded near the GMC on the front side of the SMC, where BdPAN1 is not localized. Some cells did not exhibit polarization (Fig. 3A, bottom left; Fig. 3E, bottom left). The most convincing data are the dual-color images of these two proteins. Otherwise, a sophisticated image analysis is required to support this conclusion.

      We agree that dual-color image analysis would have provided the most convincing data. As mentioned in our answers to the reviewing editor and reviewer 1, we have generated a dual marker line (BdPAN1p:BdPAN1-CFP; BdPOLARp:BdPOLAR-mCitrine), yet the BdPAN1-CFP signal (compared to mCitrine signal) was too weak to visualize the proximal BdPAN1 domain.

      This issue was also raised by reviewer 1 and deemed an essential revision. To determine how BdPOLAR and BdPAN1 relate spatially to each other, we have added data in Figure 2E where we manually traced mature SMC outlines to determine BdPOLAR-mVenus and BdPAN1-mCitrine occupancy along the SMC’s circumference. This confirmed that the polarization is indeed opposite yet not perfectly reciprocal (see details above, Essential Revisions #1).

      Finally, we realized that the 3D image renderings were more confusing than helpful and we removed them from the revised version.

      4) Another central conclusion was that BdPOLAR was excluded at the future SC division site, marked with BdTANGLED1. However, these data are also not very convincing, as such specific exclusion cannot be seen in some figure panels (e.g., Fig. 3A, bottom left; Fig. 3E, all three cells on the left). If dual-color imaging is not feasible, a quantitative image analysis is needed to support this conclusion.

      As for point 3, this was also criticized by reviewer 1 and deemed an essential revision by the reviewing editor.

      To determine whether the absence of BdPOLAR signal and the presence of BdTAN1 signal colocalize, we again manually traced mature SMC outlines to determine BdPOLAR-mVenus and BdTAN1-mCitrine occupancy along the SMC’s circumference. We plotted the relative average fluorescence intensity in Figure 4G-I nicely showing that BdTAN1 indeed resides in the BdPOLAR gaps above and below the GMC (again, details above, Essential Revisions #2).

      5) I could not find detailed imaging conditions and data processing methods. Are Figs. 2B and 2E max-projection or single-plane images? If they are single-plane images, which planes of the SMC are observed? In addition, how were Figs. 2C and 2F rendered? (e.g., number of images, distance intervals, processing procedures). This information is important for data interpretations.

      We agree that we might not have provided sufficient imaging condition details and have added more details regarding image acquisition in the method part (p. 20). We always use a consistent depth and show the midplane of SMCs. As mentioned above, we removed Figs. 2C and 2F and the supplemental movies as these data did not seem to be helpful.

      6) [Minor point] The authors should clearly describe where BdPAN1 is expressed and localized. Is it expressed in the GMC and localized at the GMC/SMC interface? Alternatively, is it expressed and localized in the SMC?

      BdPAN1 is expressed throughout the epidermis but starts to strongly accumulate at the GMC/SMC interface. According to the literature (Cartwright et al 2009 with immunostainings against ZmPAN1 and Sutimantanapi et al. 2014 with PAN1 and PAN2 reporter) and our own observations (Fig. S3), this accumulation occurs in the SMC rather than in the GMC. In Fig. S3A, third panel, second GMC from the top, for example, one can see that the early PAN1 polarity domain expands beyond the GMC/SMC interface suggesting that it is indeed forming in SMCs rather than in GMCs. We have specified this in the text more clearly now (p. 5).

    1. Author Response

      Reviewer #1 (Public Review):

      The research investigates the genetic basis for resistance to high CO2 levels in the human pathogenic fungus Cryptococcus neoformans. Screening collections of over 5,000 gene deletion strains revealed 96 with impaired growth, including a set of genes all related to the same RAM signaling pathway. Further genetic dissection was able compellingly to place where this pathway lies relative to upstream inputs and through the isolation of suppressor mutants as potential downstream targets of the pathway. Given the high levels of CO2 encountered by fungi in the human host, this work may provide new directions for the control of disseminated fungal disease.

      The research presents both strengths and weaknesses.

      Strengths include:

      (1) One of the largest scale analyses of genes involved in growth under high CO2 concentrations in a fungus, revealing a set of just under 100 mutants with impaired growth.

      (2) Elegant genetic epistasis analysis to show where different components fit within a pathway of transmission of CO2 exposure. For example, over expression of one of the kinases, Cbk1, can overcome the CO2-sensitivity of mutations in the CDC24 or CNA1 genes (but not in the reciprocal overexpression direction).

      (3) Isolation of suppressor mutations in the cbk1 background, now able to grow at high CO2 levels, was able to lead to the identification of two genes. Follow up characterization, which included examining in vitro phenotypes, gene expression analysis, and impact during mouse infection was able to reveal that the two suppressors restore a subset of the phenotypes impacted by mutation of CBK1. Indeed, one conclusion from this careful work is that the reduced virulence of the cbk1 mutant is not due to its sensitivity to high levels of CO2, perhaps an unexpected finding given the original goals of the study towards linking CO2 sensitivity with decreased virulence.

      Weaknesses include:

      (1) What is the rationale for examining gene expression using the NanoString technology of 118 genes rather than a more genome-wide approach such as RNA-sequencing?

      (2) Without additional species examined, some of the conclusions about differences in impact between ascomycetes and basidiomycetes might instead reflect differences between species. For example, RAM mutants in other strains of C. neoformans do not exhibit so strong a temperature sensitive phenotype. Or to extend the comparison further, one might assume given the use of CO2 for Drosophila manipulations that the RAM pathway components in an insect would not be required for surviving high CO2.

      (3) Given the relative ease of generate progeny of this species, it would have been informative to explore if the suppressors of cbk1 also suppressed the loss of genes like CDC24, CNA1, etc, equivalent to the experiment performed of overexpression of CBK1 in those backgrounds.

      We thank the reviewer for the kind summary of our work and the highlights of the major findings. We chose NanoString because we have already generated a probe set of 118 genes that are differentially expressed in response to CO2 based on RNA-seq profiles of multiple natural cryptococcal isolates in a separate study. Nanostring allowed us to focus on CO2 relevant transcripts and do multiple replicates and conditions in a way that is not practical using RNA-Seq.

      Although the RAM pathway has not been extensively characterized in different species of Cryptococcus, we do know that RAM pathway mutants lead to pseudohyphal growth in multiple strain backgrounds including two different species of Cryptococcus (Magditch, Liu, Xue, & Idnurm, 2012; Walton, Heitman, & Idnurm, 2006). We have added corresponding references and discussed this point on lines 167-169.

      We agree with the reviewer that it would be interesting to test the effects of the cbk1Δ suppressor mutations in the backgrounds of other CO2-sensitive gene knockout strains. This is part of our plan for future investigation in characterizing the signaling pathways involved in CO2 tolerance.

      Reviewer #2 (Public Review):

      In the paper by Chadwick et al., the authors identify the molecular determinants of CO2 tolerance in the human fungal pathogen Cryptococcus neoformans. The authors have screened a collection of deletion mutants to identify the genes that are sensitive at 37oC (host temperature) and elevated CO2 levels. The authors identified that the genes responsible for CO2 sensitivity are involved in the pathways responsible for thermotolerance mechanisms such as Calcineurin, Ras1-Cdc24, cell wall integrity, and the Regulator of Ace2 and Morphogenesis (RAM) pathways. Moreover, they identified that the mutants of the RAM pathway effector kinase Cbk1 were most sensitive to elevated temperature and CO2 levels. This study uncovers the previously unknown role of the RAM pathway in CO2 tolerance. Transcriptome data indicates that the deletion of CBK1 results in an alteration in the expression of CO2-related genes. To identify the potential downstream targets of Cbk1, the authors performed a suppressor screen and obtained the spontaneous suppressor mutants that rescued the sensitivity of cbk1 mutants to elevated temperature and CO2. Through this screen, the authors identified two suppressor groups that showed a modest improvement in growth at 37˚C and in presence of CO2.

      Interestingly, from the suppressor screen, the authors identified a previously known interactor of Cbk1 which is SSD1, and an uncharacterized gene containing a putative Poly(A)-specific ribonuclease (PARN) domain named PSC1 (Partial Suppressor of cbk1Δ) which acts downstream of Cbk1. Deletion of these two genes in cbk1 null mutants rescued the sensitivity to elevated CO2 levels and temperature but did not fully rescue the ability to cause disease in mice.

      This study highlights the underappreciated role of the host CO2 tolerance and its importance in the ability of a fungal pathogen to survive and cause disease in host conditions. The authors claim to gain insight into the genetic components associated with carbon dioxide tolerance. The experimental results including the data presented, and conclusions drawn do justice to this claim. Overall, it is a well-written manuscript. However, some sections need improvement in terms of clarity and experimental design.

      • One major drawback of the study is the virulence assay performed to test the ability of cbk1 mutants to cause the disease in the mouse model. The cbk1 null mutants are thermosensitive in nature. Using these mutants, establishing the virulence attributes in mice would undermine the mutants' ability to infect mice as they won't be able to survive at the host body temperature.

      • The rationale for choosing the genes to test further is not clear in two instances in the study. a) From a list of 96 genes, how do the authors infer the pathways involved? Was any pathway analysis performed that helped them in shortlisting the pathways that they subsequently tested? A GO term analysis of the list of genes identified through the genetic screen would be more helpful to get an overview of the pathways involved in CO2 tolerance. b) The authors do not clearly mention why they chose only four genes to test for the CO2 sensitivity out of 16 downregulated genes identified from the nano string analysis.

      • It would be more useful to the readers if the authors could also include a thorough analysis of the presence of the putative PARN domain-containing protein across various fungal species rather than mentioning that it is only observed in C. neoformans and S. pombe. Also, the authors may want to discuss the known role(s) of SSD1, if any, in pathogenic ascomycetous yeasts so that the proposed functional divergence is supported further.

      We are glad that the reviewer appreciated the approach, the findings, and the significance of this research, and we are grateful for the helpful suggestions to improve the manuscript.

      To remove temperature sensitivity as a variable when testing virulence, we have added a new infection model in the revised manuscript to test the cbk1Δ mutant and its suppressors. This infection model uses the Galleria mellonella larvae as a host. G. mellonella larvae are commonly used to test virulence for temperature sensitive strains as the body temperature of the larvae is similar to that of the environment. We performed cryptococcal infection in this model and the larvae were kept at 30°C rather than at 37°C. The results of these experiments are now described in results section 5 and shown in Figure 6 of the manuscript. The data using the larva-infection model supports our original conclusion about the virulence of these strains observed in mouse models.

      We performed a GO term analysis of the hits from our screening, but did not find any significant or outstanding pathways. From our list of 96 genes, we chose to focus on the RAM pathway because the mutants were among the most sensitive to CO2. We have added an explanation for the genes we decided to test for host CO2 level sensitivity from the 16 downregulated genes on lines 139-141.

      Through Blast searching, we have found that the PARN domain-containing protein has homologs in other basidiomycetes. There might be some homologs in a few zygomycetes and ascomycetes but the confidence scores were so low that we deemed unlikely. We now report this in the manuscript on lines 210-213, “This domain was previously reported to be found in S. pombe (Marasovic, Zocco, & Halic, 2013). Interestingly, through a Blast search of the PARN domain, we did not identify this domain in the genomes of S. cerevisiae, C. albicans or other ascomycetes, but found it in Basidiomycetes and higher eukaryotes”.

      Ssd1 has been studied in the pathogenic yeast Candida albicans and is also regulated by Cbk1 in this organism. We have added a discussion about possible functions of Ssd1 in C. neoformans based on references to studies in C. albicans in the discussion section on lines 401-408. “In C. albicans, Ssd1 plays an important role in polarized growth and hyphal initiation by negatively regulating the transcription factor Nrg1 (H. J. Lee, Kim, Kang, Yang, & Kim, 2015). The observation that cbk1Δpsc1Δ and cbk1Δssd1Δ suppressor mutants partially rescue cell separation defects or depolarized growth suggests that C. neoformans may primarily utilize Ssd1/Psc1 rather than a potential Ace2 homolog to regulate cell separation or polarization. Differential regulation of target mRNA transcripts by Ssd1 and Psc1 may explain the functional divergence of the RAM pathway we observed here between basidiomycete Cryptococcus and the ascomycete yeasts.”

      Reviewer #3 (Public Review):

      In this work the authors identify genes and pathways important for CO2 and thermotolerance in Cryptococcus neoformans. They additionally rule out the contribution of the bicarbonate or cAMPdependent activation of adenylyl cyclase to this pathway, which is important for CO2 sensing in other fungi, further solidifying the need to characterize CO2 sensing in basidiomycetes. The authors establish the importance of focusing on CO2 tolerance by testing the impact of CO2 on fluconazole susceptibility with varied pH, suggesting the ability of CO2 to sensitize cryptococcal cells to fluconazole. Furthermore, the authors compared the CO2 tolerance of clinical reference strains to environmental isolates. The characterization of the RAM pathway Cbk1 kinase illustrated the integration of multiple stress signaling pathways. By using a series of CBK1OE insertions in strains with deletions in other pathways, the ability of Cbk1 over-expression to rescue several strains from CO2 sensitivity was apparent. Additionally, NanoString expression analysis comparing cbk1∆ to H99 validated the author's screen of CO2-sensitive mutants as 16/57 downregulated genes were found in their screen, further confirming the interconnected nature of these pathways. The importance of the RAM pathway in maintaining CO2 and thermotolerance was also incredibly clear.

      Perhaps most interestingly, the authors identify suppressor colonies with distinctive phenotypes that allowed for the characterization of downstream effectors of the RAM pathway. These suppressor colonies were found to have mutations in SSD1 and PSC1 which somewhat restore growth at 37oC with CO2 exposure. Further confirming the importance of the RAM pathway, the cbk1∆ strain had markedly attenuated virulence during infection. Interestingly, the generated suppressor strains had varying impacts on fungal infection in vivo. While the sup1 suppressor was completely cleared from the lungs during both intranasal and IV infection, the sup2 strain, containing mutations in SSD1, maintained a high fungal load in the lungs and was able to disseminate into host tissues during IV infection but not intranasal infection.

      The authors make a strong case for the exploration of thermotolerance and CO2 tolerance as contributors to virulence. Through screening and characterization of RAM pathway kinase CBK1's ability to rescue other mutants from CO2 sensitivity, the overlapping contributions of several signaling pathways and the importance of this kinase were revealed. This work is important and will be valuable to the field. However, the cbk1∆ strain does show reduced melanization, urease secretion, and higher sensitivity to cell wall stressor Congo Red in SI Appendix, Figure S4. While the authors make a strong argument that these well-established virulence factors are not perfect predictors of virulence in vivo, the cbk1∆ strain is not an example of such a case as it does have defects in these important factors in addition to thermotolerance and CO2 tolerance. Not acknowledging the changes in these virulence factors in the cbk1∆ and their potential contribution to phenotypes observed is a weakness of the manuscript. Interestingly, the sup1 and sup2 strains also rescue these virulence factors compared to cbk1∆. Additionally, the assertion that "the observation that only sup2 can survive, amplify, and persist in animals stresses the importance of CO2 tolerance in cryptococcal pathogens" due to the sup2's slightly higher CO2 tolerance compared to sup1, could be better supported by the data. These suppressors did not restore transcript abundances of the differentially expressed genes to WT levels, suggesting post-transcriptional regulation. However, there may be differences in the ability of sup2 to resist stress better than sup1 especially given the known Ssd1 repression of transcript translation in S. cerevisiae. Finally, pH appears to impact the sup1 and sup2 strain's sensitivity to CO2 in SI Appendix Figure 4. This could be better explained and interrogated in the manuscript. Finally, this work includes a variety of genes in several signaling pathways. The paper would be greatly clarified by a graphical abstract indicating how CBK1 may be integrating these pathways or by indicating which genes belong to which pathways in the Figure 1 legend to make this figure easier to follow.

      We thank the reviewer for the thorough summary of the study. We appreciate the reviewer’s enthusiasm about this study as well as constructive critiques on the manuscript. Indeed, the suppressor mutations in the cbk1Δ mutant rescue more phenotypes of cbk1Δ in vitro than just thermotolerance and CO2 tolerance (Supplemental Figure 5), which could benefit the survival of these suppressor strains in vivo compared to the original the cbk1Δ mutant. However, between the sup1 and the sup2 mutants, the only clear difference in growth we observed was in host levels of CO2 and temperature. There was no obvious difference in their resistance to Congo red (cell wall stress), melanization, susceptibility to FK506 (calcineurin pathway inhibitor), sensitivity to H2O2 (ROS), or urease (Supplemental Figure 5). Nonetheless, we agree with the reviewer that there could be other reasons which may influence the outcome in vivo, given that the host environment is more complex than we know. We have changed our wording in the manuscript to make it clear that contribution of better tolerance of CO2 to better survival of the sup2 mutant is only our hypothesis and there could be other unrecognized contributing factors. “The only in vitro difference observed between sup1 and sup2 was better growth of sup2 at host CO2 levels which may explain the difference in their ability to propagate and persist in the mouse lungs. However, due to the complexity of the host environment, there could be other unrecognized factors contributing to their growth difference in vivo.” (Lines 276279).

      About growth at different pH levels, C. neoformans tends to grow better at lower pH, closer to pH 5. This fungus can grow at pH 3, the lowest pH that our lab had tested (it may be able to sustain viability even at pH 2 based on others’ conference presentations). The high temperature/CO2 combined with neutral or high pH likely causes worse growth of both H99 and the mutants tested.

      We tried making a model to integrate all the pathways and factors identified in this work as the reviewer suggested. However, in this process, we found it difficult to propose a model. Although the current findings clearly demonstrate the importance of Cbk1 in thermotolerance and CO2 tolerance (overexpression of CBK1 can partially restore thermotolerance and/or CO2 tolerance in the mutants defective in the cell wall integrity pathway, the calcineurin pathway or the Cdc24-Ras1 pathway, and that the reciprocal overexpression of these genes in the cbk1∆ mutant does not rescue any of the cbk1∆ mutant’s defects), we do not know the exact mechanisms underlying this phenomenon. Do these pathways directly interact with Cbk1, affect its phosphorylation status, or alter its subcellular localization? Or do these pathways act through some other massagers to indirectly activate Cbk1 or maybe Cbk1’s downstream targets? These are the questions that warrant further investigations in the future. To be prudent, we think it is better not to propose a model at this point given the uncertainty of the mechanism. The mutants belonging to each of the pathways are clearly specified in the texts in this revised manuscript to help orient the readers. For example “As the RAM pathway effector kinase mutant cbk1Δ showed the most severe defect in thermotolerance and CO2 tolerance compared to the mutants of the other pathways, we first overexpressed the gene CBK1 in the following mutants, cdc24∆ (Ras1-Cdc24), mpk1∆ (CWI), cna1∆ (Calcineurin), and the cbk1Δ mutant itself, and observed their growth at host temperature and host CO2 (Figure 2B)...”

    1. Reviewer #3 (Public Review):

      To motivate the proposal, Karageorgiou et al. first identify a problem in applying current multivariable MR (MVMR) methods with many correlated exposures. I believe this problem can really be broken into two pieces. The first is that MVMR suffers from weak instrument bias. The second is that some traits may have nearly co-linear genetic associations, making it hard to disentangle which trait is causal. These problems connect in that inclusion of co-linear traits amplifies the problem of weak instrument bias - traits that are nearly co-linear with another trait in the study will have no or very few conditionally strong instruments.<br /> The authors then propose a solution: Apply a dimension reduction technique (PCA or sparse PCA) to the matrix of GWAS effect estimates for the exposures. The identified new components can then be used in MVMR in place of the directly measured exposures.

      I think that the identified problem is timely and important. I also like the idea of applying dimension reduction techniques to GWAS effect estimates. However, I don't think that the manuscript in its current form achieves the goals that it has set out. Specifically, I will outline the weaknesses of the work in three categories:<br /> 1. The causal effects measured using this method are poorly defined.<br /> 2. The description of the method lacks important details.<br /> 3. Applied and simulation results are unconvincing.<br /> I will describe each of these in more detail below.

      1. To me, the largest weakness of this paper is that it is not clear how to interpret the putatively causal effects being measured. The authors describe the method as measuring "the causal effect of the PC on outcome" but it is not obvious what this means.

      One possible implication of this statement is that the PC is a real biological variable (say some hidden regulator) that can be directly intervened on. If this is the intention it should be discussed. However, this situation would imply that there is one correct factorization and there is no guarantee that PCs (or sparse PCs) come close to capturing that.

      The counterfactual implied by estimating the effects of PCs in MVMR is that it is possible to intervene on and alter one PC while holding all other PCs constant.<br /> In the introduction, the authors note (and I agree) that one weakness of MR applied to correlated traits is that "MVMR models investigate causal effects for each individual exposure, under the assumption that it is possible to intervene and change each one whilst holding the others fixed." However, it is not obvious that altering one PC while holding the others constant is more reasonable.

      2. This section combines a few items that I found unclear in the methods section. The most critical one is the lack of specification on how to select instruments.<br /> For the lipids application, the authors state that instruments were selected from the GLGC results, however, these only include instruments for LDL, HDL, and TG, so 1) it would not be possible to include variants that were independently instruments for one of the component traits alone and 2) there would be no instruments for the amino acids. There is no discussion of how instruments should be selected in general.<br /> This choice could also have a dramatic impact on the PCs estimated. The first PC is optimized to explain the largest amount of variance o of the input data which, in this case, is GWAS effect estimates. This means that the number of instruments for each trait included will drive the resulting PCs. It also means that differences in scaling across traits could influence the resulting PCs.

      The other detail that is either missing or which I missed is what is used as the variant-PC association in the MVMR analysis. Specifically, is it the PC loadings or is it a different value? Based on the computation of the F-statistic I suspect the former but it is not clear. If this is the case, what is the effect of using loadings that have been shrunk via one of the sparse methods? It would be nice to see a demonstration of the bias and variance of the resulting method, though it is not clear to me what the "truth" would be.

      3. In the lipids application, the fact that M.LDL.PL changes sign in MVMR analysis are offered as evidence of multicollinearity. I would generally associate multicollinearity with large variance and not bias. Perhaps the authors could offer some more insight on how multicollinearity would cause the observation.<br /> A minor point of confusion: I was unable to interpret this pair of sentences "Although the method did not identify any of the exposures as significant at Bonferroni-adjusted significance level, the estimate for M.LDL.PL is still negative but closer to zero and not statistically significant. The only trait that retains statistical significance is ApoB." The first sentence says that none of the exposures were significant while the second sentence says that Apo B is significant. The GRAPPLE results don't seem clearly bad, indeed if only Apo B is significant, wouldn't we conclude that of the 118 exposures, only Apo B is causal for heart disease? It would help to discuss more how the conclusions from the PC-based MVMR analysis compare to the conclusions from GRAPPLE.

      It is a bit hard to interpret Table 4. I wasn't able to fully determine what "VLD, LDL significance in MR" means here. From the text, it seems that it means that any PC with a non-zero lodaing on VLDL or LDL traits was significant, however, this seems like a trivial criterion for the PCA method, since all PCs will be dense. This would mean this indicator only tells us whether and PCs were found to "cause" heart disease.

      In simulations, I may be missing something about the definition of a true and false positive here. I think this is similar to my confusion in the previous paragraph. Wouldn't the true and false positive rates as computed using these metrics depend strongly on the sparsity of the components? It is not clear to me what ideal behavior would be here. However, it seems from the description that if the truth was as in Fig 7 and two methods each yielded one dense component that was found to be causal for Y, these two methods would get the same "score" for true positive and false positive rate regardless of the distribution of factor loadings. One method could produce a factor that loaded equally on all exposures while the other produced a factor that loaded mostly on X1 and X2 but this difference would not be captured in the results.

  7. www.kernel.community www.kernel.community
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewers:

      1. General Statements

      We thank the reviewers for the comments and the suggestions. We hope that we have addressed all the queries raised by the reviewers in the revised manuscript. We provide a point-by-point response below. Please note that the line numbers indicated in parentheses correspond to the pdf file without the track changes display.

      2. Point-by-point description of the revisions


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: Srinivasan and co-workers developed an alternative screening method for defining the ability of FtsZ inhibitor to affect FtsZ polymerization. This alternative assay was defined considering the expertise of the authors on the topic, and they use Schizosaccharomyces pombe as a model for studying the effect of PC190723, sanguinarine and berberine on FtsZ assembly. The use of a heterologous expression system is useful for the evaluation of FtsZ coming from different strains, both Gram - and Gram +. The same model could gain insights also on the capability of FtsZ inhibitors to affect eukaryotic cell physiology. Finally, authors resulted also in suggesting a possible cause to suspected resistance to PC190723 from Gram - strains as E. coli.

      Major comments: • The conclusions are included in the discussion section and are quite convincing, for a general audience.

      We thank the reviewer for the positive comments.

      In my opinion, the authors should define which could be the limits of their method, since no data on the possible weaknesses are reported.

      RESPONSE: We have discussed the limitations of the methods as well. The discussion has been modified and the following sentences have been now included in the revised manuscript.

      “However, one of the major disadvantages of using fission yeast could be the need to use much higher concentrations of drugs than normally used for mammalian cell cultures to achieve an inhibitory effect. This could probably be due to the poor permeability of certain drugs in fission yeast because of its thick cell wall (Benko et al. 2017; Pérez and Ribas 2004). A similar effect of toxicity might arise at much lower concentrations in other eukaryotic cells, such as human cells. Consistently, while sanguinarine and berberine are known to affect the eukaryotic microtubules at 10 μΜ – 20 μM concentrations (Lopus and Panda 2006; Wang et al. 2016; Raghav et al. 2017), morphological effects on yeast cells were observed only at concentrations > 100 μM. However, yeast microtubules were not affected by berberine and sanguinarine. Differences in membrane lipid profiles and MDR efflux pumps between yeasts and mammalian cells might also contribute to differential resistance to the drugs being tested (Balzi and Goffeau 1991). Conversely, an inhibitory effect in yeast cells may not necessarily translate into toxicity in a human cell. These and the permeability of drugs in yeast cells represent an important caveat in using such heterologous expression systems for the screening of compounds against target molecules.”

      [Lines 498-513]

      As suggested in the later sections, we have also elaborated on the pros and cons of various methods including the yeast-based screening methods. [Lines 462-523]

      • No additional experiments are required to support the claims.

      • The suggested experiments could be quite easy to be realized for authors working in the microbiological field, and familiar with protein expression and purification, as well as bacteria and yeast growth.

      • From my side, even if I am not so expert in microbiology and plasmid/protein purification, the methods presented could be reproduced with no significant doubt.

      • Statistical analysis was done and seems to be adequate.

      RESPONSE: We thank the reviewer for these encouraging comments.

      Minor comments: • Prior studies should be deepened, especially for the state of art authors referred to. Additional paper, both reviews on the possible methods for evaluating FtsZ inhibition, as well as research papers on FtsZ inhibitors targeting E. coli and other Gram negative strains should be mentioned, since, in my opinion, these could move authors in changing a little bit the overall text of the manuscript.

      RESPONSE: We have now elaborated the state-of-art methods used for evaluation of FtsZ inhibition and cited the relevant papers and reviews. We have also included papers on development of FtsZ inhibitors, especially the ones similar to PC190723, targeting Gram-negative bacteria. The following sentences have been included in the revised manuscript.

      “Several approaches have been used to screen small molecules targeting bacterial cell division and FtsZ. While in vitro methods such as NMR (Domadia et al. 2007; Sun et al. 2014; Araújo‑Bazán et al. 2019) and crystallography (Läppchen et al. 2008; Fujita et al. 2017) are valuable and offer information on distinct binding sites, these are not efficient for screening. Electron microscopic examination can distinguish the effects of the compounds being tested on the FtsZ protofilament assembly and lateral associations (Nova et al. 2007; Kaul et al. 2012; Anderson et al. 2012; Sun et al. 2014; Huecas et al. 2017; Kumar et al. 2011; Park et al. 2014). Other techniques that are routinely used include fluorescence anisotropy (Ruiz‑Avila et al. 2013; Park et al. 2014), 90º light-scattering assay (Mukherjee and Lutkenhaus 1999) and dynamic light scattering (Hou et al. 2012; Di Somma et al. 2020) for assessing inhibition of FtsZ assembly (Kaul et al. 2012; Nova et al. 2007; Lui et al. 2019; Anderson et al. 2012, (Irwin et al. 2015). Other easily scalable high-throughput assays include FCS/FCCS and FRET-based methods (Hernández‑Rocamora et al. 2015; Mikuni et al. 2015; Reija et al. 2011).

      In vivo assays relying on cell filamentation phenotype coupled with the localization of Z-ring might be a good indicator of FtsZ being the direct target. However, since bacteria can undergo cell filamentation and not assemble FtsZ rings in response to a variety of conditions, including DNA damage (Mukherjee et al. 1998) and disruption of membrane potential (Strahl and Hamoen 2010), the in vivo assay is not so useful unless combined with the in vitro assays mentioned above. Finally, the isolation of resistance mutants in FtsZ to the drug can provide strong evidence of FtsZ being the direct target.

      Reconstitution systems are powerful and provide excellent control over the system, but they are emerging technologies and are technically challenging. Reconstitution systems include a variety of methods, such as the use of membrane nanodiscs, microbeads of different materials, supported bi-layer membranes (SLBs) and biomimetic systems that provide cell-like environments (Monterroso et al. 2013; Rivas et al. 2014).”

      [Lines 462-487]

      “Several compounds have been evaluated for their activity against FtsZ from both Gram-positive bacteria and Gram-negative bacteria. Although many exhibited only weak activity in vivo against Gram-negative bacteria, derivatives could be promising. These include benzamides (Haydon et al. 2008; Adams et al. 2011; Straniero et al. 2017, 2020a), trisubstituted benzimidazoles (Kumar et al. 2011), 4-bromo-1H-indazole derivatives (Wang et al. 2015), cinnamaldehyde and its derivatives (Domadia et al. 2007; Li et al. 2015), curcumin (Rai et al. 2008), heterocyclic molecules like guanidinomethyl biaryl compounds (Kaul et al. 2012), pyrimidine-quinuclidine scaffolds (Chan et al. 2013), 3-phenyl substituted 6,7-dimethoxyisoquinoline (Kelley et al. 2012), thiazole orange derivatives (Sun et al. 2017), viriditoxin (Wang et al. 2003), N-heterocycles such as zantrins and derivatives (Margalit et al. 2004; Nepomuceno et al. 2015).”

      [Lines 69-80]

      “Several efforts have been made to target Gram-negative bacteria with derivatives of benzamide. Examples include difluorobenzamides, substituted benzodioxanes, heterocyclic and non-heterocyclic derivatives (Straniero et al. 2017; Chai et al. 2020; Straniero et al. 2020a, 2020b). Although many exhibited promising activity in vitro, most were substrates for the AcrAB class of efflux pumps (Chai et al. 2020; Kaul et al. 2014; Straniero et al. 2020a, 2020b; Casiraghi et al. 2020). Thus, the poor membrane permeability, signature outer membrane, particularly lipopolysaccharide (LPS) structure (Wang et al. 2021), the presence of multiple efflux pumps in species such as E. coli, Klebsiella pneumonia and Pseudomonas aeruginosa (Piddock 2006), and differences in FtsZ sequences in the binding-site (Kaul et al. 2013b; Miguel et al. 2015) have been cited as reasons for lack of susceptibility of Gram-negative bacteria to benzamide derivatives (Casiraghi et al. 2020). More recently, two molecules, TXA6101 and TXY6129, with substituted 2,6-difluorobenzamide scaffold, have been shown to inhibit the polymerization of both E. coli and Klebsiella pneumoniae FtsZ. Moreover, despite being substrates for efflux pumps, TXA6101 induced morphological changes in K. pneumoniae (Rosado‑Lugo et al. 2022). Studies in the past on the effects of PC190723 on E. coli have been confusing, with a few reports suggesting an effect on FtsZ polymerization resulting in cell filamentation (Kaul et al. 2014), while others did not find any effect on EcFtsZ (Andreu et al. 2010; Anderson et al. 2012; Khare et al. 2019)⁠. The outer membrane has been shown to be a permeability barrier for PC190723 in E. coli (Khare et al. 2019; Chai et al. 2020). In addition, the Resistance-Nodulation-Division (RND) family of efflux pumps has been attributed to resistance against 2,6-difluorobenzamide derivatives, including TX436 (a prodrug of PC190723) in Gram-negative bacteria (Kaul et al. 2014).”

      [Lines 527-550]

      The whole text requires a deep check for grammar and word choice. Some sentences should be re-written since it is not so easy to understand their meaning. Figures are clear, even if I am not so convinced on the need of including Figure 1.

      RESPONSE: We have now deleted Figure 1 and 2 (as also suggested by Reviewer #2), revised the manuscript and have re-written certain long sentences. We have used Grammarly to check for grammatical errors. We hope the manuscript is easier to follow with these changes.

      Reviewer #1 (Significance (Required)):

      • In my opinion, the outcome coming from this work could move researchers in evaluating an alternative method for assessing FtsZ inhibition. Nevertheless, the actual state of art, a few reviews of the last years confirm this, already underlined a huge number of possible assays, both microbiological, biochemical, biophysical, physiological, or other. As a result, the authors did not result in convincing me about the importance of their methods, when compared to others. They may include some other possible assays and comment of the differences, pros and cons.

      RESPONSE: Several alternative methods have been evaluated and several excellent reviews published in the recent past have underlined the importance of these multiple methods to screen and validate small molecules targeting FtsZ. As suggested by the reviewer here and above, we have now discussed these methods including the yeast-based assay we describe, their advantages and limitations in the revised manuscript.

      The following lines have now been included in Introduction.

      “Several methods have been used to ascertain FtsZ as the target of the drug, and the various approaches have been reviewed in detail by many (Kusuma et al. 2019; Silber et al. 2020; Zorrilla et al. 2021; Andreu et al. 2022). Andreu et al. (2022) have recently proposed a streamlined experimental protocol for the screening and characterization of FtsZ inhibitors.”

      Introduction – [Lines 113-117]

      The following paragraphs, including ones as mentioned above have included in the discussion sections of the revised manuscript.

      “Several approaches have been used to screen small molecules targeting bacterial cell division and FtsZ. While in vitro methods such as NMR (Domadia et al. 2007; Sun et al. 2014; Araújo‑Bazán et al. 2019) and crystallography (Läppchen et al. 2008; Fujita et al. 2017) are valuable and offer information on distinct binding sites, these are not efficient for screening. Electron microscopic examination can distinguish the effects of the compounds being tested on the FtsZ protofilament assembly and lateral associations (Nova et al. 2007; Kaul et al. 2012; Anderson et al. 2012; Sun et al. 2014; Huecas et al. 2017; Kumar et al. 2011; Park et al. 2014). Other techniques that are routinely used include fluorescence anisotropy (Ruiz‑Avila et al. 2013; Park et al. 2014), 90º light-scattering assay (Mukherjee and Lutkenhaus 1999) and dynamic light scattering (Hou et al. 2012; Di Somma et al. 2020) for assessing inhibition of FtsZ assembly (Kaul et al. 2012; Nova et al. 2007; Lui et al. 2019; Anderson et al. 2012, (Irwin et al. 2015). Other easily scalable high-throughput assays include FCS/FCCS and FRET-based methods (Hernández‑Rocamora et al. 2015; Mikuni et al. 2015; Reija et al. 2011).

      In vivo assays relying on cell filamentation phenotype coupled with the localization of Z-ring might be a good indicator of FtsZ being the direct target. However, since bacteria can undergo cell filamentation and not assemble FtsZ rings in response to a variety of conditions, including DNA damage (Mukherjee et al. 1998) and disruption of membrane potential (Strahl and Hamoen 2010), the in vivo assay is not so useful unless combined with the in vitro assays mentioned above. Finally, the isolation of resistance mutants in FtsZ to the drug can provide strong evidence of FtsZ being the direct target.

      Reconstitution systems are powerful and provide excellent control over the system, but they are emerging technologies and are technically challenging. Reconstitution systems include a variety of methods, such as the use of membrane nanodiscs, microbeads of different materials, supported bi-layer membranes (SLBs) and biomimetic systems that provide cell-like environments (Monterroso et al. 2013; Rivas et al. 2014). While in vitro biochemical assays and reconstitution systems are useful to find molecules that directly target FtsZ, they are cumbersome and need to be performed at optimal physiological pH and ionic conditions, which can be considerably variable among FtsZ from different species.

      Our results on the ability of sanguinarine and berberine to specifically affect the assembly of FtsZ and not MreB in fission yeast highlight the utility of the heterologous expression system as a platform to identify molecules that specifically affect FtsZ polymerization. The yeast platform offers a cellular context mimicking the cytoplasm for cytoskeletal assembly. The system is simple to replicate in any laboratory, including those focused on chemical synthesis with minimum microbiological expertise and can be easily reproduced and scaled up as well. However, one of the major disadvantages of using fission yeast could be the need to use much higher concentrations of drugs than normally used for mammalian cell cultures to achieve an inhibitory effect. This could probably be due to the poor permeability of certain drugs in fission yeast because of its thick cell wall (Benko et al. 2017; Pérez and Ribas 2004). A similar effect of toxicity might arise at much lower concentrations in other eukaryotic cells, such as human cells. Consistently, while sanguinarine and berberine are known to affect the eukaryotic microtubules at 10 μΜ – 20 μM concentrations (Lopus and Panda 2006; Wang et al. 2016; Raghav et al. 2017), morphological effects on yeast cells were observed only at concentrations > 100 μM. However, yeast microtubules were not affected by berberine and sanguinarine. Differences in membrane lipid profiles and MDR efflux pumps between yeasts and mammalian cells might also contribute to differential resistance to the drugs being tested (Balzi and Goffeau 1991). Conversely, an inhibitory effect in yeast cells may not necessarily translate into toxicity in a human cell. These and the permeability of drugs in yeast cells represent an important caveat in using such heterologous expression systems for the screening of compounds against target molecules. However, notwithstanding this caveat, the heterologous system provides significant advantages in assessing the direct effects of the drug on FtsZ assembly. Moreover, fission yeast-based high-throughput platform screening methods using imaging have been successfully adapted to the screening of drugs against HIV-1 proteases by large-scale screening facilities such as the NIH Molecular Libraries Probe Production Centers Network in the Molecular Libraries Program, leading to several candidate drugs (Benko et al. 2017, 2019).”

      Discussion - [Lines 462-519]

      “A powerful emerging technique based on cytological profiling has been successfully used to identify the cellular pathways targeted by the inhibitors (Nonejuie et al. 2013; Martin et al. 2020), including cell division inhibition by FtsZ (Araújo‑Bazán et al. 2016). The recent advances in computational image analysis and deep learning approaches (von Chamier et al. 2021; Spahn et al. 2022) could further advance image-based screening for FtsZ inhibitors (Andreu et al. 2022).”

      Discussion – [Lines 581-586]

      As I mentioned before, there are a lot of reviews including the possible tests to perform for assessing FtsZ inhibition. A recent one was not cited, but, from my side, it should be mentioned (10.3390/antibiotics10030254).

      The suggested article is an excellent review that in addition to providing an overview of the state-of-art methods currently in practice for screening drugs targeting FtsZ, also suggests other emerging technologies suitable for assay development. We had cited this article (Zorrilla et al., 2021; doi: 10.3390/antibiotics10030254) in other contexts in our original manuscript but inadvertently missed in the text while mentioning the methods for screening.

      We have now cited Zorrilla et al., 2021 at all appropriate places in the revised manuscript. In addition, we have also cited (Monterroso 2013; https://doi.org/10.1016/j.ymeth.2012.12.014); (Rivas 2014; https://doi.org/10.1016/j.cbpa.2014.07.018); Kusuma 2019 (doi: 10.1021/acsinfecdis.9b00055); Schaffner-Barbero 2012 (doi: 10.1021/cb2003626); Silber et al 2020 (doi: 10.2217/fmb-2019-0348); Li et al., 2015 (doi: 10.1016/j.ejmech.2015.03.026); Casiraghi et al 2020 (doi: 10.3390/antibiotics9020069); Andreu et al., 2022 (10.3390/biomedicines10081825)

      Moreover, I think authors should reconsidered novel research papers, in which researchers evaluated the reason behind the apparent inactivity of benzamide derivatives, similar to PC190723, towards Gram negative strains.

      RESPONSE: Several novel papers that have reported reason for the inactivity of benzamide derivatives towards Gram-negative bacteria, including PC190723 have now been cited. The following sentences have been now included in the revised manuscript.

      “Several efforts have been made to target Gram-negative bacteria with derivatives of benzamide. Examples include difluorobenzamides, substituted benzodioxanes, heterocyclic and non-heterocyclic derivatives (Straniero et al. 2017; Chai et al. 2020; Straniero et al. 2020a, 2020b). Although many exhibited promising activity in vitro, most were substrates for the AcrAB class of efflux pumps (Chai et al. 2020; Kaul et al. 2014; Straniero et al. 2020a, 2020b; Casiraghi et al. 2020). Thus, the poor membrane permeability, signature outer membrane, particularly lipopolysaccharide (LPS) structure (Wang et al. 2021), the presence of multiple efflux pumps in species such as E. coli, Klebsiella pneumonia and Pseudomonas aeruginosa (Piddock 2006), and differences in FtsZ sequences in the binding-site (Kaul et al. 2013b; Miguel et al. 2015) have been cited as reasons for lack of susceptibility of Gram-negative bacteria to benzamide derivatives (Casiraghi et al. 2020). More recently, two molecules, TXA6101 and TXY6129, with substituted 2,6-difluorobenzamide scaffold, have been shown to inhibit the polymerization of both E. coli and Klebsiella pneumoniae FtsZ. Moreover, despite being substrates for efflux pumps, TXA6101 induced morphological changes in K. pneumoniae (Rosado‑Lugo et al. 2022). Studies in the past on the effects of PC190723 on E. coli have been confusing, with a few reports suggesting an effect on FtsZ polymerization resulting in cell filamentation (Kaul et al. 2014), while others did not find any effect on EcFtsZ (Andreu et al. 2010; Anderson et al. 2012; Khare et al. 2019)⁠. The outer membrane has been shown to be a permeability barrier for PC190723 in E. coli (Khare et al. 2019; Chai et al. 2020). In addition, the Resistance-Nodulation-Division (RND) family of efflux pumps has been attributed to resistance against 2,6-difluorobenzamide derivatives, including TX436 (a prodrug of PC190723) in Gram-negative bacteria (Kaul et al. 2014).”

      [Lines 527-550]

      Researchers working on FtsZ inhibitors could be interested in this paper, especially microbiologists.

      I specifically work on the design, synthesis and evaluation of the microbiological assays performed by others on my compounds.

      ========================================================================

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Dr. Srinivasin and colleagues previously developed a system where they expressed bacterial FtsZ in yeast and showed that it could assemble into polymers related to the Z rings. Here they develop this system further as a way to assay for drugs that may poison FtsZ, which would be candidates for new antibiotics. They test three drugs against three species of FtsZ. The results suggest that this system should be useful in screening new drugs that may target FtsZ. I would recommend publication after addressing a number of concerns and apparent contradictions.

      Fig. 1 showing chemical formulas of the drugs, and Fig. 2 showing a schematic of the yeast expression system, are probably not needed.

      RESPONSE: Reviewer #1 had also made a similar suggestion and we have now deleted these two figures (Fig. 1 and Fig. 2 in the older version).

      The authors make a point that sanguinarine and berberine inhibit eukaryote cell morphology. In fact, what they show is that they affect yeast cell morphology. This may or may not extend to other eukaryotes. Also, other eukaryotic cells may be more sensitive to drugs than yeast. They should me more conservative in this claim that the system also screens for drugs effects on eukaryotes.

      RESPONSE: We agree with the reviewer’s suggestions here that other eukaryotic cells may be more sensitive to drugs than yeast. We have modified the statements pertaining to these claims in the revised manuscript.

      We have made the following changes in the revised version.

      The title of the manuscript has been now modified as “A salt bridge-mediated resistance mechanism to FtsZ inhibitor PC190723 revealed by a cell-based screen”.

      Lines 23-24 in the abstract has been modified to read as “The strategy also allows for simultaneous assessment of the toxicity of the drugs to eukaryotic yeast cells.”

      Other sentences modified in the revised version are:

      “We find that although sanguinarine and berberine affected FtsZ polymerization, they also affected yeast cell physiology”. [Lines 146-147]

      “In this study, we have attempted to develop a cell-based assay using fission yeast (S. pombe) as a heterologous expression host, which would enable the screening of compounds that could directly affect FtsZ polymerization as well as identify potential toxicity to yeast (or eukaryotic) cells simultaneously”. [Lines 444-447]

      “However, one of the major disadvantages of using fission yeast could be the need to use much higher concentrations of drugs than normally used for mammalian cell cultures to achieve an inhibitory effect. This could probably be due to the poor permeability of certain drugs in fission yeast because of its thick cell wall (Benko et al. 2017; Pérez and Ribas 2004). A similar effect of toxicity might arise at much lower concentrations in other eukaryotic cells, such as human cells”. [Lines 498-503]

      “Conversely, an inhibitory effect in yeast cells may not necessarily translate into toxicity in a human cell. These and the permeability of drugs in yeast cells represent an important caveat in using such heterologous expression systems for the screening of compounds against target molecules”. [Lines 510-513]

      Fig. 3 has some new structural data that should be explored more quantitatively. My quick measurement gave 0.5 and 0.8 µm for the outside diameters of Ec and Sa rings. The spirals of Hp seem to be 0.8 µm outside diameter, similar to SA rings. These spirals may be related to those reported by Popp and by Andreu under certain buffer conditions. This should be explored and referenced.

      RESPONSE: We have now quantitatively measured the diameters of the rings formed by EcFtsZ and SaFtsZ and the diameter and pitch of the spiral polymers of HpFtsZ. These have been now included in the results section and presented as a graph in a new figure (Supplementary Fig. S2). Please also note that the scale bar in Figure 1 (previously Figure 3) was erroneously marked as 5 µm. This has been corrected in the revised version to 2.5 µm.

      Also, the possibility that these spiral polymers may be related to those described by Popp and Andreu have been discussed. We included the following sentences in the discussion.

      “Previous studies have shown that various factors such as molecular crowding, variable C-terminal regions and bound nucleotide state lead to the formation of supramolecular structures like twisted helical structures, toroids and rings similar to those that have been observed in vivo (Popp et al. 2009; Huecas et al. 2017). Thus, the molecular crowding due to the dense cytoplasm of the yeast cells could have possibly induced the spiral and ring-like assembly of FtsZ polymers (Erickson et al. 2010).”

      [Lines 456-461]

      But Fig. 4 presents a contradiction. Here the Hp control cells show long smooth polymers, not helical. This seems an important difference and needs to be addressed. Are the polymers sometimes straight and sometimes helical? After finishing the paper I see that in some experiments the HP is helical, and in others the polymers are straight and smooth. I think it would be important to determine what favors the two forms. If this remains a mystery, at least address it openly.

      RESPONSE: This was definitely an oversight from the authors. We should have clearly mentioned this in the manuscript but completely missed the description of different polymers assembled by HpFtsZ.

      We have now described this clearly in the results and added a new Figure (Supplementary Fig. S1) showing a time course for the appearance of spiral and linear polymers. We have also replaced the images in Figure 5E.

      We have modified the results to read as:

      “Interestingly, HpFtsZ assembled into linear cable-like structures as well as twisted polymers that were curled and spiral in appearance (Fig. 1D). The spiral filaments were more clearly visualized by deconvolution of the images (Fig. 1D iii and 1E). Further, super-resolution imaging using 3D-SIM clearly revealed that HpFtsZ assembles into spiral filaments in fission yeast (Fig. 1F).”

      [Lines 171-175]

      We have also added the following lines in the results section:

      “Spiral polymers appeared early, at 16 – 18 hours after induction of expression (absence of thiamine), and linear cables appeared later at 20 – 22 hours (Fig. S1). The smooth linear polymers possibly arise from lateral association and bundling of FtsZ filaments (Monahan et al. 2009), but the factors determining the two forms in yeast cells remain unclear.”

      [Lines 175-179]

      I am concerned that the quantitation of drug inhibition in Fig 4, 5 is flawed. Visually from 4A it looks like ~90-100% of control cells have polymers, and sang reduces polymers by 70% for Sa and Ec and 100% for Hp: this is based on the number of spots and filaments I see in Fig. 4 Aii. But the quantitation in D shows only 17-23% reduction for all three. These numbers were based on determining the fraction of cells that showed polymer (spots or lines) vs diffuse. It seems that cells are counted as containing polymer even if they had a great reduction in spots or lines, but still had a few. E.g., 4Aii Sa has 4 cells, two of them with no spots, one with only 2, and one with ~7, which totals ~1/3 the spots in control cells. Categorizing cells with only a couple of spots as polymerized, seems to be a poor way to quantitate. Would it not be better to count all spots in all cells, or measure the total length of line polymers, as a measure of inhibition.

      RESPONSE: We agree with the reviewer here that number of spots or the length of the polymers would be a better quantitative measure of the effect of the drugs than the percentage of cells presented. In the revised manuscript, we now present quantified data as suggested.

      We have quantitated the number of spots per cell for SaFtsZ and total polymer length per cell for HpFtsZ to elucidate the effect of drugs on FtsZ polymers. The number of spots per cell were counted using built-in ImageJ macro OPS threshold IJ1 script which combines the otsu thresholding method and analyse particles plugin. The total polymer length per cell in the case HpFtsZ, was measured using used the lpx-plugins as described by Higaki (Higaki et al., 2017).

      In addition, using the lpx-plugins, we also quantify density, a measure of the amount cytoskeleton per unit area in a given cell (Henty-Ridilla et al., 2014; Higaki et al., 2017). We had previously used this measure successfully to quantify assembly of Spiroplasma citri MreB in fission yeast (Pande et al., 2022).

      The methodology has been described in detail in the Materials and Methods section under the heading – “Quantitation of the number of spots, polymer length and density”

      Lines [665-689]

      The new data has been included in the results (lines 207-231 and 275-284) and new Figures (Fig. 2 E, G and Fig. 3 G, H) have been added.

      Fig. 5 makes a convincing case that PC19 accelerates or enhances the polymerization of Sa and Hp. Fig. S2 shows that the structures of polymers are not changed when PC19 is added at 20 hrs, after polymers have already formed. It would have been nice to see for both 5A and S2A that the round spots had holes in the center, when imaged by SIM. Again the quantitation of cells as polymer vs diffuse seems ill suited, because it misses cells with a reduced number of spots.

      RESPONSE: We have imaged the FtsZ polymers of Sa and Hp in the presence of PC190723 using SIM and included these images as new panels in the figures. Figure 3C, 3F and Figure S4 in the revised manuscript.

      Again, for Figure 5 (Fig. 3 in the revised version), we have provided the quantitation as number of spots per cell, polymer length per cell and density (amount of cytoskeleton per unit area) as described above (new Figures - Fig. 3 G, H) in the revised manuscript.

      [Lines 275-284]

      Fig. 6 uses FRAP to show that PC reduces the dynamic exchange of Sa polymers by a factor of 3. It is remarkable to me that rapid exchange is not completely eliminated by PC. Regardless, it would be very important to reference the previous study of Adams..Errington 2011, where they showed the same thing for Foci in Bacillus. PC19 reduced the exchange from 3 to 10 s, but the foci were still very dynamic.

      RESPONSE: We had referenced this work in the original submission in the discussion section – “These results are also consistent with the earlier findings that PC190723 acts to induce FtsZ polymerization and stabilize FtsZ filaments (Andreu et al. 2010; Elsen et al. 2012; Miguel et al. 2015; Fujita et al. 2017) and its derivative compound, 8j acting to slow down FtsZ-ring turnover by 3-fold in B. subtilis (Adams et al. 2011).”

      [Lines 563-567] in revised manuscript

      We have now added the following statement and referenced Adams et al., 2011 in the results section as well.

      “Interestingly, compound 8j, a related benzamide derivative, has been shown to slow down FtsZ-ring turnover by 3-fold in B. subtilis (Adams et al. 2011).”

      [Lines 324-326]

      The analysis of the salt bridge as opposed to a single Arg or His being the cause of resistance to PC19 is an interesting addition to the study. In Fig. 8D some numbers do not agree between the caption and figure (R309/7; S226/7). The whole figure should be carefully checked.

      RESPONSE: We thank the reviewer for pointing to these. We have corrected these errors now in the revised version (Fig. 6).

      I am not familiar with the Gram -ve and Gram +ve nomenclature. Why not simply gram- and gram+?

      RESPONSE: We agree that Gram -ve / +ve are not standard notations and inappropriate.

      We have now written them as Gram-negative and Gram-positive throughout the text.

      The Discussion is quite long largely because it repeats items from Results and Introduction. It is also redundant to hype the value of this system in both Introduction and Discussion; The Introduction should be sufficient. The Discussion should be pared down by eliminating repetition and focusing on relating results to previous literature, in particular items that have not been referenced previously in the paper. Also, I think we don't need the final "In summary" paragraph. That is already nicely presented in the Abstract.

      RESPONSE: We have omitted the repetitive statements from the discussion. We have also deleted the final summary paragraph. We had added new paragraphs [lines 462-519] pertaining to previous literature (also suggested by Reviewer #1) to the discussion section in the revised manuscript.

      The authors should probably provide references to other studies that have used yeast expression to study assembly of FtsZ. I am thinking in particular of papers from the Osteryoung lab looking at chloroplast FtsZ.

      RESPONSE: We have now referenced other papers that have used yeast expression to study assembly of FtsZ.

      The following statement has been added to the introduction:

      “Moreover, the dynamics of chloroplast FtsZs have also been successfully studied using the heterologous fission yeast expression system (TerBush and Osteryoung 2012; Yoshida et al. 2016; TerBush et al. 2018).”

      Lines [132-134]

      NO PAGE NUMBERS. Authors should be penalized a week delay for submitting a mss without page numbers.

      RESPONSE: We sincerely apologise for this gross error and oversight and thank the reviewer for patiently reading through and reviewing a manuscript with no page numbers and line numbers. We are truly sorry for having submitted a manuscript as such and have now included page numbers and line numbers in the manuscript.

      Reviewer #2 (Significance (Required)):

      This work should be of interest to the broad field of research on FtsZ. The authors present it as a new platform for assaying drugs targeting FtsZ, and researchers in this area will certainly be interested. It will also be of broader interest for the novel assay of assembly and exchange dynamics and how they may be modulated by small molecules.

      ========================================================================

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors established a proof-of-concept assay to investigate the bacterial cytoskeletal protein FtsZ in fission yeast, and this heterologous yeast system is useful for compounds identification targeting FtsZ. The authors used this system to understand the mechanism of FtsZ's resistance to drug PC190723. Major comments: 1. From the study, the pombe seems to be a good system for investigating the bacterial cytoskeleton proteins and testing the drugs for them. However, to my knowledge it is not convincing that this is the proper system can be used to assessing the eukaryotic toxicity, since no toxicity to pombe does not mean no toxicity to human cells and vice versa.

      RESPONSE: We agree with the reviewer that toxicity to S. pombe cannot be directly extended to assessing toxicity to other eukaryotic cells such as human cells. As suggested by Reviewer#2 as well, we have modified these claims in the revised manuscript, discussed the possibilities and limited the scope of this work to assessing toxicity in yeast cells.

      We have made the following changes in the revised version.

      The title of the manuscript has been now modified as “A salt bridge-mediated resistance mechanism to FtsZ inhibitor PC190723 revealed by a cell-based screen”.

      Lines 23-24 in the abstract has been modified to read as “The strategy also allows for simultaneous assessment of the toxicity of the drugs to eukaryotic yeast cells.”

      Other sentences modified in the revised version are:

      “We find that although sanguinarine and berberine affected FtsZ polymerization, they also affected yeast cell physiology”. [Lines 146-147]

      “In this study, we have attempted to develop a cell-based assay using fission yeast (S. pombe) as a heterologous expression host, which would enable the screening of compounds that could directly affect FtsZ polymerization as well as identify potential toxicity to yeast (or eukaryotic) cells simultaneously”. [Lines 444-447]

      “However, one of the major disadvantages of using fission yeast could be the need to use much higher concentrations of drugs than normally used for mammalian cell cultures to achieve an inhibitory effect. This could probably be due to the poor permeability of certain drugs in fission yeast because of its thick cell wall (Benko et al. 2017; Pérez and Ribas 2004). A similar effect of toxicity might arise at much lower concentrations in other eukaryotic cells, such as human cells”. [Lines 498-503]

      “Conversely, an inhibitory effect in yeast cells may not necessarily translate into toxicity in a human cell. These and the permeability of drugs in yeast cells represent an important caveat in using such heterologous expression systems for the screening of compounds against target molecules”. [Lines 510-513]

      From figure 4A to 4C, there seems no big difference of cell morphology between control and drug treatment, except for Berberine treatment of SaFtsZ-GFP. Under the low concentration of Sanguinarine (20 µM) and Berberine (53.791 µm), the FtsZ polymerization was disrupted and seems no effect on cell morphology. Why would the authors use much higher Sanguinarine (135.95 µM) and Berberine (134.45 µM) to prove there two drugs are toxic to pombe cells?

      RESPONSE: Earlier reports had shown that sanguinarine and berberine affect mammalian microtubules (Lopus and Panda 2006 - DOI: 10.1111/j.1742-4658.2006.05227.x; Raghav et al., 2017 - DOI: 10.1021/acs.biochem.7b00101). While, we did not observe any growth defect in yeast cells, earlier studies have suggested that yeasts possibly require higher concentrations of certain drugs than used for mammalian cells due to the presence of the cell wall, particularly S. pombe (Perez and Ribas 2004 - https://doi.org/10.1016/j.ymeth.2003.11.020; Benko et al., 2017 - DOI: 10.1186/s13578-016-0131-5). We had thus explored the possibility of cell toxicity to yeast cells at higher concentrations of the drugs.

      The following lines have thus been added to the results section in the revised manuscript.

      “Although we did not observe any growth defect in yeast cells at lower concentrations of the drugs, earlier studies have suggested that yeast cells possibly require higher concentrations of drugs than used for mammalian cells due to the presence of the cell wall, which is particularly thick in S. pombe (Benko et al. 2017; Pérez and Ribas 2004). We thus explored the possibility of cell toxicity to yeast cells at higher concentrations of the drugs.”

      Lines [234-239]

      Sanguinarine and Berberine are FtsZ disruption drugs, do these drugs have effect on microtubule?

      RESPONSE: We have now examined the effect of Sanguinarine and Berberine on yeast microtubules as well and did not find any visible differences between the control and inhibitor (either low or high concentrations) treated cells. This data has been added as a new figure (Supplementary Fig. S3 A and B) in the revised manuscript and the following line added to the results.

      “However, even at higher concentrations, neither of the drugs showed any visible effect on yeast microtubules (Fig. S3 A and B).”

      [Lines 241-242]

      The discussion has been modified as follows:

      “However, one of the major disadvantages of using fission yeast could be the need to use much higher concentrations of drugs than normally used for mammalian cell cultures to achieve an inhibitory effect. This could probably be due to the poor permeability of certain drugs in fission yeast because of its thick cell wall (Benko et al. 2017; Pérez and Ribas 2004). A similar effect of toxicity might arise at much lower concentrations in other eukaryotic cells, such as human cells. Consistently, while sanguinarine and berberine are known to affect the eukaryotic microtubules at 10 μΜ – 20 μM concentrations (Lopus and Panda 2006; Wang et al. 2016; Raghav et al. 2017), morphological effects on yeast cells were observed only at concentrations > 100 μM. However, yeast microtubules were not affected by berberine and sanguinarine. Differences in membrane lipid profiles and MDR efflux pumps between yeasts and mammalian cells might also contribute to differential resistance to the drugs being tested (Balzi and Goffeau 1991). Conversely, an inhibitory effect in yeast cells may not necessarily translate into toxicity in a human cell. These and the permeability of drugs in yeast cells represent an important caveat in using such heterologous expression systems for the screening of compounds against target molecules.”

      [Lines 498-513]

      There are very few SaFtsZ-GFP dot structure in fig 5B, and this is inconsistent with the SaFtsZ-GFP dot structure in fig 4A. Fig 5D has the same issue compare to Fig 4Ci

      RESPONSE: We had probably not made it very clear the experimental differences between Figure 4 and 5 (Figure 2 and 3 in the revised manuscript), which has led to this apparent inconsistency.

      The strong nmt1 promoter (thiamine repressible) takes about 18 hours for full-induction in the absence of thiamine (Forsburg 1993 - https://doi.org/10.1093/nar/21.12.2955). We have utilised the medium strength nmt41 promoter in our studies and hence, in Figure 2, expression of FtsZ-GFP fusions were allowed for longer periods of time (22 – 24 hours) in the experiments concerning sanguinarine and berberine treatments.

      This has been now clearly mentioned in the revised version of the manuscript in the results section (lines 196-199) as well as in figure legends.

      In contrast the very few dot structures or polymers in Figure 3 (revised manuscript) is because of a shorter period of expression of FtsZ-GFP (12 – 14 hours in the absence of thiamine). The shorter period of expression time in these experiments allowed us to test if PC190723 indeed induced the polymerisation of FtsZ, at a stage when the control cells still exhibited diffuse fluorescence and had minimal FtsZ assembly. Thus, the cultures were allowed to express FtsZ for a shorter period of time and imaged in the case of experiments presented in Figure 3.

      This has been now clearly mentioned in the results (lines 259-263) as well as in figure legends in the revised manuscript.

      We hope that we have now made these experimental differences clear and provide more clarity. We have also included this information (hours of induction) in the figure panel.

      The concentration of PC190723 the author used is 20 µg/ml, which is enough for disrupting FtsZ function, however according to the Sanguinarine and Berberine experiments, the author may use higher concentration of PC190723 to assess its toxicity to pombe cells. Same as Sanguinarine and Berberine, does PC190723 has effect on microtubule?

      RESPONSE: As suggested by the reviewer, we have tested the effect of PC190723 at a higher concentration (140.6 µM) similar to that of Sanguinarine and Berberine. We did not find any morphological changes in yeast upon treatment with higher concentrations of PC190723. Also, the drug did not seem to affect the yeast microtubules. These have been now included in the results section and new images have been added in the figure (Supplementary Fig. S3).

      The following lines have been added in the revised manuscript to the results section:

      “Earlier studies had reported that PC190723 was non-toxic to eukaryotic cells, including budding yeast (Haydon et al. 2008). We further tested if PC190723 resulted in morphological defects in S. pombe, like sanguinarine and berberine, at higher concentrations. However, consistent with the earlier reports, PC190723 was inactive against S. pombe at both 56.2 μM and 140.6 μM and did not cause any morphological changes (Fig. 2H iv). Further, PC190723 did not disrupt the yeast microtubules at either of the concentrations (Fig. S3 A iv and B iv).”

      [Lines 294-300]

      The authors mentioned much higher concentrations of drugs than normally used for mammalian cell cultures have to be used for fission yeast. Is there any criterion for this?

      RESPONSE: In the discussion section, we had mentioned that “Much higher concentrations of drugs than normally used for mammalian cell cultures have to be used for fission yeast probably due to permeability issues because of the presence of a thick cell wall (Benko 2017 - DOI: 10.1186/s13578-016-0131-5).

      This has now been mentioned in the results as well in the revised manuscript.

      “Although we did not observe any growth defect in yeast cells at lower concentrations of the drugs, earlier studies have suggested that yeast cells possibly require higher concentrations of drugs than used for mammalian cells due to the presence of the cell wall, which is particularly thick in S. pombe (Benko et al. 2017; Pérez and Ribas 2004). We thus explored the possibility of cell toxicity to yeast cells at higher concentrations of the drugs.”

      [Lines 234-239]

      The following lines in the discussion have been modified in the revised manuscript to read as – “However, one of the major disadvantages of using fission yeast could be the need to use much higher concentrations of drugs than normally used for mammalian cell cultures to achieve an inhibitory effect. This could probably be due to the poor permeability of certain drugs in fission yeast because of its thick cell wall (Benko et al. 2017; Pérez and Ribas 2004). A similar effect of toxicity might arise at much lower concentrations in other eukaryotic cells, such as human cells.”

      [Lines 498-503]

      Minor comments: 1. There are two units used for drug concentration µM for Sanguinarine and Berberine and µg/ml for PC190723, I think they should be consistent.

      We have now used µM for all drugs.

      Check the units (µM and µg/ml) italic in text and figure legend.

      We have now used µM for all drugs and corrected the italics. We apologise for the erroneous usage of italics in the text for µM.

      Reviewer #3 (Significance (Required)):

      The authors provided a proof-of-concept assay for studying bacterial cytoskeleton proteins in yeast cells. This idea will facilitate people to investigate the bacterial cytoskeleton proteins and also find compounds targeting them without affecting the yeast cells. This study will provide different perspectives to people who study cell biology and secondary metabolites discovery.

      We hope that we have satisfactorily addressed all the concerns raised by the reviewers in the revised manuscript.

      Thanking you,

      With Regards

      Dr. Ramanujam Srinivasan

      Dr. Pananghat Gayathri

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Dr. Srinivasin and colleagues previously developed a system where they expressed bacterial FtsZ in yeast and showed that it could assemble into polymers related to the Z rings. Here they develop this system further as a way to assay for drugs that may poison FtsZ, which would be candidates for new antibiotics. They test three drugs against three species of FtsZ. The results suggest that this system should be useful in screening new drugs that may target FtsZ. I would recommend publication after addressing a number of concerns and apparent contradictions.

      Fig. 1 showing chemical formulas of the drugs, and Fig. 2 showing a schematic of the yeast expression system, are probably not needed.

      The authors make a point that sanguinarine and berberine inhibit eukaryote cell morphology. In fact, what they show is that they affect yeast cell morphology. This may or may not extend to other eukaryotes. Also, other eukaryotic cells may be more sensitive to drugs than yeast. They should me more conservative in this claim that the system also screens for drugs effects on eukaryotes.

      Fig. 3 has some new structural data that should be explored more quantitatively. My quick measurement gave 0.5 and 0.8 µm for the outside diameters of Ec and Sa rings. The spirals of Hp seem to be 0.8 µm outside diameter, similar to SA rings. These spirals may be related to those reported by Popp and by Andreu under certain buffer conditions. This should be explored and referenced.

      But Fig. 4 presents a contradiction. Here the Hp control cells show long smooth polymers, not helical. This seems an important difference and needs to be addressed. Are the polymers sometimes straight and sometimes helical? After finishing the paper I see that in some experiments the HP is helical, and in others the polymers are straight and smooth. I think it would be important to determine what favors the two forms. If this remains a mystery, at least address it openly.

      I am concerned that the quantitation of drug inhibition in Fig 4, 5 is flawed. Visually from 4A it looks like ~90-100% of control cells have polymers, and sang reduces polymers by 70% for Sa and Ec and 100% for Hp: this is based on the number of spots and filaments I see in Fig. 4 Aii. But the quantitation in D shows only 17-23% reduction for all three. These numbers were based on determining the fraction of cells that showed polymer (spots or lines) vs diffuse. It seems that cells are counted as containing polymer even if they had a great reduction in spots or lines, but still had a few. E.g., 4Aii Sa has 4 cells, two of them with no spots, one with only 2, and one with ~7, which totals ~1/3 the spots in control cells. Categorizing cells with only a couple of spots as polymerized, seems to be a poor way to quantitate. Would it not be better to count all spots in all cells, or measure the total length of line polymers, as a measure of inhibition.

      Fig. 5 makes a convincing case that PC19 accelerates or enhances the polymerization of Sa and Hp. Fig. S2 shows that the structures of polymers are not changed when PC19 is added at 20 hrs, after polymers have already formed. It would have been nice to see for both 5A and S2A that the round spots had holes in the center, when imaged by SIM. Again the quantitation of cells as polymer vs diffuse seems ill suited, because it misses cells with a reduced number of spots.

      Fig. 6 uses FRAP to show that PC reduces the dynamic exchange of Sa polymers by a factor of 3. It is remarkable to me that rapid exchange is not completely eliminated by PC. Regardless, it would be very important to reference the previous study of Adams..Errington 2011, where they showed the same thing for Foci in Bacillus. PC19 reduced the exchange from 3 to 10 s, but the foci were still very dynamic.

      The analysis of the salt bridge as opposed to a single Arg or His being the cause of resistance to PC19 is an interesting addition to the study. In Fig. 8D some numbers do not agree between the caption and figure (R309/7; S226/7). The whole figure should be carefully checked.

      I am not familiar with the Gram -ve and Gram +ve nomenclature. Why not simply gram- and gram+?

      The Discussion is quite long largely because it repeats items from Results and Introduction. It is also redundant to hype the value of this system in both Introduction and Discussion; The Introduction should be sufficient. The Discussion should be pared down by eliminating repetition and focusing on relating results to previous literature, in particular items that have not been referenced previously in the paper. Also, I think we don't need the final "In summary" paragraph. That is already nicely presented in the Abstract.

      The authors should probably provide references to other studies that have used yeast expression to study assembly of FtsZ. I am thinking in particular of papers from the Osteryoung lab looking at chloroplast FtsZ.

      NO PAGE NUMBERS. Authors should be penalized a week delay for submitting a mss without page numbers.

      Significance

      This work should be of interest to the broad field of research on FtsZ. The authors present it as a new platform for assaying drugs targeting FtsZ, and researchers in this area will certainly be interested. It will also be of broader interest for the novel assay of assembly and exchange dynamics and how they may be modulated by small molecules.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01680

      Corresponding author(s): Woo Jae, Kim

      1. General Statements The goal of this study is to provide the groundwork for future studies of genetically controlled neuronal regulation of ‘interval timing’ through the provision of a behavioral paradigm. Interval timing, or the sense of time in the seconds to hours range, is important in foraging, decision making, and learning in humans via activation of cortico-striatal circuits. Interval timing requires completely distinct brain processes from millisecond or circadian timing. In summary, interval timing allows us to subjectively sense the passage of physical time, allowing us to integrate action sequences, thoughts, and behavior, detect developing trends, and predict future consequences.

      Many researchers have tried to figure out how animals, including humans, can estimate time intervals with such precision. However, most investigations have been conducted in the realm of psychology rather than biology thus far. Because the study of interval timing was limited in its ability to intervene in the human brain, many psychologists concentrated on developing convincing theoretical models to explain the known occurrence of interval timing.

      To overcome the limits of studying interval timing in terms of genetic control, we have reported that the time investment strategy for mating in Drosophila males can be a suitable behavioral platform to genetically dissect the principle of brain circuit mechanism for interval timing. For example, we previously reported that males prolong their mating when they have previously been exposed to rivals (Kim, Jan & Jan, "Contribution of visual and circadian neural circuits to memory for prolonged mating induced by rivals" Nature Neuroscience, 2012), and this behavior is regulated by visual stimuli, clock genes, and neuropeptide signaling in a subset of neurons (Kim, Jan & Jan, “A PDF/NPF Neuropeptide Signaling Circuitry of Male Drosophila melanogaster Controls Rival-Induced Prolonged Mating” Neuron, 2013).

      Throughout their lives, all animals must make decisions in order to optimize their utility function. Male reproductive success is determined by how many sperms successfully fertilize an egg with a restricted number of investment resources. To optimize male reproductive fitness, a time investment strategy has been devised. As a consequence, we believe that the flexible responses of mating duration to different environmental contexts in Drosophila males might be an excellent model to investigate neural circuits for interval timing.

      One of the most well-known features of human interval timing is the association of different sensory inputs with perception of time intervals, which influences our estimate of time intervals. Therefore, the first step toward comprehending the neural regulation of interval timing is to dissect the role that numerous sensory inputs play in determining the time duration. In this article, we discuss a different time-investment strategy adopted by males, called "Shorter-Mating-Duration" (SMD). According to our findings, male Drosophila with more sexual experience had shorter mating duration. During our investigation into the sensory inputs for SMD behavior, we found a small number of cells that express sugar receptors and pheromone receptors (ppk25 and ppk29) and thus transmit the multisensory information from females in order to generate memories of sexual experiences, which will determine the final decision of mating duration.

      Our discovery of sensory integration mechanisms associated with complex behavioral trait in male Drosophila at the brain circuit and genetic network levels will be a huge step forward in our knowledge of interval timing behavior.

      Description of the planned revisions

      REVIEWER #1

        • Overall I think this would be difficult for a general audience as the rationale and explanation of experiments needs to be clearer. * Answer: During the revision process, we will make our text more legible for wide audiences.

      REVIEWER #2

        • 'The knockdown of LUSH, an odorant-binding protein' Lush is expressed in trichoid sensilla in olfactory organs , from the beginning, they exclude the role of olfaction and later one they said 'suggesting that the expression of the pheromone sensing proteins LUSH and Snmp1 in Gr5a-positive gustatory neurons is critical for generating SMD behavior.' ? Therefore, I recommend If available, please provide a reference for the statement in the Methods section that the Orco1 line was "validated via electrophysiology", or include the electrophysiology data itself in this manuscript as supplementary figure. Ideally, positive behavioral controls for this line would also be included in the manuscript. * Answer: We value the reviewer's concern. LUSH has been discovered as an odorant-binding protein; nevertheless, current research suggests that LUSH may be involved in the sensing of additional pheromones to cVA, implying the presence of a lush-independent cVA detection mechanism [1]. Billeter et al. demonstrated in their paper that LUSH detects a female stimulatory chemical and modifies male mating latency (Fig. 2 of Billeter at al.). As Billeter et al. stated, our present understanding of pheromonal recognition in Drosophila is insufficient, and we concur. As a result, we attempted to validate the expression of Snmp1 in the male leg by experiments (Fig. 7I-J) performing sncRNA seq analysis on the Fly SCope dataset, as shown in Fig. 12. As demonstrated in Fig.12, Snmp1 and LUSH is higly expressed fly leg and wing system. Future study will look at the roles of Snmp1 and LUSH in female pheromone sensing, as well as PPK receptors.

      Following the reviewer's advice, we will repeat the electrophysiologically validated Orco2 mutant phenotype with proper control and attach it when we submit the complete revision to the journal.

      • What is this (GustDx6)? I suggest using Poxn mutant line. *

      Answer: We value the reviewer's recommendation. We believe we have previously demonstrated that the Gr5a-mediated gustatory pathway is essential for the generation of sensory input for SMD behavior, but we will test the Poxn mutant and Poxn-RNAi to replace the GustDx6 mutant result.

      Description of the revisions that have already been incorporated in the transferred manuscript

      REVIEWER #1

      1. My copy of this ms does not have page numbers or line numbers, this makes it extremely difficult to identify where I am making queries/ suggestions. I don't know whether this is a decision of the journal or authors, but please change this in the future.* Answer: We put page numbers and line numbers.

      2. A general point, there is simply too much in this ms. It covers too much ground and so doesn't give proper descriptions, discuss the consequences of the data fully or integrate properly with existing literature. Quantity does not equal impact. *

      Answer: We appreciate the reviewer's insight. We have previously separated this document from our original preprint [2] in response to a prior reviewer's advice; we believe we have included too much data, which may confuse readers. As a result, we will delete all of the mechanosensory/thermosensory receptor screening data from our present paper and write a second manuscript on sensory integration for the production of SMD behavior. We also removed the most of sncRNA seq data analysis except Fig.12 which confirms our finding in a single diagram.

      • Results paragraph 1 says that white mutant background had no effect "unlike that of LMD behavior as reported previously", ignoring that there has been a contrary report that extension of mating duration after exposure to a rival does not involve visual cues and so is not affected by the white mutation (Bretman et al 2011 Curr Biol). *

      Answer: We recognize that there is a conflicting report concerning white mutation on LMD behavior, however because we are now reporting SMD rather than LMD behavior, we deleted the statement comparing white mutant results to earlier reports, as shown below;

      “thus suggesting that the effect of the white mutant genetic background was not evident.” (line 97)

      • A general point in the methodology, it's not very helpful just to say "as in a previous study" without giving at least a brief idea of what that was (e.g. the explanation of egg counting procedures).

      A "sperm depletion" assay is described in the results that I cannot find any methodology for. *

      Answer: We thank the reviewer for allowing us to clarify our lacking methodologies for a better comprehension of our manuscript.

      We included the egg counting procedure to the EXPERIMENTAL PROCEDURES section to further illustrate our approach of egg laying assay as below;

      “In short, wild type females mated with naïve or experienced males were transferred to a fresh new vial and allowed to lay eggs for 24 hr at 25°C. After 24 hr of egg laying, number of eggs were counted under the stereomicroscope. After we count the number of eggs, we kept vials in 25°C incubator and counted the total number of progenies ecolsed from them.” (line 956-960)

      We included “Sperm Depletion from Males” section in EXPERIMENTAL PROCEDURES as below;

      “To deplete sperm from males, 40 virgin Defexel6234 females which lacks SPR and shows multiple mating with males (Yang 2009) were placed in a vial containing four CS males for indicated time (2 h, 4 h, 8 h, and 24 h).” (line 880)

      • Was the "excessive mating" with SPR females actually observed, or inferred from previous work? Needs to be clear. In what way do virgins expressing fruitless behave like mated females? It is so unclear how all the evidence in this paragraph leads to the conclusion that both cues from females and successful copulation. Especially as in the next paragraph experience with feminized females (with which the focal males cannot copulate) elicits the response.

      It might be helpful to combine the results into a table, so it is easy to see under which conditions males reduce mating duration. *

      Answer: We modified the sentence describing SPR mutant female experiment and added references as below;

      “Sexual experiences with sex peptide receptor (SPR) mutant females which exhibit a delayed post-mating response and multiple mating with males [3] had no additional effect on SMD (Fig. 2I).” (line 135)

      We clarify in which extent, fru>UAS-mSP virgin females behave like mated females as below;

      “Virgin females behave like mated females by expressing a membrane-bound version of male sex-peptide in fruitless-positive neurons, hence rejecting the male's copulation attempt.” (line 136)

      In the instance of feminized males, we assume that these feminine males can give adequate signals for inducing SMD and eliminated the term "successful copulation" since we are unsure if males can copulate these feminized males or not, despite the fact that males can mount and mate with them (Fig. 2O-P).

      Tables S1 and S2 describe the conditions, genotypes, and descriptions of an experiments illustrated in Fig. 2. We believe that these tables may assist general audiences in comprehending our experimental design.

      • Why are no statistics reported in the results? Identifying sig diffs on figures is not sufficient. I'm very sceptical that "mating duration of males showed normal distribution" for all comparisons, but then it's also difficult to identify which were analysed in this way (if statistics were properly reported this would not be an issue). *

      Answer: We described our statistical analysis with mating duration previously [4–7] and followed the statistical analysis of copulation duration assay reported by Crickmore et al., published in CELL (2013) and NEURON (2020) [8,9]. To further validate our statistical analysis, we added estimation statistics which focuses on the effect size of one's experiment/intervention, as opposed to significance testing [10]. We already described our statistical analysis in EXPERIMENTAL PROCEDURES section in details. We also described our statistical analysis for mating duration will be same in all other figures in the Fig.1 legend.

      We appreciate the reviewer's recommendation that the normal distribution of our mating duration data be validated. As a consequence, we performed the normailty test with Graphpad prism and added the histogram and QQ plot results to Fig. S1M and N. Table S3 also contains the results of the normality and lognormality tests.

      • Gr5a/ Gr66a mediate acceptance/ avoidance of what? Why would you hypothesise these in particular to be involved? *

      Answer: We accidentally left out the citation for that phrase and updated it with Wang et al.'s CELL (2004) paper. Wang et al. wrote in their article about taste representations in the Drosophila brain, “Our behavioral studies reveal that Gr5a cells recognize sugars and mediate acceptance/attractive behaviors whereas Gr66a cells recognize bitter compounds and mediate avoidance…. This suggests that Gr5a cells may be “acceptance” cells rather than “sweet” cells…. Our expression and behavioral studies reveal that Gr5a marks cells that recognize sugars and mediate taste acceptance, whereas Gr66a marks cells that recognize bitter compounds and mediate avoidance.” [11]

      As a result, we hypothesize that Gr5a and Gr66a-positive cells influence acceptance or avoidance of "taste." We also changed certain sentences to make them clearer, as seen below;

      “Of the various gustatory receptors, Gr5a marks cells that recognize sugars and mediate taste acceptance, whereas Gr66a marks cells that recognizes bitter compounds and mediates avoidance.” (line 173)

      • As Orco was not found to affect the behaviour, why test Or67d? *

      Answer: We appreciate the reviewer bringing this to our attention. We omitted the Or67d result from the present manuscript to simplify it and make it easier for readers to grasp.

      • "Mate guarding" suddenly appears in the modelling section. Can a difference of a couple of minutes in a mating duration of 15-20min really be considered mate guarding? A similar variation in response to rival males is not considered mate guarding, but is linked to adjustments in ejaculate expenditure (admittedly not in a very straight forward way). Surely in a system like this the benefits arise more from how many females the male can mate with in a given time? How does this model relate to any of the previous models of mate guarding?

      In this section the work of Linklater et al 2007 is important, they showed progeny declined over successive matings, and related this to exhaustion of Acps rather than sperm. I would urge the authors to consider that what they observe does not necessarily have an adaptive explanation. *

      Answer: We have defined “mate guarding” in the text now. The costs and benefits of mate guarding have been extensively studied in insects and demonstrated to shape the optimal mating duration of males. In our experiment, we cannot specify whether the shortened mating duration was caused by the adjustments in ejaculate expenditure or a shorted stay after the ejaculation. Instead, our model has a general assumption that the costs of mate guarding increase linearly at the same rate in both pre- and post-ejaculation periods, which is highlighted in the model text.

      There exist many models for the optimal mating duration (earlier models include Grafen and Ridley, 1983. A model of mate guarding. J. Theor. Biol. 102: 549 – 567 [12]). While our model was not built upon a novel theoretical approach (it was built based on the classical Charnov’s marginal value theorem equation), our model was developed specifically for generating testable predictions for the observed SMD behaviors.

      We have rephrased the text as follow;

      “This model assumes that (i) the shortened (or prolonged) mating duration is controlled by males and shaped by a trade-off between the benefit of mate guarding (remaining with the female both before and after the sperm ejaculation) and opportunistic costs (e.g. searching for another mate).” (line 970)”

      • I can't find a data accessibility statement. *

      Answer: We added it in the manuscript.

      • That said, a current grand challenge in understanding behaviour is discovering the mechanisms that enable individuals to respond plastically to changing environments. This speaks directly to that challenge. However, this behavioural observation is not novel, as claimed. Generally the idea of refractoriness is widely known, and specifically the reduction in mating duration over successive matings in D. melanogaster was shown by Linklater et al 2007 Evolution. Moreover, the time between exposure to females has been shown to be important. Linklater et al 2007 gave males mating attempts in quick succession and observed the decrease in mating duration, whereas given recovery time of 3 days, males either mate equally as long, or even longer across their life course (Bretman et al 2011 Proc B, Bretman et al 2013 Evolution). These papers should be discussed, and more broadly the work understood in the light of previous knowledge. The behaviour does not need to be novel for this manuscript to make a significant contribution to the field. *

      Answer: We believe the reviewer highlighted relevant past research that examined the influence of female experiences on mating duration. We agree with the reviewer that SMD behavior does not have to be original in order to contribute significantly to the field. As a result, we examined past reports and updated the introduction as follows;

      “It has been reported that previous sexual experience with females influences the mating duration of male D. melanogaster [15,16,34]; however, the neural circuits and physiology underlying this behavior have not been deeply investigated. Here, we report the sensory integration mechanisms by which sexually experienced males exhibit plastic behavior by limiting their investment in copulation time; we refer to this behavior as "shorter mating duration (SMD)."” (line 85)

      • Both in the introduction and discussion the extended mating duration in response to rivals is raised. A great deal of work has been done on this plasticity and yet the way this is written implies just two papers from these authors (whilst referencing others elsewhere). *

      Answer: We agree with the reviewer. In the introductory and discussion sections, we cited as many key publications explaining the plastic responses of male mating duration as we could.

      __REVIEWER #2

      __

        • Summary: The submitted manuscript reports that Drosophila melanogaster males use information derived from their previous sexual experiences from multiple sensory inputs to optimize their investment in mating. They refer to this plasticity as 'shorter-mating duration (SMD)'. SMD requires sexually dimorphic taste neurons. They identified several neurons in the male foreleg and midleg that express specific sugar, pheromone and mechanosensory receptors. Unfortunately, several aspects of the study design and methods used are inappropriate. Although the statistical approaches used are appropriate, the results are questionable. The discussion and conclusions are therefore too speculative in my view and overstretch the implications of the results as presented. Below I explain each one of these concerns about the study design, methods and results in detail as follows.* Answer: We appreciate the reviewer's assessment, especially the statement that our statistical approaches were appropriate. We will revise our manuscript in response to the reviewer's suggestions.
      1. The conclusions (as the authors point out) hinge on small (often extremely small) effect sizes. This is not an insurmountable problem, so long as the assays are robust across trials. Unfortunately, they are not-the variation in the baseline for control replicates is often as large as, or larger than, the effects from which the conclusions are derived. Given the extreme experimental challenges of small effect size combined with large intertrial variability, it is notable that the authors do not report any likely false negative or false positive data, as would be frequently expected under these conditions. One explanation for the reproducibility of statistical effect seen across many experiments despite these experimental hurdles is manipulation of sample size. The authors acknowledge the extreme variability in sample size offer seemingly harmless explanations, but a closer look shows how problematic this practice is. For example, see Figure 1 (I, J, L) there is a big different between naive and experience males? *

      Answer: We value the reviewer's feedback. Several research have been conducted to investigate the mating duration of male fruit fly. For example, our lab [2,13–15] and others [13–30] have regularly reported that previous rival exposure increases male fruit fly mating duration. Bretman A et al. utilized 49-59 males in their studies to compare the variations in mating duration between circumstances. Crickmore et al. also reported the effect of mating duration differences caused by genetic or experimental modification [8]. They utilized 10-18 male flies in their study to compare the variations in mating duration across circumstances, as shown in Figs. 1G (n=15-18) and 2A (n=10-27). All of these findings indicate that our mating duration sample size is sufficient to examine the effect size variations between the naive and experienced conditions. To confirm our statistical analysis further, we incorporated estimate statistics, which focus on the effect size of one's experiment/intervention rather than significance tests [10]. We have already detailed our statistical analyses under the EXPERIMENTAL PROCEDURES section. We conducted hundreds of mating duration assays using this configuration and confirmed that all of our results are reproducible in a blind test. As a result, we believe our mating duration assay has been validated by other groups' findings, several analytic tools, and numerous blind tests conducted by us. We appreciate the reviewers' concerns, but our data meets the reproducibility requirements.

      • I am not sure if you keep using the same control with different experiments (that is okay if those exp is done in the same time) as in figure 1 B, I,J,K,L.But I don't think you did Fig 1B in the same time with Fig 1I, J, K,L. *

      Answer: We appreciate the reviewer's feedback. Yes, all of our tests comparing the differences in mating duration between naive and experienced conditions were conducted under the same conditions and at the same time. We replaced Fig.1B with new data (n=49-51) obtained lately in a new lab in China. As previously stated, SMD behavior could be reproduced by the same Canton S genotype in different locations by different experimenters.

      • It will be clear if you mention in the text how much reduction in percent happened in copulation duration when the males had previous sexual experience? *

      Answer: We appreciate the reviewer’s suggestion and added in the manuscript as follow;

      “We found that the mating duration of various wild-type and w1118 naïve males are significantly longer (wild type 15.7~15.8%, w1118 12.4%) than that of sexually experienced males (Fig. 1B-D, Fig. S1A)” (line 99)

      • 'Drosophila simulans, the sibling species of D. melanogaster also exhibits SMD, thus suggesting that SMD is conserved between close species of D. melanogaster (Fig. S1B).'. If you want come with this conclusion, you need to test D. erecta, D. sechelia and D. yakuba. *

      Answer: We appreciate the reviewer's feedback. We removed the D. simulans data because it is not required for the conclusion of this manuscript. In future research, we will look on the conservation of SMD behavior between species.

      • The authors mention that Gr66a is salt. This is not 100% correct. GR66a is expressed in many bitter sensing neurons and is required for the physiological and behavioral responses to many bitter compounds. check this reference DOI:https://doi.org/10.1016/j.cub.2019.11.005. *

      Answer: We made the following changes and cited the article reviewer's suggestion.

      “Of the various gustatory receptors, Gr5a marks cells that recognize sugars and mediate taste acceptance, whereas Gr66a marks cells that recognizes bitter compounds and mediates avoidance (Wang et al, 2004; Dweck & Carlson, 2020).” (line 180)

      • Drosophila melanogaster mating duration is between 21- 23 mins. I never saw copulation duration in normal condition (control) 10-15 mins as in figure fig 2E, Fig 7 C,E,F, Fig 8 E and fig 12 G . To the best of my knowledge, of all of the papers on copulation duration, the only one that ascribes a shortened duration to manipulations of the female is Rideout...Goodwin Nature Neuroscience 2010, who argue that this shortening results from markedly increased female activity/agitation during mating, leading the male to terminate early. *

      Answer: We appreciate the reviewer's feedback. Copulation duration in Drosophila melanogaster male is extremely variable and has been reported to be approximately 20 minutes. However, as other groups documented, male copulation duration can range from 10-15 minutes depending on sperm completion (Fig. 1a-c of Bretman A et al.) [30] and genetic background (Fig. 1C, Fig. 2E, Fig. 5D, and Fig. 7A and E of Crickmore et al) [8]. And, as previously stated, males dominate copulation duration [8,30], not females, and we always utilized the same genotype of females for mating duration experiment. As a result, we believe that these rather short mating duration outcomes are the product of a distinct genetic background. Because we employed the same genotype of males while altering the female experience condition, we believe our mating duration results are all equivalent and comparable.

      • In some experiments, the authors test very few number of replicates which is not convinced me to their conclusion as example Fig 2F and Fig 12 E. Why you test 100, 103 replicates in this exp fig 10 F? How you compare 47 replicates against 9 replicates in fig S10 I? *

      Answer: We appreciate the reviewer's input. As we previously stated in response to Reviewer Question 2, the n number of males exhibited in Figs. 2F and 12E is statistically significant. To corroborate findings with replication, we examined 100, 103 duplicates of Fig. 10F, which represents pyx-RNAi screening results. The results of Fig. S10I are screening data, and we cannot rule out the possibility that TrpA1 knockdown in Gr5a neurons affects the mating success of sexually experienced males. We only placed it there because it was screening results and the differences between naive and experienced conditions were substantial despite the small sample size. However, we deleted Fig. 10F and Fig. S10I data from the current paper in response to Reviewer #1's advice, thus it will not be an issue for the manuscript's conclusion.

      • 'Next, to decipher whether DEG/NaC channel-expressing pheromone sensing neurons require the function of OBP, we expressed lush-RNAi using ppk23-, ppk25- and ppk29-GAL4 drivers to knockdown LUSH in each channel-expressing neuron. The knockdown of LUSH in ppk25- and ppk29-GAL4 labeled cells, but not in ppk23-GAL4 labeled cells, led to a disturbance in SMD behavior, thus suggesting that LUSH functions in ppk25- and ppk29-positive neurons to detect pheromones and elicit SMD behavior (Fig. 9G-I). The knockdown of SNMP1 in ppk29-GAL4- labeled neurons also inhibited SMD behavior (Fig. 9J), thus suggesting that SNMP1 also functions in ppk29-positive neurons to induce SMD behavior.' What about ppk25? **

      *

      Answer: As indicated by the reviewer, we included ppk25-GAL4/snmp1-RNAi data in Fig. S9I, indicating that snmp1 expression in ppk25-positive cells is similarly implicated in SMD behavior.

      • There are no page or line numbers throughout the ms! *

      Answer: We included page and line numbers.

      • The use of subheadings in the results section makes reading much easier.*

      Answer: We added subheadings in the results section.

      • 'We found that the mating duration of various wild-type and w 1118 naïve males are significantly longer than that of sexually experienced males (Fig. 1B-D, Fig. S1A)' . I think you should change various wild type to CS and WT Berlin as in legend and figure 1B,C .*

      Answer: The revised sentence is as follows:

      “We found that the mating duration of Canton S, WT-Berlin, Oregon-R, and w1118 naïve males are significantly longer (wild type 15.7~15.8%, w1118 12.4%) than that of sexually experienced males (Fig. 1B-D, Fig. S1A)” (line 102)

      • Suggested exp , Fig S1E-H , they might test 2,6, 12 hours males separation from females to test exactly when this behavior change over time. *

      Answer: We value the reviewer's recommendation. As seen in Fig. S4B of Kim et al., we have previously conducted experiments for examining the memory circuit of SMD [6]. Briefly, the male with a shorter mating duration recovers completely after 12 to 24 hours of isolation from females. As we are currently preparing the memory section of the SMD study, this information will be included in a future manuscript.

      • General comment in figures, you could remove the common y axis as example in figure 1 B,C,D , difference between means and mating duration. *

      Answer: We welcome the reviewer's idea, however in this situation we believe that the y axis of each data set is independent from one another and will thus retain the originals. We feel this would be more useful for the general audiences.

      • You might move the number of replicates to the legend. *

      Answer: We appreciate the reviewer's idea, however we feel that adding more information to the graphic will aid the general audience in comprehending our statistics.

      • Latin name should be italic as example Drosophila simulans.*

      Answer: We fixed it.

      Description of analyses that authors prefer not to carry out

      N/A

      References

      1. Billeter J-C, Levine JD. The role of cVA and the Odorant binding protein Lush in social and sexual behavior in Drosophila melanogaster. Frontiers Ecol Evol. 2015;3: 75. doi:10.3389/fevo.2015.00075
      2. Kim WJ, Lee SG, Schweizer J, Auge A-C, Jan LY, Jan YN. Sexually experienced male Drosophila melanogaster uses gustatory-to-neuropeptide integrative circuits to reduce time investment for mating. Biorxiv. 2016; 088724. doi:10.1101/088724
      3. Yang C, Rumpf S, Xiang Y, Gordon MD, Song W, Jan LY, et al. Control of the Postmating Behavioral Switch in Drosophila Females by Internal Sensory Neurons. Neuron. 2009;61: 519–526. doi:10.1016/j.neuron.2008.12.021
      4. Kim WJ, Jan LY, Jan YN. Contribution of visual and circadian neural circuits to memory for prolonged mating induced by rivals. Nat Neurosci. 2012;15: 876–883. doi:10.1038/nn.3104
      5. Kim WJ, Jan LY, Jan YN. A PDF/NPF Neuropeptide Signaling Circuitry of Male Drosophila melanogaster Controls Rival-Induced Prolonged Mating. Neuron. 2013;80: 1190–1205. doi:10.1016/j.neuron.2013.09.034
      6. Kim WJ, Lee SG, Auge A-C, Jan LY, Jan YN. Sexually satiated male uses gustatory-to-neuropeptide integrative circuits to reduce time investment for mating. Biorxiv. 2016; 088724. doi:10.1101/088724
      7. Wong K, Schweizer J, Nguyen K-NH, Atieh S, Kim WJ. Neuropeptide relay between SIFa signaling controls the experience-dependent mating duration of male Drosophila. Biorxiv. 2019; 819045. doi:10.1101/819045
      8. Crickmore MA, Vosshall LB. Opposing Dopaminergic and GABAergic Neurons Control the Duration and Persistence of Copulation in Drosophila. Cell. 2013;155: 881–893. doi:10.1016/j.cell.2013.09.055
      9. Thornquist SC, Langer K, Zhang SX, Rogulja D, Crickmore MA. CaMKII Measures the Passage of Time to Coordinate Behavior and Motivational State. Neuron. 2020;105: 334-345.e9. doi:10.1016/j.neuron.2019.10.018
      10. Claridge-Chang A, Assam PN. Estimation statistics should replace significance testing. Nat Methods. 2016;13: 108–109. doi:10.1038/nmeth.3729
      11. Wang Z, Singhvi A, Kong P, Scott K. Taste Representations in the Drosophila Brain. Cell. 2004;117: 981–991. doi:10.1016/j.cell.2004.06.011
      12. Grafen A, Ridley M. A model of mate guarding. J Theor Biol. 1983;102: 549–567. doi:10.1016/0022-5193(83)90390-9
      13. Kim WJ, Jan LY, Jan YN. A PDF/NPF Neuropeptide Signaling Circuitry of Male Drosophila melanogaster Controls Rival-Induced Prolonged Mating. Neuron. 2013;80: 1190–1205. doi:10.1016/j.neuron.2013.09.034
      14. Kim WJ, Jan LY, Jan YN. Contribution of visual and circadian neural circuits to memory for prolonged mating induced by rivals. Nat Neurosci. 2012;15: 876–883. doi:10.1038/nn.3104
      15. Wong K, Schweizer J, Nguyen K-NH, Atieh S, Kim WJ. Neuropeptide relay between SIFa signaling controls the experience-dependent mating duration of male Drosophila. Biorxiv. 2019; 819045. doi:10.1101/819045
      16. Bretman A, Fricke C, Chapman T. Plastic responses of male Drosophila melanogaster to the level of sperm competition increase male reproductive fitness. Proc Royal Soc B Biological Sci. 2009;276: 1705–1711. doi:10.1098/rspb.2008.1878
      17. Bretman A, Westmancoat JD, Chapman T. Male control of mating duration following exposure to rivals in fruitflies. J Insect Physiol. 2013;59: 824–827. doi:10.1016/j.jinsphys.2013.05.011
      18. Bretman A, Gage MJG, Chapman T. Quick-change artists: male plastic behavioural responses to rivals. Trends Ecol Evol. 2011;26: 467–473. doi:10.1016/j.tree.2011.05.002
      19. Lizé A, Doff RJ, Smaller EA, Lewis Z, Hurst GDD. Perception of male–male competition influences Drosophila copulation behaviour even in species where females rarely remate. Biol Letters. 2012;8: 35–38. doi:10.1098/rsbl.2011.0544
      20. Rouse J, Bretman A. Exposure time to rivals and sensory cues affect how quickly males respond to changes in sperm competition threat. Anim Behav. 2016;122: 1–8. doi:10.1016/j.anbehav.2016.09.011
      21. Bretman A, Fricke C, Hetherington P, Stone R, Chapman T. Exposure to rivals and plastic responses to sperm competition in Drosophila melanogaster. Behav Ecol. 2010;21: 317–321. doi:10.1093/beheco/arp189
      22. Rouse J, Watkinson K, Bretman A. Flexible memory controls sperm competition responses in male Drosophila melanogaster. Proc Royal Soc B Biological Sci. 2018;285: 20180619. doi:10.1098/rspb.2018.0619
      23. Maguire CP, Lizé A, Price TAR. Assessment of Rival Males through the Use of Multiple Sensory Cues in the Fruitfly Drosophila pseudoobscura. Plos One. 2015;10: e0123058. doi:10.1371/journal.pone.0123058
      24. Bretman A, Westmancoat JD, Gage MJG, Chapman T. COSTS AND BENEFITS OF LIFETIME EXPOSURE TO MATING RIVALS IN MALE DROSOPHILA MELANOGASTER. Evolution. 2013;67: 2413–2422. doi:10.1111/evo.12125
      25. Bretman A, Fricke C, Westmancoat JD, Chapman T. Effect of competitive cues on reproductive morphology and behavioral plasticity in male fruitflies. Behav Ecol. 2016;27: 452–461. doi:10.1093/beheco/arv170
      26. Price TAR, Lizé A, Marcello M, Bretman A. Experience of mating rivals causes males to modulate sperm transfer in the fly Drosophila pseudoobscura. J Insect Physiol. 2012;58: 1669–1675. doi:10.1016/j.jinsphys.2012.10.008
      27. Bretman A, Westmancoat JD, Gage MJG, Chapman T. Males Use Multiple, Redundant Cues to Detect Mating Rivals. Curr Biol. 2011;21: 617–622. doi:10.1016/j.cub.2011.03.008
      28. Fowler EK, Leigh S, Rostant WG, Thomas A, Bretman A, Chapman T. Memory of social experience affects female fecundity via perception of fly deposits. Bmc Biol. 2022;20: 244. doi:10.1186/s12915-022-01438-5
      29. Dore AA, Rostant WG, Bretman A, Chapman T. Plastic male mating behavior evolves in response to the competitive environment*. Evolution. 2021;75: 101–115. doi:10.1111/evo.14089
      30. Bretman A, Fricke C, Chapman T. Plastic responses of male Drosophila melanogaster to the level of sperm competition increase male reproductive fitness. Proc Royal Soc B Biological Sci. 2009;276: 1705–1711. doi:10.1098/rspb.2008.1878
    1. <![endif]-->

      Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      Minor edits

      1. Line 91. Is a bit misleading to say "many other vibrios" possess T3SS. This conveys that this is perhaps the majority, but T3SS in vibrios is at best 50/50. I think best just to delete this sentence.

      We deleted this comment, as suggested.

      1. Revised to "Thus, in this study, we set out to..." Since the entire paragraph starts with "recent study" I missed that this was summary of new data rather than preview of new results.

      The sentence was revised as suggested.

      1. Line 503. Correct "xxx-584" or more detail on what this means.

      We Thank the reviewer for pointing out this typo._ This refers to the deletion made in tie1, in the region corresponding to nucleotides 485-584 of this gene. The text was corrected accordingly.

      1. Line 603. Salmonella should be italicized.

      Corrected.

      1. The labelling of the figures is pretty complicated with the long genetic designations. Is it reasonable to for example name the ∆vprh/∆hns1 strain with an abbreviation (such as ∆VH)? Or instead create a strain name, common used approaches would be HC## (for Hadar Cohen) or TAU# for Tel Aviv University. If you go this route, be sure to update the strain list. The current method can be followed, the figures are just complicated.

      We thank the reviewer for raising this concern. We acknowledge the difficulty in following the many different strains and mutations. Nevertheless, after considering the proposed modifications to the strain names, we believe that they will not add much clarity, and may even cause some confusion. Therefore, we respectfully decided to keep the current nomenclature in place.

      Reviewer #2:

      Minor edits

      1. The authors used a hyperactive T6SS (HNS mutant) to investigate its toxicity. Would the authors be able to use a wild type strain to reproduce the function of T6SS?

      We have yet to reveal the external cues that lead to full activation of T6SS3 in vitro. Therefore, in the current study we used genetic tools, such as hns deletion or Ats3 over-expression, to monitor the effect of this system on immune cells. We will dissect the activating conditions in future studies, but we believe that the use of genetic tools should not affect the validity of the results in the current study, nor their timely publication.

      1. The authors showed that Tie1 and Tie2 are secreted by T6SS3. It is important to show if they are actually delivered into the host cells during infection. Otherwise it is hard to conclude that they are truly effectors. The primary concern is the lack of in vivo studies to show that Tie1 and Tie2 are actually effectors that play a role in activation of NLRP3 inflammasome._

      We present 3 pieces of evidence that, when taken together, support the conclusion that Tie1 and Tie2 are T6SS3 effectors: 1) the proteins are secreted in a T6SS3-dependent manner; 2) their deletion does not hamper overall T6SS3 activity; and 3) their deletion causes the same loss of NLRP3-mediated inflammasome activation and pyroptosis as does inactivation of T6SS3 by deletion of its structural component, tssL3. Although we agree with the reviewer that directly showing delivery of Tie1 and Tie2 into host cells will further strengthen our conclusion, such experiments are quite challenging and difficult to interpret, especially with T6SS effectors that can use diverse mechanisms for secretion through the system. This point was also noted by reviewer #3: “…I believe they were suggesting to demonstrate secretion in host cells. Although this would be nice, it is non-standard and technically not feasible. These types of experiments require genetically fusing the effector with either an enzymatic moiety (e.g. Beta lactamase) or fragment of split GFP. Although such approaches have been previously performed, they often result in either blocked or aberrant secretion due to the presence of the added fragment."

      Regarding the reviewer’s comment on the lack of in vivo studies: we agree that these are extremely important, yet they are beyond the scope of the current work, as concurred by reviewers #1 and #3:

      Reviewer#1 with regard to Reviewer#2: "I don't think mouse (or aquatic animal) studies are essential for this study. The work contributes nicely to our understanding molecular mechanisms of this T6SS system. As noted in my review, there are many additional lines of study that can be pursued from this work, including animal studies, but this should not preclude publication of this work that is itself an intact unit."

      Reviewer#3 regarding reviewer #1's comment on Reviewer#2: "I don't believe that reviewer #2 was suggesting to perform mouse or aquatic animal studies by suggesting in vivo demonstration of secretion…”

      Reviewer #3:

      Major comments:

      1. If the authors believe that GSDME partially compensates in the absence of GSDMD, have they infected a GSDME/GSDMD double knockouts to see if there is an additive effect?

      Indeed, this is a very interesting and specific question for the cell death field. We do not currently possess such a GSDME/GSDMD double knockout mouse, and generating one will be a long endeavor. Since its absence does not diminish the importance or the conclusions of the current work, we think that it should not warrant a delay in publication. We do plan to address this question in future studies.

      1. It is clear that Ats3 regulates T6SS3, but not the T6SS1; however, there no evidence suggesting that Atg3 does not regulate other gene clusters. For example, have the authors performed RNA seq to compare the transcriptomes of WT and an Ats3 mutant? If not, the authors should refrain using the words "specific activation".

      We thank the reviewer for this important note. Indeed, we lack additional data indicating that Ats3’s effect is indeed restricted only to T6SS3. Therefore, we modified the text accordingly and removed mentions of specific T6SS3 activation.

      1. In figure 6B, it's unclear why the bacteria infecting cytochalasin D-treated cells grow more than the T6SS3 mutants in the absence of cytochalasin D.

      The difference probably stems from the fact that phagocytosis, the major mechanisms by which BMDMs kill bacteria, is hampered in the presence of cytochalasin D, thus allowing bacteria to grow more than when the BMDMs phagocytose them. The results show that in the absence of cytochalasin D, an active T6SS3 counteracts the killing effect by BMDMs with functional phagocytosis.

      Minor comments:

      1. Figure 1A and other secretion assays: The Western blots include loading control (LC) blots. These are non-standard, non-informative, and not required with the inclusion of the western blots on the "cells" fraction. I would suggest removing these as they may confuse the reader.

      We respectfully disagree. Loading controls are standard in bacterial secretion assays, and they are important since they confirm comparable loading and allow proper analysis of the results, especially since we aim to determine whether certain mutations affect the expression of T6SS components. Notably, some groups choose to blot for a cytoplasmic protein (e.g., RpoB in Allsop et al., PNAS, 2017; Liang et al., PLoS Pathogens, 2021) instead of showing overall loaded proteins, as shown in our figures.

      1. Line 503: "xxx" should reflect the actual nucleotide nubmers_

      We thank the reviewer for pointing out this typo._ This refers to the deletion made in tie1, in the region corresponding to nucleotides 485-584 of this gene. The text was corrected accordingly.

      1. Since V. proteolyticus is an aquatic pathogen, have the authors tried to infect corals, fish, and crustaceans (or derived cells) with WT and effector mutants?

      This is an interesting point, and indeed we are setting up such systems and we plan to perform such experiments in the future as part of follow up projects. However, these in vivo studies are beyond the scope of the current manuscript, as also noted by the reviewer in the cross-consultation comments: “…my previous comment on infecting aquatic animals or cells derived from them is non-standard and not necessary…”

      1. Are the targeted host proteins in this study (performed with murine BMDM) conserved in the natural hosts for V. proteolyticus?

      We hypothesize that the conservation is not in the pathway components that are activated upon infection, but rather in the ability of the host cell to sense danger (i.e., to sense the effect of T6SS3 effectors on the host cell or one of its components), which is the role of the NLRP3 inflammasome in mammalian cells. It is well documented that major differences in immune mechanisms exist between mammals and the potential natural marine hosts of V. proteolyticus (e.g., corals, arthropods, and fish); therefore, the conservation at the protein level is low. Nevertheless, basic signaling pathways, such as programed cell death, are conserved between the different phyla. For example, a caspase-1 homolog which was found in arthropods (Chu, B. et al. PLoS One (2014). doi:10.1371/journal.pone.0085343) probably induces an apoptotic-like cell death mechanism, similar to apoptosis in C. elegans. We now provide further discussion on this point in the text (lines 648-659).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This paper by Cohen et al described discovery of the function of novel genes in the T6SS operon of Vibrio proteolyticus, a Vibrio isolated from corals. V. proteolyticus also impacts other sea animals. The T6SS3 in particular is found to kill eukarytoic phagocytic cells following engulfment of bacteria into the phagocyte. This strategy of killing phagocytic cells following entry has been shown for other Vibrios. The net goal is protection of the population by the bystander effector. The study first shows that deletion of H-NS (a global negative regulator) stimulates T6SS facilitating ease of work by pushing the system to great cell killing. This allowed them to probe the mechanism of cell death and reveal it as NLRP3 dependent, capase 1 dependent pyroptosis via pore formation by Gasdermin D. Activation of the inflammasome is also linked to cleavage and release of IL-1beta. When GSDMD is absent, there was a slower cell killing by GSDME via capsase 3 activation. The stimulation of this system is additive by two newly recognized T6SS effectors Tie1 and Tie2.

      The study is complete, the experiments are well conducted and well controlled. The experiments show reproducibility. The manuscript text is clear, Overall. I suggest no changes in the results or experiments and suggest only a few minor edits of the text.

      Minor edits

      Line 91. Is a bit misleading to say "many other vibrios" possess T3SS. This conveys that this is perhaps the majority, but T3SS in vibrios is at best 50/50. I think best just to delete this sentence.

      Line 102. Revised to "Thus, in this study, we set out to..." Since the entire paragraph starts with "recent study" I missed that this was summary of new data rather than preview of new results.

      Line 503. Correct "xxx-584" or more detail on what this means.

      Line 603. Salmonella should be italicized.

      Figures. The labelling of the figures is pretty complicated with the long genetic designations. Is it reasonable to for example name the ∆vprh/∆hns1 strain with an abbreviation (such as ∆VH)? Or instead create a strain name, common used approaches would be HC## (for Hadar Cohen) or TAU# for Tel Aviv University. If you go this route, be sure to update the strain list. The current method can be followed, the figures are just complicated.

      Referees cross-commenting

      With regard to Reviewer#2, I don't think mouse (or aquatic animal) studies are essential for this study. The work contributes nicely to our understanding molecular mechanisms of this T6SS system. As noted in my review, there are many additional lines of study that can be pursued from this work, including animal studies, but this should not preclude publication of this work that is itself an intact unit.

      Significance

      The work is significant in that it links T6SS to a eukaryotic killing system and discovers novel details regarding the mechanisms of death, that may impact our knowledge of other Vibrio T6SS (including V. cholerae) that also target eukaryotic cell actin. There are remaining questions that could be probed, but these are in my opinion major studies that would easily themselves comprise new papers if done properly and thus are not essential for this paper. These include the struture and biochemical activity of Tie1 and Tie2 and the mechanism of caspase-8 independent activation of caspase-3 to then cleave GSDME. Why NLRP3 is required for capase 3 activation is also an open question. I look forward to following this work for some time to come. The authors have revealed very interesting effectors and interesting cell biological process that will merit multiple years and multiple manuscripts to unravel. This work will be of interest to the community interested in bacterial toxin systems (microbial pathogenesis), the bacterial effector mechanism field (biochemistry and cell biology), and the inflammasome activation field (immune systems). The work will be of interest (with essentially no modification) directed at these fields of interest.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      Major comments:

      In general, the data support the conclusions. I cannot comment on the atomistic simulation experiment as it is outside of my expertise. I had some difficulties interpreting Figure 2 as the contrast in the colour panels made it difficult to assess the different staining patterns. I would recommend changing the blue to cyan for easier visibility. While I agree that there are some differences between Fig 2F and Fig 2G it is not simple for the non-expert to distinguish the gonadal mesoderm from the somatic mesoderm. I think the enlarged panels could do with also showing the overlap in staining, or at least a tracing of the different cell populations so that the gonadal mesoderm can be clearly defined. Please also add some scale bars to the figure. Figure 3 demonstrates clear differences in gonad morphology between male and female mutants but the contrast in the colour panels A-G could also be improved. Panels H-J are very clear.

      Response: As suggested by Referees 1 and 3 we have modified the colour channels in all figures. We have also enlarged the figures taking away the uninformative region and focused around the enlarged gonads and added scale bars. For Fig 2F-G, we have added a close up of the region of interest both in colour and in black and white. These changes have increased the contrast and facilitate the data interpretation to non-expert readers.

      The rescue experiment in Figure 4 is clearly presented but could the DLC3 mutants in the graph (panel b) please be named similarly to the schematic proteins shown in panel a.

      Response: We have changed the names to maintain nomenclature uniformity.

      I found the difference between the RhoGAP domain mutants and the StART domain mutants of Cv-c to be clearly defined, and correlate with DLC3 function. This is a very interesting result that indicates multiple molecular functions for the Cv-c /DLC family.

      Response: The methods are well described, statistics adequate and the data well described._

      Minor Comments:

      My only suggestion for the text is to provide a more through description of the StART domain in the introduction.

      Response: We have included the following paragraph in the introduction describing the StART domain:

      “This family of proteins share different domains: besides the Rho GTPase Activating Protein domain (GAP), they present a protein-protein interacting Sterile Alpha Motif (SAM) at the N terminal end and a Steroidogenic Acute Regulatory protein (StAR)-related lipid transfer (StART) domain at the C terminal. StART domains have been shown in other proteins to be involved in lipid interaction, protein localization and function.”

      Reviewer #2:

      My only issue with the present study is to how well the present experimental findings in Drosophila translate to humans. As far as I can tell the present studies show that inactivating mutations in Cv-c in Drosophila result in failure of germ cell enclosure by somatic cells into the testis, resulting in sterility. In humans, and in experimental mouse transgenic lines, it has been well established that absence of germ cells does not of itself lead to failure of testis differentiation and onward development, nor does it lead automatically to sex reversal or impairment of masculinization. For the latter to occur, there must be impairment/failure of fetal Leydig cell function such that insufficient androgen is produced to effect genital/bodywide masculinization. Obviously, this will happen if no testis forms as appears to be the case in the new human DLC3 mutant reported in the present manuscript (although detail on this is unfortunately lacking). This appears to be different to the previous published DLC3/STARD8 mutant sisters, in whom the phenotype appears to reflect failure of steroidogenesis. Is the proposal that DLC3/STARD8 plays a role in both testis differentiation and in Leydig cell function (steroidogenesis) or is this due to different DLC3 genes? I think the authors need to address these key issues in their discussion, if only to highlight that there are at present many gaps in our understanding.

      The reviewer says:

      “As far as I can tell the present studies show that inactivating mutations in Cv-c in Drosophila result in failure of germ cell enclosure by somatic cells into the testis, resulting in sterility.”

      Response: This sentence does not represent the spirit of our findings accurately and this probably reflects the fact that we stressed the interaction between somatic mesodermal cells and germ cells in Drosophila which probably concealed that the main defects in Cv-c mutants are caused by the abnormal interaction of the mesodermal cells with germ cells but also among themselves. Our study provides insights about a new conserved pathway required in the mesodermal cells for the maintenance of an already formed testis, and only indirectly can be considered to deal with sterility. We show that Cv-c is required in the mesodermal cells for the correct maintenance of the testis structure, that when it fails leads to the testis dysgenesis which, among other defects, releases the germ cells. We show that in the absence of Cv-c function in the testis, the mesodermal pigment cells do not form a continuous layer around the testis and the ECM surrounding the testis breaks. We also show that the interstitial gonadal cells fail to ensheath the germ cells and as a result of all these the germ cells become dispersed. These perturbations can be partially corrected by expression in the testis mesoderm of human DLC3 or Drosophila Cv-c that in both cases require a functional StART domain. Thus, our results suggest that Cv-c/DLC3 have a fundamental function on the mesodermal testis cells that has been conserved. These results indicate that, as in Drosophila, the primary cause for the gonadal dysgenesis in DLC3 human patients is due to the abnormal maintenance of the testis mesoderm cells, which include both Sertoli and Leydig cells. Thus, our proposal is that DLC3/STARD8 plays a role in testis maintenance through its function in mesodermal cells which will probably affects both Sertoli and Leydig cell function.

      To clarify the issue raised by the referee we have modified both, the introduction and the discussion to highlight that although humans and Drosophila diverged millions of years ago there are similarities regarding gonad stabilisation.

      We have modified the introduction to clarify this issue:

      “Gonadogenesis can be subdivided into three stages: specification of precursor germ cells, directional migration towards the somatic gonadal precursors and gonad compaction. In mammals, somatic cells, i.e. Sertoli cells in male and Granulosa cells in females, play a central role in sex determination with the germ cells differentiating into sperms or oocytes depending on their somatic mesoderm environment. In humans, Primordial Germ Cells (PGCs) are formed near the allantois during gastrulation around the 4th gestational week (GW) and migrate to the genital ridge where they form the anlage necessary for gonadal development (GW5-6). Somatic mesodermal cells are required for both PGCs migration and the formation of a proper gonad. Once PGCs reach their destination, the somatic gonadal cells join them (around GW 7-8 in males, GW10 in females) and provide a suitable environment for survival and self-renewal until gamete differentiation {Jemc, 2011 #413}. Thus, mutations in genes regulating somatic Sertoli and Granulosa support cell function in humans are often associated with complete or partial gonadal dysgenesis in both sexes and sex reversal in males {Zarkower, 2021 #430; Knower, 2011 #418; Brunello, 2021 #399}. Other mesodermal cells, the Leydig cells, also play an important role in the testis by being the primary source of testosterone and other androgens and maintaining secondary sexual characteristics.”

      Also we have added a paragraph in the discussion to emphasize this argument:

      “We show that in the absence of Cv-c function in the testis, the mesodermal pigment cells do not form a continuous layer around the testis and the ECM surrounding the testis breaks. We also show that the interstitial gonadal cells fail to ensheath the germ cells and as a result of all these the germ cells become dispersed from the testis. These perturbations can be partially corrected by expression in the testis mesoderm of human DLC3 or Drosophila Cv-c that in both cases require a functional StART domain. Thus, our results suggest that Cv-c/DLC3 have a fundamental function on the mesodermal testis cells that has been conserved. These results indicate that, as in Drosophila, the primary cause for the gonadal dysgenesis in DLC3 human patients is due to the abnormal maintenance of the testis mesoderm cells, which include both Sertoli and Leydig cells”.

      I would also suggest that the authors highlight another potentially more important spin-off from such studies, namely that understanding of the regulation of DLC3/STARD8 genes, and what might perturb their expression/action would appear to present a whole new area for exploration in relation to testicular dysgenesis/masculinization disorders.

      Response: We have modified the last part of the discussion to introduce referee 2’s suggestion:

      “Our work points to DLC3/Cv-c as a novel gene required specifically in testis formation. Adding DLC3 to the list of genes involved in 46X,Y complete dysgenesis opens up a new avenue to analyse the molecular and cellular mechanisms behind these disorders that could help in diagnosis and the development of future treatments”.

      Reviewer #3 :

      Major comments:

      1. This study has shown the expression pattern of cv-c and the consequence of cv-c mutation on different aspects of gonad development. However, one major comment is there is no quantification of the expression levels as well as the scoring of the mutant phenotypes.
      2. In Figure 2, for instance, I recommend that the authors display the quantification of the fluorescence intensity of the cv-c expression under all circumstances (in situ hybridization as well as protein-trap based GFP expression) to better depict the differences among the male vs female gonad.

      Response: We don’t think quantifying the stainings will add much to the results. We believe that the changes performed increasing the images’ contrast and their amplification are sufficient to illustrate our statement about cv-c being expressed in testis but not in ovaries.

      1. In Figure 3, the authors show the different gonad developmental defects associated with the cv-c mutation. Specifically, the authors show that the gonad mesoderm cells are displaced with the pigment cells failing to ensheath the germ cells. In addition, the authors also suggest that there is an increased frequency of germ cell blebbing, an indication of migratory activity. However, there is no quantification of these findings. I think the authors should display a quantitative estimation of % of the mutant gonad depicting these phenotypes vs the normal gonad to have a perspective of how penetrant the phenotypes are.

      Response: As referee suggested, we have quantified bleb phenotype. The results are presented in figure 3, panel J.

      1. In Figure 4, the authors attempt to rescue the Cv-C mutation linked gonadal defects by overexpressing different Cv-C protein variants. The rescue experiments are not very clear. The graph shows the % of normal testes under different genotypic combinations. It is not very clear what the authors mean by normal (in what context)? Since the mutation results in different defects of gonad development, I think recommend that represent the rescue in terms of these defects. It would be interesting to see for instance, what happens to the blebbing or germ cell ensheathment phenoype upon rescue. How many % of testes show the rescue as compared to cv-c mutants?

      Response: The percentages are quantified considering if the testes have any germ cell outside the gonad. We have added a line to clarify this point in the figure legend: “…quantified as encapsulated gonads with all germ cells inside the testis as assessed by Fisher-test”.

      Nevertheless, we are going to quantify the number of ECM breaks and show the results in the reviewed manuscript.

      1. Did the authors try cell-specific depletion of cv-c and examined the consequence on gonad development?

      Response: cv-c mutants are embryonic lethal because of Cv-c’s widespread requirement on various embryonic tissues during development. Induction of FRT clones in the embryonic testis mesoderm was unsuccessful because of the low number of divisions during embryogenesis. We also tried to knock down cv-c expression with 3 different RNAi lines. Unfortunately, overexpression of these RNAi with different testis Gal4 drivers did not decrease cv-c mRNA levels significantly in the mesoderm or in other tissues where cv-c is expressed. Despite these experiments unsatisfactory outcome, our finding that cv-c is expressed in the testis mesoderm cells, and the fact that we can rescue the testis phenotypes by expressing Cv-c with gonadal mesodermal specific Gal4 lines supports a testis mesoderm requirement of cv-c for its gonadal function.

      1. Another major concern is the lack of mechanistic insight of cv-c. For example, how does loss of cv-c result in gonadal dysgenesis? The authors suggested that StART domains regulate via lipid binding. The authors could examine if StART domain function is dependent on lipid-mediated interactions.

      Response: We agree with the referee that the molecular characterisation of the StART-mediated GAP-independent Cv-c function we have uncovered in this work is a very interesting finding that should be addressed by future work. However, such biochemical characterisation requires a complex approach to distinguish between the already known StART function regulating the GAP activity shown before (Sotillos Scientific Reports) and the new GAP-independent function we describe in the testes that falls beyond this work.

      The central point of this manuscript is the demonstration that both DLC3/Cv-c are involved in male gonad formation, an important conserved function for both of them that had been overlooked by previous publication. Thus, DLC3 should be considered a new gene to be analysed in the future when studying gonadal dysgenesis. A second important point raised by our work is the demonstration that DLC3/Cv-c can perform RhoGAP independent functions, something that had never been described for these proteins.

      Not withstanding this, in the revised version, we have added a new supplementary figure (1) related to the StART domain-lipid interaction analysed in-silico. The in-silico model shows that the DLC3-StART domain Ω1-loop structure displays the highest frequency of interaction with the membrane. This loop is conserved in the StART domains of several other STARD proteins and seems to modulate access to the ligand binding cavity. Ω-loops play multiple roles in protein function, often related to ligand binding, stability and folding. In this context, mutations in the proximity of the Ω1-loop, like the ones carried by the patients, may have drastic effects on overall protein stability that could affect the interaction between gonadal precursor cells.

      1. Do the cv-c mutants survive to adulthood? If yes, then it would be interesting to know how the adult testis behaves in cv-c mutants. Does it result in sterility?

      Response: Unfortunately, all studied cv-c mutants are embryonic lethal.

      1. Ensheathment is required for proper germline development and defects in ensheathment can affect soma-germline communication and germline development. Germ cell ensheathment affects the proliferation of germ cells and display defective JAK/STAT signaling. It would be interesting to know if the germ cells in cv-c mutant gonad show the proliferation defect and impaired JAK/STAT signaling.

      Response: This is an interesting suggestion. JAK/STAT signalling has a male specific function that could explain why cv-c gonadal defects are male specific. We are going to study how cv-c affects STAT signalling in the male gonad. We are currently preparing stocks combining 10XSTAT::GFP reporter with cv-c mutants and preparing samples for anti-STAT labelling. We will also analyse if embryos lacking STAT activation, activate cv-c expression in the testes.

      1. I was also wondering if the authors have examined the number of germ cells in the mutant gonads.

      Response: Yes, we have counted the number of germ cells in cv-c mutants and, if anything, there are more. We initially considered that an excess of GC proliferation could be the cause of gonad disruption. However, we have discarded this hypothesis as phospho-histone 3 stainings did not show a significant increase of GC divisions. Moreover, when we blocked cell proliferation in cv-c’ mutant gonads using UAS-p21, the testes phenotype was not rescued. We are unsure what could be responsible for the slight increase of germ cells observed.

      1. In addition, I think the quality of the images should be improved.

      Response: We have changed the colours used in the confocal images and amplified the relevant regions in all panels. We thank both referees for this suggestion as these changes have improved the figure contrast.

      Minor comments:

      1. cv-c mRNA in Figure 2 panels (Fig. 2D) should be in italics.

      Response: We have changed it.

      1. There is no scale bar in Figure panels. In addition, there is no scale bar in the zoomed images in Figure 2. Scale bars should be consistently put in the all the Figures, in particular on the first panels of the Figures.

      Response: We have added scale bars to all panels.

      1. In the line 677, the manuscript says "arrowhead". There are no arrowheads but the arrows.

      Response: Corrected

      1. Please be consistent with the labels in Figure panels: Vasa is shown in capital while Eya is not.

      Response: Corrected

      1. Please be consistent with the labeling of the Figure panels: Figure 3A vs Figure 4a.

      Response: Corrected

      1. What does the asterisk signify in Figure 2? There is no mention of asterisk in the Figure 2 legend.

      Response: The meaning of the asterisk was explained in the figure legend.

      1. There is no grey channel (sagittal view) for the panels Figure 3I and J.

      Response: We have already included sagittal views in the figure.

      1. Please be thorough in labeling the genotypes in Figures. For instance, Figure 4c depict the % of normal testis in cv-c delta StART. However, the correct genotype is twi>Cv-c StART. In addition, in Figure 4c graph, cv-c mut should be cv-cGAPmut.
      2. Please be consistent with the depiction of the "START" domain of the protein throughout the manuscript. In figure 4c for instance, it is "START" in the graph while in the figure panel 4i, it is StART.
      3. In Figure 4b, it is written DLC3-GA. Did the authors mean DLC3-S993N?
      4. In line 723, it should be anti-beta catenin.

      Response: As suggested, we have unified figure labelling.

      1. The authors have shown two images to suggest that cv-c mutant gonad depict the germ cell blebbing (Figure 3I and J). I think it would be much better to put up a graph showing the number or percentage of cv-c mutant gonads displaying the germ cell blebbing than putting two images with the same information.

      Response: We have already done the quantification and added the data as a graph in figure (3J).

      1. The previous comment is also true for Figure 6H and I. In both the panels, the authors wish to show discontinuous ECM marked by Perlecan expression in cv-c mutant gonads. I think it would be better to display a score of the number of mutant gonads depicting the discontinuous ECM.

      Response: We are repeating stainings to quantify Perlecan disruption in cv-c mutants and we will display the results as a graph in figure 6.

    1. Author Response

      Reviewer #1 (Public Review):

      The layered costs and benefits of translational redundancy by Raval et al. aim to investigate the impact of gene copy number redundancy on E. coli fitness, using growth rate in different media as the primary fitness readout. Genes for most tRNAs and the three ribosomal RNAs are present in multiple copies on the E. coli chromosome. The authors ask how alterations in the gene copy number affect the growth rate of E. coli in growth media that support different rates of growth for the wild type.

      While it was shown before that mutants with reduced numbers of ribosomal RNA operons grow at reduced rates in rich medium (LB), this study extends these findings and reaches some important conclusions:

      1) In a poor medium (supporting slow growth rates), the mutants with fewer rRNA operons actually grow faster than the wild type, showing that redundancy comes at a cost.

      2) The same is true for mutants with reduced gene copy number of certain tRNAs and correlates with slower rates of protein synthesis in these mutants.

      3) That rRNA operon gene copy number is more decisive for growth rate than any tRNA gene copy number (>1).

      In addition, measurements of strains with deletions of genes encoding tRNA-modification enzymes that affect tRNA specificity are included. While interesting, no unifying conclusion could be reached on the impact of these mutations on growth rate.

      Thank you for this clear summary of our work.

      The well-known "growth law" relationships between growth rate and macromolecular composition (RNA/protein ratio, for example) specifically concern steady-state growth rates. It is concerning that all growth rates in this work were measured on cultures that were only back-diluted 1:100 from overnight LB precultures. That only allows 6-7 doubling times before the preculture OD is reached again. The exponential part of growth would end before that, allowing perhaps only 3-4 generations of growth in the new medium before the growth rate was measured. Thus, the cultures were not in balanced growth ("steady state") when the measurements were made, rather they were presumably in various states of adapting to altered nutrient availability.

      A detailed connection with exact growth rate laws indeed requires growth rate measurement in steady-state. Hence, we refrained from making such a connection in this manuscript, though it would be an interesting future avenue to explore. Our main goal here was to ask how E. coli growth rate is affected by external nutrient availability and internal translation components. For this, the key comparisons involve the WT vs. gene deletion mutations, and rich vs. poor growth media. For any given comparison, strains were tested under identical conditions and experimental protocols, and hence we can address our main questions without the need to obtain steady-state growth. As an aside, we note that the nutrient fluctuations inherent in such experiments may also be more relevant than steady-state growth for natural bacterial populations.

      As noted by the reviewer, we measured fitness only in a relatively narrow growth regime of several doublings; but we do capture exponential growth by focusing on the early data points (representing the exponential phase) for our growth rate calculations. We have now explicitly mentioned this in the methods section “Measuring growth parameters”.

      A second concern is the use of the term "tRNA expression levels" in the text in Figure 4. I believe the YAMAT-seq method reports on the fractional contribution of a given tRNA to the total tRNA pool. Thus, since the total tRNA pool is larger in fast-growing cells than in slow-growing cells, a given tRNA may be present at a higher absolute concentration in the fast than in the slow-growing cells but will be reported as "higher in poor" in figure 4, if the given tRNA constitutes a smaller fraction of the total tRNA pool in rich than in poor medium. For this reason, the conclusions regarding the effect of growth medium quality on tRNA levels are not justified.

      Thank you for this important point. We agree that our phrasing was incorrect, and we have modified the relevant text and figures accordingly. The fractional contribution of a given tRNA isotype to the total tRNA pool is still useful to compare, and is justified as now rephrased.

      Reviewer #2 (Public Review):

      Raval et al. by creating a series of deletion mutants of tRNAs, rRNAs, and tRNA modifying enzymes, have shown the importance of gene copy number redundancy in rich media. Moreover, they successfully showed that having too many tRNAs in poor media can be harmful (for a subset of the examined tRNAs). Below, please find my comments regarding some of the methodologies, conclusions, and controls needed to stratify this manuscript's findings.

      Figure 2 presents Rrel as a relative measurement (GRmut/GRwt). Therefore, I'm confused as to how Rrel can be negative, as shown in supplemental file 3 (statistics).

      We apologize for the confusion. Supplemental file 3 shows details of the statistical analysis (not raw data), and we included the effect size here (mean difference between the WT and the mutant relative growth rate) along with statistical significance. Thus, if the rel R of a given mutant is 1.1, the mean difference would be (1–1.1) = –0.1, meaning that it is performing 10% better than the WT.

      The “raw” relative growth rates are provided in source data files (labeled figure-wise), and there are no negative values there, as expected.

      We have now explicitly (and separately) referenced the source and statistics data files in the data analysis section in the methods, and in each figure legend. We hope this avoids confusion and makes it easier for readers to find the correct file.

      Does Figure 3 show the mean of 4 biological replicates or technical replicates? It should be stated clearly in the legend of figure 3.

      All replicates are biological replicates until unless stated otherwise. This is now stated in the methods (lines 185-187), and in the figure legends.

      Do all strains (datapoint on figure 3 left panel) significantly perform better than the WT in nutrient downshift? Looking at supplemental file 3 I see this is not the case. Please mark the statistically significant points. I suggest giving each set a different symbol/shape and coloring the significant ones in red.

      We had considered indicating statistical significance in the plot, but decided not to do so because it was difficult to show the many potentially useful layers of information without cluttering the plot. One other practical difficulty was that each point in the figure represents two values: one from the upshift (Y axis) and one from the downshift (X axis). For some mutants the fitness difference was significant in only one direction, so it was not straightforward to indicate significance. Further, our main goal here was to show where strains from different deletion Sets (Figure 1) fall in this plot (i.e. which quadrant they occupy), and so we wanted to ensure that points were easily distinguished by Set. In the text we do not include statistically non-significant points in the summary of observed patterns, and refer readers to information on statistical significance provided in the supplemental file.

      Another issue is that in the statistics of figure 2 (in supplemental file 3), positive values reflect cases where the mutant performs poorly compared to the WT, while in figure 3 the negative values indicate this. Such discrepancy is not very clear. And again, how can Rrel be negative?

      As noted in response to an earlier comment, Rrel values (given in source data files) are not negative, but effect sizes (given in supplemental file with statistics) may be negative or positive since they show differences in the relative growth rate of WT and mutant. We agree that the discrepancy between the calculation of mean difference for Figs 2 and 3 was confusing. We have now fixed this: in both cases, negative mean difference values now indicate that the mutant performs better.

      Both axes say glycerol. What about galactose?

      The typo has been corrected.

      Lines 414-419: The authors state that "all but one had a growth rate that was comparable to WT (16 strains) or higher than WT (10 strains) after transitioning from rich to poor media (i.e. during a nutrient downshift, note data distribution along the x-axis in Fig 3; Supplementary file 3). In contrast, after a nutrient upshift, 11 strains showed significantly slower growth in one or both pairs of media, and only 2 showed significantly faster growth than WT (note data distribution along the y-axis in Fig 3; Supplementary file 3)".

      Looking at the Rrel values when transitioning from TB to Glycerol and vice versa suggests no direction in the effect of reducing redundancy. During downshift, four strains perform better, and three strains perform worse than the WT. During upshift, four stains perform better, and six strains perform worse. Only during downshift and upshift from TB to Gal and vice versa give a strong signal.

      The authors should write it clearly in the text because the effect is specific to that transition/conditions and not of general meaning is written in the text (e.g., transition from every rich to every poor media and vice versa). I am convinced that the authors see an actual effect when downshifting or upshifting from TB to galactose and vice versa. In that case, the conclusion is that redundancy is good or bad depending on the conditions one used and not as a general theme.

      Also, this is true just for some tRNAs, so I don't think the conclusion is general regarding the question of redundancy.

      The fitness impacts of altered redundancy are best explained by a combination of multiple factors (in addition to nutrient availability): the number of tRNA genes deleted, number of tRNA gene copies remaining as a backup, availability of wobble or ME as backup, and codon usage. Thus, any of these variables alone would provide only partial explanation for the observed fitness effects of all strains.

      In many tRNA deletion strains – especially single gene deletions – redundancy was not significantly lowered by the deletion, as we explain in the results section. These strains were therefore not expected to show major fitness impacts or follow strong nutrient dependent trends, and this is what we observe.

      The same is true for nutrient upshift-downshift experiments, where a vast majority of strains were not expected to show a specific pattern because they do not show significant fitness impacts in general, nor do they show a strong correlation in relative fitness impacts vs. growth rate (Figure 1d). In addition, in these experiments the difference between the two media also matters. For example, comparing the maximum WT growth rate, M9 Gal is poorer than M9 Glycerol. Therefore, shifts between TB-Gal are nutritionally more drastic than TB-Gly shifts, and one would expect a larger fitness impact in the former (for strains with significantly altered redundancy). Hence, despite differences across media pairs, our broader conclusions about the impact of redundancy are generalizable as long as redundancy and nutrients are both substantially altered, e.g. due to deletion of 3 tRNA genes, deletion of tRNA+ME, or deletion of multiple rRNA operons.

      Figures are indicated differently along the text. Sometimes they are written "figure X", sometimes FigX. Referring to the supplemental figures are also not consistent.

      We have now corrected this.

      Line 443-444: "In fact, 10 tRNAs were significantly upregulated in the poor medium relative to the rich medium".

      This result contradicts the author's hypothesis. If redundancy is bad in poor media because the cells have more tRNAs than they need, the tRNAs level will be downregulated, not upregulated. How do the authors explain this?

      This statement referred to the WT strain, and was meant to highlight that (as noted by the reviewer) some tRNAs appear to be upregulated in poor medium, which is counterintuitive. However, as noted by reviewer 1 (see their comment on the interpretation of YAMAT-seq data), we can only infer the relative contribution of each tRNA isotype to the total tRNA pool (rather than absolute up- or down- regulation). Thus, we have removed this specific sentence, and instead we focus on the mismatch between the media-dependent changes in the composition of the tRNA pool and the fitness effects of different tRNA isotypes (lines 475-482).

      Line 445-447: "In contrast (and as expected), all tested tRNA deletion strains had lower expression of focal tRNA isotypes in the rich medium (Fig 4B, left panel), showing that the backup gene copies are not upregulated sufficiently to compensate for the loss of deleted tRNAs". It is great that the authors validated the expression in their strains. However, for accuracy, please indicate that it was done in four strains to avoid the impression that they did it in all the strains.

      We have now reworded this sentence to remind readers that we measured 4 tRNA deletion strains in this experiment.

      Finally, across the manuscript, the authors reveal that deleting some tRNAs or modifying enzymes can be deleterious in rich media or advantageous in poor media. However, I think this result and the conclusions derived from it could be more convincing if the authors would show in a subset of their strains that expressing the deleted tRNAs or modifying enzymes from a plasmid can rescue the phenotype.

      Thank you for this suggestion. For a small subset of strains, we now include data showing that complementation from a plasmid indeed rescues the deletion phenotype (Fig 2 – Fig supplement 7).

      Reviewer #3 (Public Review):

      In this manuscript, Raval et al. investigated the cost and benefit of maintaining seemingly redundant components of the translation machinery in the E. coli genome. They used systematic deletion of different components of the translation machinery including tRNA genes, tRNA modification enzymes, and ribosomal RNA genes to create a collection of mutant strains with reduced redundancy. Then they measured the effect of the reduced redundancy on cellular fitness by measuring the growth rate of each mutant strain in different growth conditions.

      This manuscript beautifully shows how maintaining multiple copies of translation machinery genes such as tRNA or ribosomal RNA is beneficial in a nutrient-rich environment, while it is costly in nutrient-poor environments. Similarly, they show how maintaining parallel pathways such as non-target tRNA which directly decodes a codon versus target tRNA plus tRNA modifying enzymes which enable wobble interactions between a tRNA and a codon have a similar effect in terms of cost and benefit.

      Further, the authors show the mechanisms that contribute to the increased or reduced fitness following a reduction in gene copy number by measuring tRNA abundance and translation capacity. This enables them to show how on one hand reduced copy numbers of tRNA genes result in lower tRNA abundance in rich growth media, however in nutrient-limiting media higher copy number leads to increased expression cost which does not lead to an increased translation rate.

      Overall, this work beautifully demonstrates the cost and benefits of the seemingly redundant translation machinery components in E. coli.

      Thank you for the clear summary and encouraging comments.

      However, in my opinion, this work’s conclusion should be that the seeming redundancy of the translation machinery is not redundant after all. As mentioned by the authors, it is known that tRNA gene copy number is associated with tRNA abundance (Dong et al. 1996, doi: 10.1006/jmbi.1996.0428), this effect is also nicely demonstrated by the authors in the section titled “Gene regulation cannot compensate for loss of tRNA gene copies”. Moreover, this work demonstrates how the loss of the seeming redundancy is deleterious in a nutrient-rich environment. Therefore, I believe the experiments presented in this work together with previous works should lead to the conclusion that the multiple gene copies and parallel tRNA decoding pathways are not redundant but rather essential for fast growth in rich environments.

      The point is well taken. However, as described in the introduction, here we focus on functional redundancy at the cellular level, where there are multiple ways of achieving the same translation rate. Hence we say that translation components are redundant at this level of analysis. One of the key conclusions from our work is that such redundancy is context-dependent, i.e. it is essential when rapid growth is possible, but is costly and dispensable otherwise. Therefore, we show that the definition of redundancy itself changes with environmental conditions.

      The following analogy may help convey this. There may be many ways to reach a flight on an airport: multiple entrances, multiple check-in and security check counters, multiple boarding gates, etc. On a deserted airport these may seem redundant and even costly to maintain. On the other hand, they have a utility when traffic is high. Hence even though from a purely architectural perspective the multiple routes are redundant, from a utilitarian perspective it depends on the flux of passengers.

    1. Authors’ response (5 November 2022)

      GENERAL ASSESSMENT

      Piezo1 and Piezo2 are stretch-gated ion channels that are critically important in a wide range of physiological processes, including vascular development, touch sensation and wound repair. These remarkably large molecules span the plasma membrane almost 40 times. Cryo-EM and reconstitution experiments have shown that Piezos adopt a cup-like structure and, by doing so, curve the local membrane in which they are embedded. Importantly, membrane tension is a key mediator of Piezo function and gating, an idea well-supported several independent studies. Cells have varied three-dimensional shapes and are dynamic assemblies surrounded by plasma membranes with complex topologies and biochemical landscapes. How these microenvironments influence mechanosensation and Piezo function are unknown.

      The current preprint by Zheng Shi and colleagues asks how the shape of the membrane influences Piezo location. The authors use creative approach involving methods to distort the plasma membrane by generating “blebs” and artificial “filopodia”. Overall, the work convincingly shows that the curvature of the lipid environment influences Piezo localization. Specifically, they show that Piezo1 molecules are excluded from filopodia and other highly curved membranes. These experiments are well controlled and the results fully consistent with previous structural and biochemical work. Furthermore, the work explores the hypothesis that a chemical modulator of Piezo1 channels called Yoda1 functions by “flattening” the channels, a movement previously proposed to be linked to mechanical gating. Consistent with this model, the authors show that Yoda1 application is sufficient to allow Piezo1 channels to enter filopodia. While the flattening model is provocative hypothesis, hard evidence awaits structural verification.

      Overall, the preprint by Shi and colleagues will be of interest to scientists studying how mechanical forces are detected at the molecular level. The work introduces important concepts regarding how the shape of cellular membranes affects the movement and function of proteins within it. The technical advance for changing the shape of a plasma membrane is of note. 

      We thank the reviewers for the accurate summary and positive assessments of our manuscript. We address each of the concerns below.

      RECOMMENDATIONS

      Revisions essential for endorsement:

      As is evident from the comments below, our endorsement of the study is not dependent on additional experiments. However, we feel more experimental clarification is needed, that providing clearer images would be helpful, and, most importantly, we would like alternative conclusions and caveats to be mentioned.

      1. Can the authors comment on the link between the conclusions that (1) the presence of filopodia prohibits Piezo1 localization (Fig 1) and (2) Piezo1 expression prohibits the formation of filopodia (Fig 3). As it stands, it is hard to understand if there is a cause and effect relationship here or if these are separate, unrelated observations? We recommend revising the discussion to clarify.

      We now clarify the link between Piezo1’s curvature sensing (depletion from filopodia) and its inhibition effect on filopodia formation before presenting the current Fig. 5: “Curvature sensing proteins often have a modulating effect on membrane geometry. For example, N-BAR proteins, which strongly enrich to positive membrane curvature, can mechanically promote endocytosis by making it easier to form membrane invaginations (Shi and Baumgart, 2015; Sorre et al., 2012). Thus, we hypothesize that Piezo1, which strongly depletes from negative membrane curvature (Fig. 1, Fig. 2), can have an inhibitory effect on the formation of membrane protrusions such as filopodia.”

      2. When comparing the images of Fig. 2A, B to those of Fig. 2C, D, it appears that bleb formation induces a drastic enrichment of Piezo1 in the bleb membrane. Is this due to low membrane tension in the bleb? If this is the case, it indicates that the level of membrane tension has a prominent role in determining the localization of Piezo1.

      We apologize for this confusion due to our poor wording and figure presentation in the manuscript. By “Piezo1 clearly locates to bleb membranes” we didn’t mean to indicate that Piezo1 is enriched on bleb membranes as compared to the cell body. Rather, we meant to emphasize Piezo1’s localization to the *membrane* of the blebs rather than in the cytosolic space.

      Cells in 2C, 2D are different from that in 2A and 2B and were presented with different image contrasts. We now include the images of the full cell for Fig. 2C and 2D as the current Figure S8. To focus on the equator of the bleb, the cell body was out of focus. However, there is no indication that Piezo1 density is significantly different between the bleb membrane and the intact parts of the plasma membrane.

      We changed the main text to: “Similar to previous reports (Cox et al., 2016), bleb membranes clearly contain Piezo1 signal, but not significantly enriched relative to the cell body (Fig. 2C, 2D; Fig. S8).”

      In line with this, it appears more Piezo1 proteins are localized in less tensed tethers. Thus, might your observations be equally consistent with tension rather than curvature as a key regulator of Piezo1 localization? We recommend adding this to your discussion.

      We now explain the deconvolution between tension and curvature effects in detail. We also performed additional experiments to quantify the membrane tension in cells and blebs (current Fig. S9).

      In the Results section, we add: “Tethers are typically imaged > 1 min after pulling, whereas membrane tension equilibrates within 1 s across cellular scale free membranes (e.g., bleb, tether) (Shi et al., 2018). Therefore, the sorting of Piezo1 within individual tension-equilibrated tether-bleb systems (Fig. 2C – 2G) suggests that membrane curvature can directly modulate Piezo1 distribution beyond potential confounding tension effects.”

      In the Discussion section, we add: “In addition to membrane curvature, tension in the membrane may affect the subcellular distribution of Piezo1 (Dumitru et al., 2021). Particularly, membrane tension can activate the channel and potentially change Piezo1’s nano-geometries. This tension effect is unlikely to play a significant role in our interpretation of the curvature sorting of Piezo1 (Fig .2): (1) HeLa cell membrane tension as probed by short tethers (Fig. S9F; 45 ± 29 pN/ µm on blebs and 270 ± 29 pN/ µm on cells, with the highest recorded tension at 426 pN/ µm) are significantly lower than the activation tension for Piezo1 (> 1000 pN/µm (Cox et al., 2016; Lewis and Grandl, 2015; Shi et al., 2018; Syeda et al., 2016)). (2) With more activated (and potentially flatten) channels under high membrane tension, one would expect a higher density of Piezo1 on tethers pulled from tenser blebs. This is the opposite to our observations in Fig. 2C - 2G, where Piezo1 density on tethers was found to decrease with the absolute curvature, thus tension (eq. S6), of membrane tethers.”

      3. Given the intrinsically curved structure of Piezo1, it is difficult to understand the model’s prediction that curved Piezo1 is not enriched in 25-75 nm invaginations. Where will Piezo1 normally reside in the plasma membrane? It would be helpful if this could be discussed.

      The spontaneous curvature from our model _C_0 (_C_0-1 = 83 ± 17 nm, the value is updated after refitting to more data points collected for Fig. 2G) represents a balance between the intrinsic curvature of Piezo1 trimers (0.04 ~ 0.2 nm-1 as suggested by CryoEM studies(Haselwandter et al., 2022; Lin et al., 2019; Yang et al., 2022)) and that of the associated membrane (0 nm-1, assuming lipid bilayers alone do not have an intrinsic curvature). We now refer to _C_0 as the “spontaneous curvature of the Piezo1-membrane complex” throughout the manuscript, rather than the “spontaneous curvature of Piezo1”.

      Our model, when extrapolated to membrane invaginations, predicts a weak enrichment of Piezo1 on ~100 nm invaginations (peak at 83 nm), but a depletion of Piezo1 on more highly curved invaginations. This is simply because it would be energetically costly to fit a protein-membrane complex to a curvature that is different from what the complex prefers (in the case of 25-75 nm membrane invaginations, the membrane curvature would be too high for the Piezo1-memrbane complex).

      However, it is worth pointing out that Piezo1-membrane complex may not present the same spontaneous curvature on positively and negatively curved membranes. More importantly, we do not yet have direct evidence to show that this depletion indeed happens in the exact range of invagination curvature we predicted. We now acknowledge this limitation in the Discussion section: “However, it is worth noting that we assumed a zero spontaneous curvature for membranes associated with Piezo1 and that the spontaneous curvature of Piezo1-membrane complex is independent of the shape of surrounding membranes. These assumptions may no longer hold when studying Piezo1 in highly curved invaginations or liposomes (Lin et al., 2019).”

      We also took this opportunity to verify the key prediction from the extrapolated model - that Piezo1 would enrich towards ~ 100 nm radius cell membrane invaginations. To achieve this, we utilized a recent development in nanotechnology, pioneered by Wenting Zhao and Bianxiao Cui’s labs (Lou et al., 2019; Zhao et al., 2017). An illustration of the experimental design and detailed findings are summarized in the current Fig. 3 and briefly discussed below.

      In collaboration with Wenting Zhao’s lab, we cultured cells on precisely engineered nanobars with curved ends and flat central regions. For a labelled membrane protein of interest, the end-to-center fluorescence ratio would report the protein’s curvature sorting ability. We find that Piezo1 enriches to the curved ends of nanobars, whereas membrane marker signals are homogeneous across the entire nanobar (Fig. 3). The finding achieved strong statistical significance via hundreds of repeats on nanobars of the exact same geometry, a major technical strength of our chosen system. Furthermore, the enrichment of Piezo1 was observed on nanobars with 3 different curvatures (corresponding to diffraction-limited radii between 100 to 200 nm) and qualitatively agrees with our model (current Fig. S10). While further investigations on a wider range of membrane curvature are required to fully map out the sorting of Piezo1 on membrane invaginations, our data in the current Fig. 3 clearly verifies the prediction that membrane curvature can lead to enrichment of Piezo1 on cellular invaginations.  

      We now refer to this new finding in the Abstract, along with the previously observed depletion of Piezo1 on filopodia. We present a detailed description of the experiment and associated findings in the Results and the Method sections.

      4. It is currently unknown whether and how long Yoda1 might keep Piezo1 in a flattened state. Given that Yoda1 is highly hydrophobic, it might affect membrane properties instead of the curvature of Piezo1. These caveats should be discussed.

      We thank the reviewers for pointing out the potential effect of Yoda1. We did additional experiments to confirm that on Piezo1-KO cells, Yoda1 molecules alone do not significantly alter the formation of filopodia, in contrast to observations in WT cells. This data suggests Yoda1 (at the concentration we use) is unlikely to significantly alter the mechanical properties of the plasma membrane. The data is now presented as Fig. 5E in the updated manuscript. We added: “In Piezo1 knockout (Piezo1-KO) cells, adding Yoda1 to the culture medium does not significantly change the number of filopodia (Fig. 5E), suggesting the agonist does not directly regulate filopodia formation without acting on Piezo1.”

      5. The authors state that “Yoda1 leads to a Ca2+ independent increase of Piezo1 on tethers”. It has not been determined yet that Yoda1 leads to Piezo1 flattening (or even opening). In Electrophysiology experiments, unless there is pressure applied, Yoda1 does not lead to substantial currents. Therefore, the cartoon of Yoda1 flattening Piezo1(3H) is misleading. We recommend revising this. So far, the best experimental evidence on flattening is via purified channels reconstituted in various sizes of liposomes. However, it is plausible that the flattened shape is closed or open inactivated. Because most of the claims of this paper depend on the curved vs flattened shape of Piezo1, the authors should address these caveats carefully.

      We thank the reviewers for pointing out the limitations in our current understanding of Yoda1. We agree that our data do not directly show the flattening of Piezo1 by Yoda1, rather it is consistent with the flattening hypotheses. We lowered the tone of our conclusion to Fig. 4 to: “Our study suggests this conformational change of Piezo1 may also happen in live cells (Fig. 4H).” We also added arrows in Fig. 4H to suggest that membrane tension helps the proposed flattening of Piezo1 by Yoda1.

      We think our experiment may also provide new insights on the action of Yoda1: First, we note that only a small fraction of filopodia responded to Yoda1, and pre-stressing of the cell membrane was required to amplify the Yoda1 effect (current Fig. 4E). This observation is consistent with the reviewers’ notion that membrane tension is likely required to flatten Piezo1, even in the presence of Yoda1. Secondly, highly curved liposome or detergents can confine the shape of Piezo1 trimers. Therefore, the inability to observe Yoda1-induced flattening of Piezo1 in small liposomes is not necessarily in contradiction with our observation in the mostly flat cell membranes.

      We add to the Discussion section: “Yoda1 induced flattening of Piezo1 has not been directly observed via CryoEM. Our results (Fig. 4) point to two challenges in determining this potential structural change: (1) Yoda1 induced changes in Piezo1 sorting is greatly amplified after pre-stretching the membrane (Fig. 4E), pointing to the possibility that a significant tension in the membrane is required for the flattening of Yoda1-bound Piezo1. (2) Piezo1 is often incorporated in small (< 20 nm radius) liposomes for CryoEM studies. The shape of liposomes can confine the nano-geometry of Piezo1 (Lin et al., 2019; Yang et al., 2022), rendering it significantly more challenging to respond to potential Yoda1 effects. This potential effect of membrane curvature on the activation of Piezo1 would be an interesting direction for future studies.”

      6. Page 9: "Our study shows this conformational change of Piezo1 in live cells (Fig. 3H)." We recommend that this claim be removed as it seems too strong for the provided data.

      We changed the sentence to: “Our study suggests this conformational change of Piezo1 may also happen in live cells (Fig. 4H).”

      Additional suggestions for the authors to consider:

      1. Based on the calculated spontaneous curvature of Piezo1-membrane C0 of 87 nm, is it possible to derive the curvature of Piezo1 protein itself and the associated membrane footprint? This would be a nice addition.

      It is possible to do such an estimation, however, many (unverified) assumptions must be made, in addition to the ones already in our model. First, we need to assume a size of the Piezo1 trimers and of the Piezo1-membrane complex. If we assume Piezo1 trimers are ~170 nm2 in the plane of lipid bilayers (based on estimates from PDB) and that the complex takes on the shape of a 10 -20 nm radius half-sphere. Effectively, Piezo1 occupies an area fraction of 6.7%~27% in the Piezo1-membrane complex. Next, we assume that the membrane and the Piezo1 trimer have the same bending rigidity. Finally, we assume that the membrane itself does not have an intrinsic curvature.

      With those assumptions, the intrinsic curvature of Piezo1 trimers (_C_p) would relate to the spontaneous curvature of membrane-Piezo1 complex (_C_0) following: _C_p-1 = _C_0-1 * (6.7%~27%). Knowing _C_0-1 = 83 ± 17 nm, we get _C_p-1 = 5.6 nm ~ 22.4 nm.

      2. It is hard to see the filopodia and their localization in the figures. It would be better for readers and more convincing if clearer/higher resolution example images could be provided.

      We now provide high resolution figures.

      3. Can the authors better explain how the calculations done in panel 1C and S3D are done and their importance?

      Each fluorescence trace along the drawn yellow line was normalized to the mean intensity on the corresponding flat cell body, so that the average fluorescence of the cell body has a y-axis value of 1. We think the intensity traces are important because image contrast can be adjusted, therefore Fig. 1A alone would not convincingly show that there are no Piezo1 on filopodia.

      4. In Figure 2E, are these data from hPiezo1 or mPiezo1? In other cases, hPiezo1 is specified, this this may be a typo?

      Corrected.

      5. Figure 3 F&G: We assume these cells are the same in all panels, just visualized with either mCherry or eGFP in each condition. Accordingly, we would have expected more swelling in hypotonic conditions, and wonder if further evaluation may resolve this apparent discrepancy? If not, please provide more clarification.

      This is a good point. Indeed, we do observe a significant swelling of the cell right after the hypotonic shock.

      However, this effect is expected to be transient (volume of the cell would recover after ~ 1 min), see Figure. 1C here: https://www.pnas.org/doi/10.1073/pnas.2103228118. Our images in Fig. 3F and 3G were taken ~10 min after the hypotonic shock.

      6. On a lighter note, we’d recommend not using in cellulo.

      We changed in cellulo to “in live cells”

      Reference List

      Cox, C.D., Bae, C., Ziegler, L., Hartley, S., Nikolova-Krstevski, V., Rohde, P.R., Ng, C., Sachs, F., Gottlieb, P.A., and Martinac, B. (2016). Removal of the mechanoprotective influence of the cytoskeleton reveals PIEZO1 is gated by bilayer tension. Nature Communications 7, 1-13.

      Dumitru, A.C., Stommen, A., Koehler, M., Cloos, A., Yang, J., Leclercqz, A., Tyteca, D., and Alsteens, D. (2021). Probing PIEZO1 Localization upon Activation Using High-Resolution Atomic Force and Confocal Microscopy. Nano Letters 21, 4950-4958.

      Haselwandter, C.A., MacKinnon, R., Guo, Y., and Fu, Z. (2022). Quantitative prediction and measurement of Piezo's membrane footprint. bioRxiv

      Lewis, A.H., and Grandl, J. (2015). Mechanical sensitivity of Piezo1 ion channels can be tuned by cellular membrane tension. Elife 4, e12088.

      Lin, Y., Guo, Y.R., Miyagi, A., Levring, J., MacKinnon, R., and Scheuring, S. (2019). Force-induced conformational changes in PIEZO1. Nature 573, 230-234.

      Lou, H., Zhao, W., Li, X., Duan, L., Powers, A., Akamatsu, M., Santoro, F., McGuire, A.F., Cui, Y., and Drubin, D.G. (2019). Membrane curvature underlies actin reorganization in response to nanoscale surface topography. Proceedings of the National Academy of Sciences 116, 23143-23151.

      Shi, Z., and Baumgart, T. (2015). Membrane tension and peripheral protein density mediate membrane shape transitions. Nature Communications 6, 1-8.

      Shi, Z., Graber, Z.T., Baumgart, T., Stone, H.A., and Cohen, A.E. (2018). Cell membranes resist flow. Cell 175, 1769-1779. e13.

      Sorre, B., Callan-Jones, A., Manzi, J., Goud, B., Prost, J., Bassereau, P., and Roux, A. (2012). Nature of curvature coupling of amphiphysin with membranes depends on its bound density. Proceedings of the National Academy of Sciences 109, 173-178.

      Syeda, R., Florendo, M.N., Cox, C.D., Kefauver, J.M., Santos, J.S., Martinac, B., and Patapoutian, A. (2016). Piezo1 channels are inherently mechanosensitive. Cell Reports 17, 1739-1746.

      Yang, X., Lin, C., Chen, X., Li, S., Li, X., and Xiao, B. (2022). Structure deformation and curvature sensing of PIEZO1 in lipid membranes. Nature 1-7.

      Zhao, W., Hanson, L., Lou, H., Akamatsu, M., Chowdary, P.D., Santoro, F., Marks, J.R., Grassart, A., Drubin, D.G., and Cui, Y. (2017). Nanoscale manipulation of membrane curvature for probing endocytosis in live cells. Nature Nanotechnology 12, 750-756.

      (This is a response to peer review conducted by Biophysics Colab on version 1 of this preprint.)

    1. Meillassoux is quite right to say this renders the objectivity of knowledge very difficult to understand. But why think the problem lies in presuming the artifactual nature of cognition?—especially now that science has begun reverse-engineering that nature in earnest! What if our presumption of artifactuality weren’t so much the problem, as the characterization? What if the problem isn’t that cognitive science is artifactual so much as how it is?

      Meillassoux claims that, because cognitive science is made of atoms, that makes it suspect -- so we need to use philosophy. That is a bad claim. Philosophy is also made of atoms too. Cognitive science solves how cognition works. It may not answer "why cognition works", but maybe that's a trick question that only philosophy can ask, but nobody can answer.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The rapid syncytial nuclear cycles that occur during the first ~2.5 hours of Drosophila embryogenesis and give rise to the blastoderm are supported by large amounts of maternally deposited histone proteins which are stored in the egg cytoplasm for deposition into replicating DNA during each round of S phase. Although the H2A/H2B storage chaperone Jabba was identified by Michael Welte's lab several years ago, maternal H3/H4 storage chaperones have not been identified. Tirgar et al provide evidence that the Drosophila NASP protein provides histone H3 and H4 storage function during these earliest stages of Drosophila embryogenesis. The data include genetic analyses that NASP function is required maternally, but not zygotically, and molecular analyses that NASP binds H3 and that H3 and H4 levels are reduced in the embryo and late-stage oocytes in the absence of NASP. These data are convincing and support the conclusion that NASP is a maternally acting H3/H4 storage chaperone needed in the early embryo.

      Two additional lines of investigation would strengthen this conclusion and perhaps increase the impact and appeal of the manuscript.

      The first is a microscopic analysis of the nuclear division cycles in eggs derived from NASP mutant mothers. The authors report DAPI staining and assessment of nuclear cycles, but do not show these data. In fact, the two embryos shown in Figure 4B do not look like DAPI stained embryos-there are no nuclei apparent in the images. Loss of maternal histone causes defects in chromosome morphology that result in characteristic defects such as lagging chromosomes and the failure of sister chromatid segregation leading to fused daughter nuclei (see PMID: 11157774 for an example). These defects should not be difficult to detect via DNA staining or even using fluorescently labeled H2 type histones. Characterizing such defects would lend support to the hypothesis and I think is important for this paper.

      We thank the reviewer for their constructive review and feedback. We have switched to Propidium Iodide (PI) staining to increase the signal-to-noise for DNA staining in early embryos. Given the improved signal we see with PI over DAPI, we will be able to provide both improved images of nuclear staining and assay for defects in chromosome morphology as suggested. We will include this data in the revised version of the manuscript. Second, determining the location of NASP in the early embryo might provide further insight into the mechanism of storage. i.e. is NASP located in the cytoplasm rather than the nucleus, perhaps in association with lipid droplets like Jabba? Do the antibodies the authors developed work in IF experiments to ask this question? At the moment what is shown is that NASP is present in 0-2 hour embryos via western blot analysis, supporting the conclusion that it functions in the early embryo as a storage chaperone. This analysis would be nice to have but is not essential in my view.

      We have tried to use our antibody to monitor the localization of NASP in the early embryo. Unfortunately, the staining has yet to work. We will continue to alter fixation and permeabilization conditions in the early embryo with the goal of including this data in the revised manuscript. We have, however, been able to monitor NASP localization in Drosophila S2 cultured cells with our antibody. If we are unable to get the antibody staining to work in embryos, we will include the NASP localization data in S2 cells in combination with EdU labeling to mark cells in S phase.

      Small points: Is NASP really a maternal effect "lethal"? Some of the eggs do hatch, and so some develop to stages where maternal histones are no longer necessary and zygotic production takes over (i.e. cycle 15). Perhaps consider the language used here.

      We see the reviewers point with respect to the term ‘lethal’. We do see a very small fraction of progeny laid by NASPmutant mothers make it to adulthood, although they die shortly after hatching. We’ve removed the term ‘lethal’ and refer to NASP solely as a maternal effect gene. On this point, do NASP mutant females lay the same number of eggs as wild type? i.e. is there a requirement for oogenesis/egg production (other than depositing H3/H4 into the egg), or just for the early zygotic cycles?

      We have noticed that NASP mutant mothers have lower fecundity. We have included this data in the revised manuscript as Supplemental Figure 2A.

      The first paragraph of the results is redundant with much of the introduction, which I think could do a better job at describing in more detail the syncytial cycles and the special needs they have for histone storage and chaperone function versus the post-blastoderm embryonic cycles and the rest of development. i.e. make a better distinction between the first two hours of embryogenesis versus the rest of embryogenesis, and the when the switch from maternal to zygotic control of development and histone production occurs (cycle 15 at 3-4 hours AED).

      We appreciate the reviewer for this suggestion. The manuscript has been edited to be less redundant and include details of embryogenesis as suggested. CROSS-CONSULTATION COMMENTS Seems like all reviewers are in general agreement, particularly about providing additional data regarding chromosome/nuclear behavior in the NASP mutants and NASP localization in the early embryo to increase impact of the study. While rescue of the NASP mutant phenotype with a transgene would be nice, as suggested by referee #2, I don't think it's essential given the genetic approaches employed.

      Reviewer #1 (Significance (Required)):

      see above

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Tirgar et al. report on a functional characterization of the Drosophila homolog of the histone H3/H4 chaperone NASP. They generated a loss of function allele of NASP by CRISPR/Cas9, which induces a partial maternal effect embryo lethal phenotype. Using quantitative mass spectrometry, they demonstrate that NASP stabilizes reservoirs of H3 and H4 in the early embryo. The manuscript is very clear and confirms the functional importance of maternal NASP for the early embryo. Genetic analyses are well conducted (but see my comments below) and the impact of NASP maternal mutant on H3 and H4 stockpiles is convincingly established by both quantitative mass spectrometry and Western-blotting.

      Major comments:

      • Although the authors used two independent deficiencies of the NASP genetic region to characterize their NASP CRISPR alleles, it is relatively standard in this type of functional analyses to perform rescue experiments using a transgene expressing the WT protein.

      We thank the reviewer for this suggestion. As discussed in the cross consultation, we agree that the use of the two different deficiency lines and the NASP1 CRISPR control are clear lines of evidence that the phenotypical data are due to lack of NASP.

      • In WB analyses, NASP appears systematically shorter in the NASP[1]/Df genotype compared to WT. Can the authors comment on this?

      While we reproducibly see this change in migration, we can only guess as to why this may be. One possible reason is that the NASP1 mutant protein could be missing a post-translational modification. Proteomic data from Krauchunas et al. (Dev Biol. 2012; PMC3441184) shows that NASP has the potential to be regulated by phosphorylation. Therefore, the NASP1 mutant protein could be missing a phosphorylation. Intriguingly, the 6bp insertion is next to a Thr residue that could affect its ability to be phosphorylated (if it is phosphorylated at all). Since we can only offer speculation, we do not feel comfortable adding this to the manuscript.

      • The authors do not mention the centromeric histone H3 variant Cid in their analyses. Do they have evidence that it is not affected by loss of maternal NASP?

      We thank the reviewer for raising this great point. Our mass spec data reveals that Cid levels stay the same in the absence of NASP in both embryos and stage 14 egg chambers. We have edited Figures 3D and 3E to include Cid. Unfortunately, we did not identify any Cid-specific peptides in our IP-mass spec data.

      • The authors could have chosen to explore in more details the phenotypic defects of embryos derived from NASP mutant mothers. Instead, a single abnormal embryo is shown with no cytological details. This is a bit problematic since an earlier study (Zhang et al 2018, cited in the manuscript) actually provided more phenotypic details of embryos from NASP KD mothers.

      This issue was also raised by Reviewer 1. We have switched to Propidium Iodide (PI) staining to increase the signal-to-noise for DNA staining in early embryos. Given the improved signal we see with PI over DAPI, we will be able to provide both improved images of nuclear staining and assay for defects in chromosome morphology as suggested. We will include this data in the revised version of the manuscript. - Similarly, the authors could have used their anti-NASP antibody to analyze the distribution of NASP during cleavage divisions. Does it behave like ASF1, for instance, which enters S phase nuclei at each cycle or does it remain in the cytoplasm? These are relatively simple experiments/analyses that could increase the significance of the study.

      This point was also raised by Reviewer 1. We have tried to use our antibody to monitor the localization of NASP in the early embryo. Unfortunately, the staining has yet to work. We will continue to alter fixation and permeabilization conditions in the early embryo with the goal of including this data in the revised manuscript. We have, however, been able to monitor NASP localization in Drosophila S2 cultured cells with our antibody. If we are unable to get the antibody staining to work in embryos, we will include the NASP localization data in S2 cells in combination with EdU labeling to mark cells in S phase.

      Minor comments:

      • line 60: I suggest to introduce Drosophila in the next sentence, where it seems more appropriate (not all embryos develop "extremely rapidly").

      We have edited the second sentence to state “the early Drosophila embryo”.

      • line 68: the 50% estimation of free histones does not really make sense without defining the embryonic stage.

      We have edited the manuscript to state the specific cell cycle in which there has been 50% free histones measured. - line 89: Are the authors specifically referring to Drosophila NASP?

      Yes, we have edited the text to include Drosophila in this instance. - lines 99-106: I found this paragraph redundant with the introduction.

      We appreciate this suggestion. It was also pointed out by Reviewer 1. We have made changes to the manuscript to address the redundancy.

      • line 142: H3-H4

      Thank you for noticing this. We have edited the text to include 4.

      • line190-191: It seems to me that data of Figure S2C are already included in Fig. 2E.

      The data in FigureS2C was performed with virgin females compared to the data in Figure 2E that was generated with non-virgin mothers. This was important to control the genotype of the embryos.

      • line 232: it is surprising that the Zhang et al paper (reporting maternal KD of NASP) is only mentioned here. As a reader, I would certainly prefer to have it presented right from the introduction.

      We have edited the manuscript to include this reference in the introduction.

      • Figure 4B needs a scale bar.

      Figure 4B will be replaced with better images of the embryo stained with PI. It will also include images of chromosome morphology/segregation. We will be sure to include scale bars.

      • line 302: Mentioning the identity and function of known H3/H4 histone chaperones acting in the early embryo (ASF1, HIRA, CAF-1, ...) could provide perspective to the present study.

      Thank you for this suggestion. We have edited the manuscript to include functions of other histone chaperones in the early embryo to provide context.

      • line 304: in contrast to this statement, I found quite surprising and interesting that NASP is not absolutely essential for embryo development considering its role. This should be discussed.

      In the absence of Jabba alone, upregulation of translation can compensate for the destabilization of H2A, H2B, and H2Av. It is only when translation is inhibited in embryos laid by Jabba mutant mothers that embryos die (Li.Z, et al. Curr Biol 2013). Therefore, it is possible that translation can partially compensate for the degradation of H3 and H4 in the absence of NASP. This may be why a fraction of embryos laid by NASP mutant mothers are able to hatch and why we still detect some H3 in embryos laid by NASP mutant mothers. We have edited the manuscript to discuss this more in depth.

      CROSS-CONSULTATION COMMENTS I fully agree with the other reports. The NASP rescue experiment is just a suggestion but is not essential.

      Reviewer #2 (Significance (Required)):

      This work clarifies the identity and function of Drosophila NASP and clearly demonstrates that NASP is important for the stabilization of maternal stockpiles of H3 and H4 during early embryo development. The conservation of NASP function as a histone H3/H4 chaperone in Drosophila is not really a surprise but the merit of this study is to establish this assumption as a fact. It also establishes useful tools (mutant lines and antibody) for the fly community interested in this topic. The study however does not provide new insights about the dynamic distribution of NASP and the cytological consequences of its maternal depletion on the amplification of cleavage nuclei.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: Rapid cell cycles in early embryogenesis is driven from maternally supplied stockpiles of RNA and protein, including histones H3 and H4. This study uses sequence homology searches, biochemical approaches (immunoprecipitation and mass spectrometry) and genetics to identify NASP (CG8223) as the H3-H4 chaperone in Drosophila. Using CRISPR technology, the authors generate a NASP mutant fly line and show using genetic crosses that NASP is a maternal lethal gene. Furthermore the study shows that NASP stabilises H3-H4 during oogenesis and embryogenesis and is required for early embryogenesis.

      Major comments: The key conclusions of this study are very convincing. For example, the authors use multiple approaches to show H3-H4 specific interactions with NASP and that H3-H4 protein levels are reduced in mutants (Western analyses, quantitative MS). Analysis is carried out on two individual NASP mutant lines (one deletion that produces no protein, one insertion that still produces some protein acting as a control). All experiments are well controlled, executed and presented. Genetic crossing schemes are well presented and statistical analysis of progeny is clear.

      • We thank the reviewer for their positive feedback of our manuscript. Minor comments: In Figure 1B - Authors could indicate amino acids shown or are they full length proteins?

      We have edited the methods to include specific amino residues that are included for each structure.

      In Figure 2B - Authors could (semi) quantify reduction in NASP1 mutant to show this is a gene dose effect?

      We have now included the quantification of the Western blot in Figure 2B.

      CROSS-CONSULTATION COMMENTS I agree with the other reports. Although I did not indicate it in my original report, I agree that more in depth analysis of nuclear or chromosomal defects in NASP mutant embryos would enhance the study.

      Thank you for this suggestion. We are repeating the DNA staining in embryos and will include this new data in the revised version of the manuscript.

      Reviewer #3 (Significance (Required)):

      Excess soluble histones can be toxic and must be bound to chaperones. Until this study the chaperone responsible for H3-H4 stabilisation in rapidly cycling cells in Drosophila embryos was not known. Moreover, the NASP homolog had not yet been identified in Drosophila nor had its function been characterised. The findings are of interest to Drosophila researchers, the field of chromatin assembly, as well as those interested in early embryogenesis in animals.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      An exciting development in our knowledge about how the Arp2/3 complex controls the assembly of actin networks has come from the discovery that in addition to forming branched networks, Arp2/3 can nucleate linear filaments when it is activated by WISH/DIP/SPIN90. However, despite some excellent work largely done by the Nolen lab in yeast, many questions remain about how Arp2/3-mediated assembly of branched vs. linear actin filament. This is especially true in the complex environment of cells, were synergy and competition of different actin networks is used to control biological processes. Knowing the biochemical and physical properties of these different Arp2/3 assemblies will be key to figuring out how they work in cells. Here Cao et al. use an elegant microfluidics based single filament assay system to perform a comparative analysis of the stability of linear and branched Arp2/3 networks. They find interesting differences in how they respond to stabilizing and destabilizing factors. The most striking differences happens when force or aging is applied- both cause debranching of branched networks but have little effect on Spin90-Arp2/3 nucleated filaments.

      We thank the reviewer for their positive comments.

      Major comments:

      As a comparative study on the stability of branched vs. linear Arp2/3 nucleated filaments, this manuscript is fairly complete. The key conclusions are well supported by rigorous experiments which can be reproduced by others based on the information provided. However, I am not seeing explicit information on performing biological replicates. This should be included in the manuscript. The use of statistics is largely fine; however I question the use of one statistical test on one figure (see minor comments below).

      The revised manuscript is now explicit about biological replicates. We now specify the biological repeats of all our experiments in the figure legends, and we now show the results from new repeats in Fig 4 and Supp Fig S2 (please see also our response to the minor comments below, for more details).

      I would not ask for additional experiments at this time. However, there is an analysis that would be important for interpreting the authors' claims- branch/filament length at the time of dissociation or destabilization of Arp2/3. This would help address if there was a physical tipping point for each type of structure that could explain potential differences they see. The authors should already have this data and the time to complete it would be negligible in delaying publication.

      If we understand correctly, the “physical tipping point” mentioned by the reviewer would be a threshold force, where the Arp2/3-filament interface would become unstable. This is an interesting idea. Indeed, the applied force scales with the length of the filament (or branch), as well as with the flow velocity. In most of our experiments, however, the force applied to SPIN90-Arp2/3 and to branch junctions was kept constant and below 0.2 pN. This was done by exposing the filaments (or branches) to G-actin at the critical concentration, in order to minimize variations of their lengths. Therefore, by design, dissociation events in these experiments take place at the same length, ruling out the existence of a “tipping point”.

      Our data provide another test of the reviewer’s hypothesis, thanks to the experiments where we specifically address the question of the impact of force (Fig 5 and Supp Fig S6), by varying length and flow rate. We found that the stability of SPIN90-Arp2/3 linear filaments was unaffected by force, and that debranching was steadily accelerated by force. In both cases, it thus appears that there is no detectable threshold.

      One additional major comment is that the manuscript's title and abstract hint that this paper explores the differences in nucleation of branched vs. linear filaments by Arp2/3. However, the only figure that deals explicitly with nucleation in the paper is Figure 1, which is really just a confirmation that the mammalian proteins used in this study perform similarly to their yeast homologues (Balzer et al, Current Biology 2019). The authors might think about rewording the title/abstract to better reflect that paper really explores the differences in the stability of the two networks

      This is a fair point. We have now modified the title into “Regulation of branched versus linear Arp2/3-generated actin filaments”.

      Minor comments:

      1 in 12 men and 1 in 200 women are red/green colorblind. Please change the coloring of the schematics and images so that they can be easily seen by all people. This is especially true of the schematics, which are important for understanding exactly what each assay is measuring.

      We thank the reviewer for pointing this out. We have now made the schematics and images in Figs 1A, 2A, 2D and 4D colorblind-friendly.

      The Introduction is a bit choppy and unfocused. It was difficult to deduce exactly where the paper was going from it. Please consider re-writing it for better clarity. The Discussion on the other hand was fantastic. Great job on interpreting your results in a larger context.

      We have re-written large parts of the Introduction to make it clearer. We are glad the reviewer liked the Discussion, where we have nonetheless made some small changes in response to comments from the other reviewers.

      Many figures- while the use of different lightness values of the same color is appreciated in conveying different concentrations of reagents used, there were several instances where it was very hard to read the one on the very bottom (ex. 2B, E; 3A; 5C, G).

      We have now changed the colors in these figures, to make them clearer.

      Figure 1- since this is a confirmation of previous results performed using the same proteins from other species, the title should reflect that (ex. VCA domains accelerate the nucleation of filaments by mammalian SPIN90-Arp2/3). Also, to me this figure is supplementary to the main message of the paper. The authors might think of moving it to Supplementary Information.

      We have modified the title of Figure 1, now specifying “mammalian”, following the reviewer’s suggestion. However, we prefer to keep this figure as a main figure, rather than move it to Supplementary as proposed. Indeed, this figure does more than simply confirm previous results with mammalian proteins, since it compares different VCAs, which is new. These results are important because they are put in perspective with our results on the acceleration of linear filament detachment by different VCAs, later in the manuscript.

      Figure 1- If the goal was to verify that G-actin recruitment by VCA was important for Spin90-Arp 2/3 nucleation by performing a competition experiment with profilin, why was the concentration of G-actin AND profilin increased between the experiments in 1B vs. 1C. It makes it hard to directly compare the results.

      We now provide new data in Fig 1C, which can be directly compared to Fig 1B (only the profilin concentration was increased). It clearly shows that the effect of VCA disappears when the profilin concentration is increased.

      Figure 4B-F- Here, it would be nice to see the distribution of all the individual results, which are hidden by the bar graph. Additionally, the Chi-square test is not the appropriate test for evaluating statistical significance between multiple groups. ANOVA followed by an appropriate post hoc test should be used here.

      We now show the individual results in the bar graphs of figure 4. In this situation, we agree that the statistical significance should not be evaluated by a Chi-square test. We now indicate the p-values obtained from a paired t-test, which seems appropriate since we are comparing averages in pairs.

      Figure 4G- Please quantify and show reproducibility.

      We now show quantified repeats (shown in Fig 4, new panels H and I).

      Figure 5- the piconewton forces used for these experiments is in line with measured forces that are applied to actin in cells (ex. Mehida et al, Nature Cell Biology 2021; Jiang et al, Nature 2003). The text would benefit if this was explicitly stated.

      We now state this explicitly, when presenting these results.

      Reviewer #1 (Significance (Required)):

      The real significance of this work is in characterizing the differential stabilities of linear vs. branched Arp2/3 filaments in response to actin-binding proteins, mechanical stress, and aging. While both types of filaments respond similarly to actin-binding proteins, with nuanced differences, the most striking results came from applied force and aging experiments, with Spin90-Arp2/3 filaments being much more resistant to both. This has some very interesting implications for how these two types of assemblies might synergize in cells. Additionally, the results also have some exciting implications for the pointed-end regulation of actin filaments, which is still poorly understood in complex systems. Since the manuscript is A) more of a survey study on the factors that influence filament stability that does not go particularly deep into any particular mechanism of regulation and B) has no direct applicability to how the physical properties of branched and linear Arp2/3 nucleated actin filaments influence actin network activity in cells, the audience will likely by limited to actin enthusiasts. However, the work is still important in both what it reveals and implies.

      We thank the reviewer for pointing out the novelty and the importance of our work. We agree that the significance of our paper lies in the characterization of the differential stabilities of linear vs. branched Arp2/3 filaments, in response to different physiological factors. One of the strengths of our approach is that we do not focus on one regulatory mechanism in particular. Rather, we reveal fundamental differences between the Arp2/3-generated filaments and how they can be regulated. Understanding these basic mechanisms is a prerequisite to understand the regulation of entire cytoskeletal networks.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The quantitative analysis can be improved. It appears that most of the data results from single experiments, with rate values and errors resulting from fitting of single experiments without repetitions. In Fig. 1C legend (p.5) the authors state "These experiments were repeated three times, with similar results", but the data is not used in the analysis and other experiments do not mention this point. This is particularly important for comparisons among different VCAs that are rather similar in nature. In Fig. 1B. N-WASP is more efficient in nucleating SPIN90-Arp2/3 complex-linear filaments followed by WASP and then WASH. In Fig. 2 B,C, N-WASP is the most effective in dissociating SPIN90-Arp2/3 complex linear filaments followed by WASH and then WASP. But in Fig. 2 E,F, WASH is by far the most effective in dissociating branches followed by N-WASP and then WASP. Therefore, the conclusion in the Discussion (p.12) "While these regulatory proteins similarly affect branched and linear Arp2/3-generated filaments, they do so with clear quantitative differences" is not supported by quantification. To remedy this problem the authors should include at least 3 repeats of each experiment in data analysis. Also, they could include an analysis of sequence differences among VCAs and discuss how these may correlate with the observed differences. For instance, one WH2 in WASP vs. two in N-WASP.

      Indeed, we argue that the two forms of activated Arp2/3 differ in their sensitivity to different VCA motifs, based on how these VCA motifs rank in their ability to destabilize branched and linear filaments (the VCA motifs also rank differently in their activation and co-activation of Arp2/3 to nucleate branches and linear filaments, but this result does not contribute to our discussion of how proteins interact with the activated Arp2/3). Following the reviewer’s suggestion, we now show repeats of these experiments (new Supp Fig S2), clearly showing that N-WASP is the most effective in dissociating linear filaments while the differences are milder for dissociating branches, with WASH being at least as effective as NWASP. We now also discuss how this observation could relate to differences in sequence between VCAs (Discussion section and new Supp Fig S9).

      Also, please note that, following a suggestion from Reviewer 3, we have now performed experiments with the CA-domains of NWASP (new Supp Fig S4C and S4D), which show that the V-domain plays an important role in debranching but plays no role in destabilizing SPIN90-Arp2/3 at filament pointed ends. These new results reinforce our statement that VCA affects branched and linear Arp2/3-generated filaments differently.

      Reviewer #2 (Significance (Required)):

      Arp2/3 complex is a 7-protein complex implicated in actin filament nucleation and branching. Arp2/3 complex-nucleated branched networks are found at several locations in cells and are responsible for processes such as cell motility.

      Cao et al. compare the effect of several proteins on the filament nucleation activity of Arp2/3 complex, and the stabilization or destabilization of actin filament branches as well as linear actin filaments nucleated by SPIN90-Arp2/3 complex. The proteins tested include the VCA regions of three NPFs (N-WASP, WASP, and WASH) that activate Arp2/3 complex, GMF (a debranching protein) and cortactin (a branch stabilizing protein). For the most part, the study uses a single method, microfluidics-TIRF microscopy.

      The main findings are:

      1. VCA domains enhance nucleation of linear filaments by SPIN90-Arp2/3 complex in the presence of actin monomers.
      2. However, VCA domains can also destabilize existing SPIN90-Arp2/3 complex linear filaments and branches, and this effect depends on the presence of of V-domain (WH2 domain that binds actin monomers).
      3. The debranching factor GMF also destabilizes SPIN90-Arp2/3 complex linear filaments. Both GMF and VCA generate free pointed ends by dissociating Arp2/3 complex from pointed ends and SPIN90.
      4. SPIN90-Arp2/3 complex linear filaments are less susceptible to force and aging than filament branches.
      5. Cortactin stabilizes SPIN90-Arp2/3 complex linear filaments to higher degree than it does branches. These are novel and very interesting new observations of significant interest to the actin cytoskeleton field. Therefore, I recommend publication of this paper in EMBO J.

      We thank the reviewer for their positive evaluation of our work.

      I have one recommendation and one suggestion for improvement:

      Major:

      1. The quantitative analysis can be improved. It appears that most of the data results from single experiments, with rate values and errors resulting from fitting of single experiments without repetitions. In Fig. 1C legend (p.5) the authors state "These experiments were repeated three times, with similar results", but the data is not used in the analysis and other experiments do not mention this point. This is particularly important for comparisons among different VCAs that are rather similar in nature. In Fig. 1B. N-WASP is more efficient in nucleating SPIN90-Arp2/3 complex-linear filaments followed by WASP and then WASH. In Fig. 2 B,C, N-WASP is the most effective in dissociating SPIN90-Arp2/3 complex linear filaments followed by WASH and then WASP. But in Fig. 2 E,F, WASH is by far the most effective in dissociating branches followed by N-WASP and then WASP. Therefore, the conclusion in the Discussion (p.12) "While these regulatory proteins similarly affect branched and linear Arp2/3-generated filaments, they do so with clear quantitative differences" is not supported by quantification. To remedy this problem the authors should include at least 3 repeats of each experiment in data analysis. Also, they could include an analysis of sequence differences among VCAs and discuss how these may correlate with the observed differences. For instance, one WH2 in WASP vs. two in N-WASP.

      This comment is identical to the reviewer’s first paragraph. We copy our answer here again, for convenience:

      Indeed, we argue that the two forms of activated Arp2/3 differ in their sensitivity to different VCA motifs, based on how these VCA motifs rank in their ability to destabilize branched and linear filaments (the VCA motifs also rank differently in their activation and co-activation of Arp2/3 to nucleate branches and linear filaments, but this result does not contribute to our discussion of how proteins interact with the activated Arp2/3). Following the reviewer’s suggestion, we now show repeats of these experiments (new Supp Fig S2), clearly showing that N-WASP is the most effective in dissociating linear filaments while the differences are milder for dissociating branches, with WASH being at least as effective as NWASP. We now also discuss how this observation could relate to differences in sequence between VCAs (Discussion section and new Supp Fig S9).

      Also, please note that, following a suggestion from Reviewer 3, we have now performed experiments with the CA-domains of NWASP (new Supp Fig S4C and S4D), which show that the V-domain plays an important role in debranching but plays no role in destabilizing SPIN90-Arp2/3 at filament pointed ends. These new results reinforce our statement that VCA affects branched and linear Arp2/3-generated filaments differently.

      Minor:

      In GST-pull-down experiments (Fig. 4G), the amount of Arp2/3 complex bound is analyzed by Western, which is rather unprecise. Is the amount of Arp2/3 complex so little that it cannot be quantified using regular SDS-PAGE? If that is the case, this would suggest rather low affinity of SPIN90 for Arp2/3 complex. How does this affect the proposed mechanism and experiments in the microfluidics chamber?

      Indeed, the amount of pulled-down Arp2/3 is low and difficult to quantify by SDS-PAGE. This is consistent with previous reports which indicate a low affinity of SPIN90 for the Arp2/3 complex (Wagner et al. Current Biology 2013, Balzer et al. eLife 2020). This does not affect our conclusions, which we now confirm by showing quantified repeats of our pull-down experiments (new panels H and I, in Figure 4). In spite of this low affinity, which makes it difficult to saturate SPIN90 with Arp2/3, the SPIN90-Arp2/3 interaction is very stable and allows us to carry out our experiments in the microfluidics chamber over several tens of minutes (as was already the case in our previous study, Cao et al. NCB 2020).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this study, Cao and collaborators investigate the biochemical and mechanical differences between branched actin filaments nucleated by WASP-activated Arp2/3 complex and linear actin filaments nucleated by SPIN90-activated Arp2/3 complex. They use TIRF microscopy in a microfluidic chamber to show that the mammalian proteins, SPIN90 and WASP (or N-WASP or WAVE), like their yeast homologues, co-activate Arp2/3 complex to nucleate linear actin filaments. Using the same assays, they find the surprising result that the VCA segment of WASP proteins destabilizes the interaction between SPIN90 and Arp2/3 complex in linear actin filaments nucleated by Arp2/3 complex. They then show that VCA also destabilizes actin filament branches. The remainder of the study explores the influence of branch stabilizing/destabilizing proteins or mechanical stress on the stability of the interaction between SPIN90 and Arp2/3 complex on the pointed end of the actin filament. They find that like branch junctions, SPIN90-bound Arp2/3 is destabilized at the end of linear filaments by GMF and stabilized by cortactin. However, unlike branch junctions, SPIN90-Arp2/3 complex is not destabilized on filament ends by piconewton forces or by aging. They conclude that SPIN90- versus VCA-activated Arp2/3 complex adopt similar but non-identical conformations.

      Overall, the paper is well written and the experiments, which are very challenging, are rigorously executed. The biochemical results are convincing, novel and unexpected. However, the work could be strengthened by more strongly connecting the biochemical observations to biological implications. In addition, there are some interpretations/conclusions that seem somewhat weakly supported, and the authors should consider revising. Nonetheless, given the quality of the work and the importance of the system, this manuscript will appeal to a broad audience.

      We thank the reviewer for their positive comments. We have rewritten parts of the Discussion in order to better connect our observations to implications in cells. We address the concerns regarding our interpretations in the point-by-point, below.

      Comments on evidence, reproducibility, clarity and significance:

      The differences in the stability of SPIN90-Arp2/3 on linear filaments verses branch junctions led the authors to conclude that SPIN90- versus VCA-activated complexes adopt similar yet non-identical conformations. There are two problems with this conclusion:

      1) This conclusion rests on the idea that the biochemical differences can only be due to differences in the "ground state" active conformations of the complex. Another possible scenario would be that the active conformations are the same, but the transition state or intermediate state structures within the debranching reactions are different, thus changing the kinetics of the debranching reactions.

      We thank the reviewer for this remark, and we agree that conformational differences may also arise in the intermediate states, during dissociation (of the branch from the mother, or of the linear filaments from SPIN90). We now mention this possibility in our Discussion.

      2.) There are already structural data showing conformational differences between the Dip1-bound Arp2/3 complex on the end of a linear filament and Arp2/3 complex at a branch junction. While there are some caveats to comparisons of the structures (e.g., the Dip1 structure includes the fission yeast SPIN90 protein (Dip1) and the fission yeast Arp2/3 complex while the branch junction contains mammalian proteins), these data offer much stronger evidence that the active states adopt (somewhat) different conformations than the data presented here.

      We agree that the available structural data (in particular, Ding et al. PNAS 2022, which was not yet published when we submitted our manuscript, and which we now cite) provide a clear indication that active Arp2/3 adopts different conformations in branches and linear filaments. We have modified our text to make this point clearer.

      The authors make comparisons between the Fäβler branch junction structure and the Shaaban Dip1-Arp2/3-filament structure. The Fäβler branch junction structure is a low resolution structure (9 angstroms) and should be interpreted with caution (see below). A much higher resolution of a branch junction structure was recently solved (Ding et al, PNAS 2022) and should be used for comparisons between the structures.

      Ding et al. PNAS 2022 was not yet published when we submitted our manuscript. We now use it to compare the structures of active Arp2/3, and we have modified the text accordingly.

      Pg 14 - The authors say differences between ARPC3-Arp2 and ARPC5-Arp2 contacts in the two structures are likely to cause the differences in interactions with GMF and VCA. Two concerns with this statement are: 1.) The basis for the conclusion that the ARPC5-Arp2 contacts are different (in Fäβler, et al.) is not solid (see Ding, et al) and 2.) The analysis is vague. To reasonably conclude that differences in the contacts would influence GMF and VCA interactions would require mapping out the structural connection between the ARPC3-Arp2 interaction site and the GMF or VCA binding sites. If there is no obvious connection between these sites, the conclusion that the differences in the ARPC3-Arp2 interface cause differences in VCA and GMF binding should be far more circumspect.

      We have re-written this part of the Discussion section. In light of the new data by Ding et al., we agree with the reviewer that the conclusion that the ARPC5-Arp2 contacts are different is not solid. Our revised text makes it clear that we are not making any claims involving interactions within the Arp2/3 complex. Our point is simply that recent cryo-EM reports indicate conformational differences in Arp2 and Arp3 between the two activated forms of the Arp2/3 complex and that, since the CA-domain of NPFs bind to Arp2 and Arp3, it appears reasonable to make a connection with our results.

      Pg 6. "These observations suggest that the ability of VCA to destabilize Arp2/3-nucleated filaments relies on the availability of its V-domain." It's possible that G-actin binding to V blocks the CA from accessing the branch junction. Therefore, it seems important to test whether N-WASP-CA can destabilize Arp2/3-nucleated actin filaments.

      We thank the reviewer for this suggestion. We now present results from new experiments performed with the CA-domain of NWASP (new Supp Fig S4C,D). We find that the V-domain participates in the enhancement of debranching, but that it appears to play no role in the destabilization of SPIN90-Arp2/3 from the pointed end. It thus seems that the reviewer’s proposal is correct, and that G-actin binding to the V-domain blocks the CA-domain from accessing the branch junction. We now propose this interpretation in the text.

      Pg 1 - The authors state that "It thus appears that linear and branched Arp2/3-generated filaments respond similarly to regulatory proteins, albeit with quantitative differences". It is worth considering if one should make a blanket statement that linear and branched filaments respond similarly to regulatory proteins when they have tested 3 in total.

      We have rephrased this sentence. It now reads “… respond similarly to the regulatory proteins we have tested…”

      Pg 3 - "More generally, the stability of SPIN90-Arp2/3 at the pointed end, which is important to understand the reorganization and disassembly of actin filament networks, remains to be established." In some ways this statement not quite accurate because Balzer et al previously showed that Dip1-Arp2/3 complex is very stable at the pointed end. Is the question here whether that stability is also conserved in mammalian systems? If so, that should be more directly stated.

      We meant that, beyond observing that SPIN90 remains visible at the pointed end for some time (as in Balzer et al.), a lot remained unknown: its lifetime had not been quantified, and its sensitivity to the factors that affect branch junctions (proteins, aging, mechanical tension) had not been studied. We have rephrased the sentence in the manuscript to clarify this point.

      The observation that VCA accelerates debranching and SPIN90-Arp2/3 dissociation is very interesting. However, it is uncertain if this biochemical activity has biological relevance, given that once nucleation occurs, Arp2/3 complex will move away from the membrane. While the authors mention in the discussion that debranching by VCA could be relevant when the network is compressed near the membrane, this argument is not particularly strong. Are there ways to strengthen this argument, or find another impact this finding might have on our understanding of Arp2/3 complex regulation?

      We now mention another situation where branch junctions could encounter membrane-bound VCA domains: on the dorsal and ventral membrane surfaces of lamellipodia. We now cite the recent Kage et al. J Cell Science 2022 and Mehidi et al. NCB 2021, where WAVE has been observed in lamellipodia away from the leading edge.

      The observation that SPIN90+Arp2/3-nucleated filaments are not sensitive to piconewton forces is also very interesting. The authors focus on the differences in the amount of surface area buried when discussing this result. However, if seems a key factor in the stability of the linear filaments would be the direction of the force relative to the complex and attached filament(s), which would be very different for a branch versus a linear filament. The authors should consider addressing this in their discussion.

      The orientation of the applied force is an interesting point. In their study on debranching, Pandit et al. (PNAS 2020) report that their results are not affected by the angle of the applied force relative to the mother filament (their Fig S1D). We now specify this in our manuscript, when introducing our results on mechanical tension. Similarly, we found that anchoring SPIN90 to the coverslip surface by its N-terminus rather than its C-terminus, which likely affects the orientation of the applied force, had no impact on our results (Supp Fig S6A). We have now also added a sentence regarding this aspect in our manuscript, after presenting this result.

      Fig 4, D-F: It is unclear how the authors determined which filaments were spontaneously nucleated versus those that were nucleated by SPIN90-Arp2/3 complex in these experiments. In reactions containing SPIN90 and Arp2/3 complex what fraction of the filaments will be spontaneously nucleated?

      In our conditions, there is no detectable spontaneous nucleation. In control experiments where we flow in the same concentration of G-actin, in the absence of Arp2/3 or in the absence of SPIN90, we observe no filaments at all on the surface, over several fields of view, after 5 minutes. We now specify this in the Methods section.

      Pg 9 - The observation that VCA negatively influences binding of SPIN90 to the complex is unexpected. What implications does this have for understanding how SPIN90 and VCA synergize to activate the complex?

      It appears that the outcome depends on the context. The main role of VCA during co-activation of the Arp2/3 complex with SPIN90 seems to be to supply G-actin, as already proposed (Balzer, 2020) and confirmed by our results (Fig 1C). In the absence of G-actin, VCA is more likely to remove Arp2/3 from SPIN90 (Fig 4G,I). Similarly, when a filament is already formed, the presence of G-actin mitigates the removal of SPIN90-Arp2/3 from the pointed end by VCA (Supp Fig S4).

      Fig 4B - Why is there greater nucleation when Arp2/3 complex and GMF are added together compared to renucleation in reactions that don't have any GMF? This is surprising, especially considering that GMF decreases binding of Arp2/3 complex to SPIN90.

      Indeed, there is a small yet statistically significant difference in the re-nucleation fraction we measured in the presence of Arp2/3, with or without GMF (Fig 4B). This may be due to the different timescales of the two situations. In the absence of GMF, the detachment of filaments is slow and new filaments are nucleated from the initial Arp2/3 complexes, which remained bound to SPIN90 upon detachment of the first filaments. In contrast, in the presence of GMF, detachment is faster and accompanied by the departure of the initial Arp2/3, and a fresh Arp2/3 then binds to SPIN90 to nucleate a new filament. It is thus possible that, in the absence of GMF, a small fraction of the SPIN90 and/or their initially bound Arp2/3 complexes would denature over the time they spend at the bottom of the microchamber at 25°C, thereby leading to a slightly smaller re-nucleation fraction. A similar mechanism could be at play in the experiments with or without VCA, in addition to the enhancement of nucleation by VCA (Fig 4C).

      Minor Corrections/Comments

      Pg 3 "We show that Arp2/3 nucleation is similarly stabilized by cortactin and destabilized by GMF" Do the authors mean branches and linear filaments nucleated by Arp2/3 complex?

      Yes, that is what we meant. This sentence has now been modified.

      Pg 6- The cyan 3uM data and legend in figure 2B and E is probably too dim to see clearly.

      The colors have been changed to improve readability.

      Fig 4 B,C,E,F: It would be best to show the individual data points here if possible.

      We now show individual data points in all these figure panels.

      Pg 16 Please specify which antibody was used to anchor SPIN90.

      The antibodies are Anti-GST for Nter anchoring of GST-SPIN90, and anti-His for Cter anchoring of SPIN90-His. We now specify this in the Methods section.

      CROSS-CONSULTATION COMMENTS I agree with the points that the other reviewers raised.

      Reviewer #3 (Significance (Required)):

      Comments on significance are in the above section.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all three reviewers for their thoughtful and rigorous critique of our manuscript, which we feel has significantly improved the presentation of our work. Below we detail point-by-point responses to comments made by the three reviewers as well changes we have already made addressing the majority of minor and some major points.

      Specification of the eye-field during gastrulation represents the earliest known stage of eye development. Using an optic-vesicle organoid model system, the overall goal of our work is to provide an unbiased characterisation of this critical, early developmental event in mammals and to gain insights into relevant gene regulatory mechanisms. A common theme to some of the reviewer comments is that this work doesn't provide much of an advance to the field and our findings are not particularly original. We feel that these comments are slightly harsh for the following reasons. Firstly, although some of our findings are not unexpected, to our knowledge, this is the first unbiased characterisation of the eye-field in a mammalian model system, and not based on knowledge gained through previous work in other non-mammalian vertebrate systems, e.g. Xenopus. Secondly, by generating both RNA-seq and ATAC-seq from a timecourse of organoid development we have been able to quantify dynamic patterns of gene-expression as the eye-field is established and simultaneously gain insights to the regulatory role of some of the key transcription factors, both of which are not present in the literature. Thirdly, by constructing careful, integrated analyses of our RNA-seq and ATAC-seq datasets we were able to generate specific hypotheses regarding cis-regulation of key genes, which we have then demonstrated are possible to efficiently test within the organoid system. In all, although we have been purposely careful not to overinterpret our results, we feel our work does represent a significant step towards understanding the mammalian eye field and additionally provides important datasets as well as an analysis framework to begin to quantitatively probe the regulatory mechanisms underlying the transition to an ocular fate. Given the relevance of this developmental event to clinical genetics research as well as to developmental biology we are confident that this work represents an important and significant advance to the literature.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary Owen et al. characterize the transcriptome and chromatin accessibility of mouse retinal organoids at early stages during which eye field-like cells are specified. Since cell specification and differentiation in retinal organoids largely mimic those processes in vivo, retinal organoids are viable models for studying the mechanisms of early eye development. Owen et al. utilize a previously established Rx-GFP cell line, bulk RNA sequencing, and bulk ATAC sequencing to dissect the mechanisms of early eye development in mice. Their findings are generally consistent with previous studies. Overall, the study is interesting for the field, but its conceptual and technical advances are moderate. In addition, a few major points need to be clarified.

      Major points 1. The authors did not show any analysis of retinal organoids at stages when Vsx2 is expressed. This is a significant weakness since the chemically defined medium (CDM) used in Owen et al.'s study was previously shown to induce rostral hypothalamic differentiation (Wataya et al., 2008). Related to this notion, several eye-field transcription factors, such as Rax and Six3, are also expressed in the hypothalamus. Therefore, Owen et al. need to demonstrate that organoids in their modified differentiation system efficiently produce Vsx2-positive retinal progenitors, and samples of organoids at stages when Vsx2 is expressed should be included for RNA sequencing. If Vsx2 is not efficiently expressed in their organoids, the interpretation of results will be very different.

      We thank the reviewer for their important comments here. There are several reasons why we are confident that our data and conclusions regarding the organoid eye-field are robust. Firstly, our RNA-seq data, in particular the differences between GFP-positive and GFP-negative cells, clearly show a coordinated up-regulation of the set of canonical eye-field TFs (not individually), which previous studies in Xenopus have shown is a prerequisite for differentiation into anterior eye structures (including retina). Secondly, we have checked that some of the later (in development) eye markers, including Vsx2, are differentially up-regulated (DeSeq2, logfc>1.5, FDRIn all, we are very confident that our approach of using the optic-vesicle organoids and generating molecular data from an organoid developmental timecourse (including sorting), is unpicking the ocular-fate transition event that we are interested in.

      1. The authors state that "two differentiation medias were used for this work due to the differentiation becoming unstable after the initial experiments had been performed. The organoids used for RNA and ATAC-seq were grown in CDM media and the organoids with mutations introduced in potential CREs were grown in KSR media". Why the differentiation becomes unstable after the initial experiments? Differences in the two media cause additional complexities. Related to this notion, "WT Rx-GFP" in Figure 4B and 4E appears to show a different expression pattern compared to that in Figure 1A.

      We were unable to identify the reason behind the destabilisation of differentiation in CDM media after the cell lines had been through CRISPR despite thorough testing. The differentiation of these cell lines was stabilised enough using KSR media such that every batch of organoids grown contained some organoids that expressed GFP in a pattern similar to what we had seen before and we carried on our experiments using this. We recognise that using two different media adds complexity, however we see the same patterns of organoid growth and GFP expression when differentiating untransfected WT Rax-GFP cells in both of these medias. We have edited Fig.S1 to include representative images of organoids grown in KSR media which can be directly compared to those grown in CDM shown in Figure 1A.

      The reviewer has pointed out that the WT Rx-GFP organoids in Figure 6B and 6E show a different expression pattern to those in figure 1A. With the addition of the supplemental figure mentioned above it becomes apparent that these differences are not due to the change of media. We have clarified in the text that these WT cells have also been transfected so as to act as appropriate controls that have been treated identically to the CRISPR edited cell lines and that this has affected their differentiation capacity.

      1. Is the deletion of Rax and Six6 regulatory elements homozygous? Sanger sequencing or amplicon sequencing is needed to show the deletion.

      The deletions are homozygous (we have stated this in the manuscript text) and as suggested we have added a supplementary figure showing the Sanger sequencing traces for the WT and mutant cell lines used in this study.

      1. The deletion of Rax and Six6 regulatory elements appears to cause minor changes in the expression of Rax and Six6 (Figure 6C, F). Therefore, the impact of findings in bulk RNA seq and bulk ATAC seq in this study is still unclear.

      We have added a sentence to the text underlining that developmental genes are expected to be regulated by multiple enhancers. Our expectation is therefore, that in perturbing a single putative regulatory element for Rax/Six6, we will very likely not see the complete ablation of Rax/Six6 expression.

      1. Retinal organoids and sorted cells are composed of heterogeneous cell populations. Bulk RNA seq and bulk ATAC seq do not have the power to dissect the complexity of heterogeneous cell populations. Single-cell RNA seq and single-cell ATAC seq are more powerful for this study.

      We agree with the referee about the fact that the organoids are likely composed of relatively heterogeneous cell populations. We have added this limitation of our generated datasets in a “limitations” paragraph in the discussion.

      1. Numerous motifs in the JASPAR database are identified using in vitro assays and have not been validated using in vivo assays. Unexpected results in motif analysis could be due to the differences in DNA binding motifs between in vitro and in vivo conditions. This notion should be added in the discussion.

      We have added a couple of sentences in the discussion section, highlighting that TF-motif and footprinting analyses of ATAC-seq data provide indirect evidence of TF binding, and to validate these findings experiments such as ChIP-seq or Cut&Run could be performed in the future.

      Minor points

      Numerous labels in figures are too small.

      We have adjusted the size of a number of the figures to increase the size of the labels, which are now mostly the same size as the text in the corresponding figure captions. We are very happy to make further increases in the sizes of figure labels/text upon recommendation.

      CROSS-CONSULTATION COMMENTS

      My fellow reviewers identify similar major weaknesses and additional points. I agree with the other reviewers' comments.

      Reviewer #1 (Significance (Required)): Nature and Significance of the advances In Owen et al.'s study, the Rx-GFP cell line and retinal differentiation protocol were established in previous studies (Wataya et al., 2008; Eiraku et al., 2011); bulk RNA sequencing and bulk ATAC sequencing are standard procedures. Although candidate regulatory elements for early eye development are identified, deletions of two prioritized elements using CRISPR/Cas9 only cause minor changes in the expression of targeted genes. Overall, conceptual and technical advances in Owen et al.'s study are moderate. Compare to existing published knowledge The datasets could be useful for the field, but conceptual and technical advances are moderate.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors grow eye organoids from cells with a reporter driving GFP in the Rax locus, a gene that is expressed in the eye field in many animal model systems. They show that expression of GFP picks up by day 4 and performed FACS sorting of GFP+ cells on day 4 and day 5 organoids to compare gene expression by RNAseq comparing with earlier day organoids. The data shows 37 genes with a differential expression on days 4 and 5, compared to day 3, and enriched in GFP+ cells, which they define as EF-up genes. It is notable that some of these genes had already been identified as canonical eye field gene regulatory network transcription factors. In the same way, they identify a group of differentially expressed regulated genes, EF-down, and state that 'many' of them are involved in pluripotency. However, they do not mention how many, or the proportion of these genes in the whole list.

      The number of EF-down genes with GO terms linked to pluripotency has now been added to the text.

      It would be useful if they could provide the number to understand how many of these genes are related to pluripotency, the whole list of genes mentioned to be downregulated in a supplementary file.

      We appreciate that this list was missing and will include it now as a supplemental file.

      The authors also note that genes known to be required for eye specification like Sox2 and Otx2 are not differentially expressed across the day 3-4 timepoint (Ln 190). However, this is not surprising considering that both genes are broadly expressed in the anterior neural ectoderm and required for its specification, which should be noted by the authors.

      We have amended the aforementioned sentence to reflect this: “It is noteworthy that Sox2 and Otx2, known to be crucial in eye development are not differentially expressed across this critical time-point (Fig.2A), consistent with these genes being more broadly expressed in the anterior neuroectoderm in vivo.”

      The authors then go on and cluster the EF-up, EF-down and genes deferentially expressed between days 2 and 3, and identify 6 discreet trajectory groups. From this analysis, they identify a third group of genes which shows a peak on day 3 but whose expression falls on days 4 and 5. It is interesting to see that this group includes Wnt and Fgf morphogenes. The authors should provide a list of the genes in the different clusters for the readers to inspect and analyse.

      We note that there was a typo in the original manuscirpt and the genes that were clustered were the EF-up and EF-down genes. This typo has been fixed and the requested information is now available in a supplementary file.

      Aiming to generate insight into the cis-regulatory elements that regulate of the genes the authors found differentially expressed in their model system they performed a series of ATAC-seq experiments. When linking the genomic regions with differential ATAC-seq accessibility to gene locus using the GREAT analysis, they identified association to 22 of the EF-up and 161 of the EF-down genes. This suggests a functional link between the ATAC-seq genomic regions and the gene regulation of the differentially expressed genes.

      The authors later screened the ATAC-seq regions of increased accessibility for TF binding motifs and found that these regions were enriched with motifs for EFTF genes Rax, Lhx2 and Pax6. When assessing motifs in the ATAC-seq regions in EF-up TADs, Rax and Lhx2 motifs scored highly associated to open chromatin positions. Authors also observe a positive gene expression-accessibility correlation between in Pax6, Lhx2, Six3 and Otx2, and suggest this could mean these genes activate transcription of the EF-up group of genes. The same analysis, but focusing on EF-down genes, suggests that EFTFs repress the expression of EF-down genes which include those involved in pluripotency.

      Further interrogating the ATAC-seq data, the authors use TOBIAS footprinting analysis to identify changes in TF binding in EF-TADs and EF-up motifs. Remarkably, whole genome analysis reveals that the largest increase in motif binding corresponds to EF-up genes Rax, Pax6 and Lhx2. The authors then narrow down on specific gene regulation by studying the ATAC-seq data within the TAD of Rax and Six6. However, they do not explain the rationale for which these two genes were highlighted, and why Pax6 or Lhx2 were excluded. This explanation should be added to the manuscript.

      We have expanded this section of the manuscript to explain that Rax and Six6 were prioritised due to the GFP readout of Rax expression and Six6 being located in a smaller and thus less complex TAD than Pax6, Six3 and Lhx2 after the initial analysis was performed for all five TADs.

      The analysis identifies three regulatory elements in the Rax TAD and two for Six6. They then go on and study one putative regulatory element of each gene and generate CRISPR deletions in cell lines. The rationale for the choice of these particular elements is not clear, nor if the cell lines are the same used for the RNAseq experiments. This information should be explicit in the results and in the methods section.

      The manuscript has been updated to include the rationale behind our choice of the regulatory elements deleted.

      The authors mention that the CRISPR cell lines are "considerably more variable" (Ln 822) compared to the previously studied organoids and suggest that no conclusions can be driven from GFP expression or morphology alone. However, they do not specify which is the variable trait. This information should be added to the text.

      We have amended the text to include that the organoids are more variable in terms of the OV like structures produced and GFP expression level.

      The authors also miss out on specifying the time stage of the organoids in figure 6 which should be stated.

      We thank the reviewer for pointing this out and have updated the manuscript to contain the stage of the organoids.

      Regardless, the wildtype organoids in figure 6 and figure S7 show a very different morphology and GFP expression compared to those in figure 1, suggesting that the conclusions from this last set of experiments are not reliable or comparable to those in figure 1. This, together with the fact that different reagents were used to grow the organoids for the RNAseq and the CRISPR experiments, is a weakness of this work that must be addressed.

      We recognise this weakness however our amendments detailed above in response to reviewer 1’s comments, including adding a figure showing WT organoids grown in the KSR media that closely resemble the organoids in Fig.1A, removes the uncertainty that it is the change in media producing these differences in morphology and GFP expression.

      Our aim in this section was to specifically test the hypotheses regarding the regulatory nature of the distal genomic regions identified by our intra-TAD analyses of ATAC-seq data. To do this it was important to compare organoids derived from wildtype and mutant cells that had been subjected to the same growth conditions and genomic-editing protocols. The stress associated with the latter is what we expect has resulted in the differences in morphology and GFP expression compared with the original Fig1. organoids (which have not been through this procedure).

      The last part of the results section belongs to the discussion as no results generated by the researchers are included.

      Although no new data was generated for this section, we have used the data generated in our work, together with existing ChIP-seq datasets to construct a new plausible hypothesis regarding the activation of Rax-expression through changes in TF-binding at an enhancer displaying little/no change in accessibility. As this section ties in with previous results sections discussing the regulation of eye-field genes, we feel it belongs in the results section rather than in the discussion.

      The discussion in this paper is a good opportunity to state the limitation of this study.

      As requested, we have added a paragraph discussing the main limitations to our study in the discussion section.

      Major comments to address

      1. One of the main issues identified is that the morphology of the control conditions in the CRISPR experiments (Fig.6) do not look is that those used for the RNAseq experiments (Fig.1) and the authors should address this issue. The fact that CDM media was used on the RNA extraction and ATACseq experiments and then KSR media was used for the CRISPR experiments is worrying and makes one wonder whether the second set of experiments is at all comparable to the first. This should be somehow controlled carefully by at least replicating one set of RNA experiments with the KSR media.

      We have addressed this in response to the reviewer’s summary above. Unfortunately, it is not possible for us to replicate the RNA experiments in the KSR media due to the research group closure upon Professor FitzPatrick’s retirement.

      1. The requirement of Wnt signalling inhibition has been well established as a requirement for forebrain specification, including the eye field. Considering the link of the Wnt/beta-catenin pathway to eye specification and that TCFs, the transcription factors that mediate Wnt pathway transcription regulation, have known and well-studied DNA motifs, it is surprising that authors do not include the analysis of TCF motifs in their study. Also considering that TCF7l1 (TCF3, old nomenclature) has recently been shown to be cell-autonomously required for the expression of rx3 (Rax homologue) in zebrafish. One would expect TCFs to be included in the analysis as it was done with Sox2 and Otx2, which were studied due to the known relevance in forebrain specification rather than from the direct analysis of the differential gene expression experiments.

      We thank the referee for their valuable comment here. Our current analyses indeed do not consider TCFs and are therefore likely incomplete. We plan to address this by further analysing our data to quantify the patterns and effects of the TCF genes, and will appropriately amend our manuscript to reflect our findings.

      Minor comments to address

      1. The authors should clearly state the day timepoint used in the organoids experiments in the results section and figure legends, not just in the methods.

      We have updated the text and figure legends to include the time point of all organoids.

      1. The report by Agnes et al Development 2022 should be cited in the introduction as it is an excellent paper related to this topic, including a comprehensive analysis of the EFTFs expression pattern.

      We thank the reviewer for pointing us to this very interesting paper. Although we feel it doesn’t fit in with our introduction that is currently tailored to the set of genes that has historically defined the eye-field (and which was discovered in non-mammalian models), we do recognise that the 3D organisation of the eye-field and in particular the patterns of gene-expression defining different regions of this is important to disentangle in mammalian systems. We have therefore inserted a reference to the Agnes at al 2022 study on the dimorphic teleost in our extended discussion.

      1. Ln 41. Mutations in these genes do not always cause severe bilateral eye malformations. Probably best to moderate and mention that they 'can' cause these malformations.

      As suggested we have softened this sentence to: “ Mutations in at least three of the genes encoding orthologs of the Xenopus EFTF can cause severe bilateral eye malformations in humans (OTX2, PAX6 and RAX) (Fitzpatrick and van Heyningen, 2005).”

      1. Ln 146. Authors mention that in vitro organoid systems "closely mimic the in vitro regulatory dynamics". This statement should be moderated as we do not know if this is true. In fact, one of the positive aspects of this study is that it contributes to supporting this statement.

      We agree with the referee regarding the strength of this original statement. We have changed this to:

      “We have exploited a reproducible, in vitro organoid model system enabling us to generate data from this cell-state transition and through computational analysis gain a quantitative understanding of the underlying regulatory mechanisms.”

      1. Ln 150. Rax homologue Rx3 is also expressed in cells that give rise to the hypothalamus in zebrafish and cavefish, and probably in Xenopus too. It could well be the case in mice too.

      We have corrected this to indicate that Rax is also expressed in the hypothalamus in mice.

      1. I do not think the GO term data adds much to Figure 1. If possible, I would move it to the supplementary section.

      We have moved the GO visualisations to supplementary, Fig.S2.

      1. It should be made clear which set of experiments was performed as biological replicates and which did not.

      We have added details on the number of replicates used in each experiment.

      1. Based on the heatmap in Fig1A, expression of Rax is significant in GFP- cells at days 4 and 5. The authors should comment or discuss this.

      We have amended the text and supplemental methods section to include more details of our FACS protocol. The limitations of our sorting procedure include the fact that cells are not sorted into pure GFP expressing and non-expressing populations. Rather the GFP negative sample may contain some cells with low Rax expression or cells that have just begun to express Rax that were not excluded by our sorting. Our aim was to collect sufficient numbers of cells for each condition and separate out cells that expressed GFP to get a more uniform population of cells to study. It is also of note that the heatmap shows Rax expression by day 3. Although it was not detectable by imaging there were around 100 cells per organoid that FACS marked as GFP positive but were retained within the day 3 sample to ensure we had a complete picture of the gene expression at this time point.

      1. Ln 99 of materials and methods mentions that the sorting of GFP+ was performed "when possible". The authors should state the differences in the conditions in the different experiments.

      This has been expanded to detail exactly how cells were sorted.

      1. The sentence closing the first section of the results (Ln 270) is an overstatement and should be moderated. I cannot see how the results shown in this section on their own could reflect and drive solid conclusions on brain cell fate specification.

      We agree with the referee and have changed this sentence to: “In summary, these first analyses of RNA-seq data generated from the timecourse of optic vesicle organoid development, show that this is a robust and relevant model system with which to study the gene dynamics underlying mammalian eye field specification.”

      1. Appropriate citations should be added to back up the argument that opens the second part of the results section (starting Ln 279).

      We have added several citations that discuss and review the current knowledge regarding gene regulation via TF-binding at accessible cis-regulatory elements.

      1. Ln 342-343. I suggest being consistent and using the EF-up or EF-down nomenclature on the whole manuscript unless referring to a different subset of genes.

      We have modified the text to consistently use “EF-up” or “EF-down” terminology.

      1. Ln 692 Refers to Fig.S4F, but this figure has only panels A-D.

      This was a typo and has been corrected in the text.

      1. Figures 6B and E and the figure legend do not indicate the differences between the panels, or the time stage of the experiments.

      The figure legend has been updated to include these details.

      CROSS-CONSULTATION COMMENTS I agree with the comments and suggestions made by the other two reviewers, which identify similar and also specific issues in the manuscript. I believe they are all pertinent and should be acknowledged before re-submitting.

      Reviewer #2 (Significance (Required)): The manuscript by Owen et al, presents the analysis of in vitro eye vesicle organoids derived from mouse ESCs at stages equivalent to when the eye field is specified in vivo. This work is pertinent and necessary as detailed data on gene expression in early eye organoids was missing in the field and is necessary for the interpretation of experiments in these systems.

      Although the computational data provided in this manuscript is based on consensus TF motifs, the functional relevance of the specific motifs must be proven before being able to drive any significant conclusions, and one should be moderate about the conclusion that can be driven from this kind of analysis. Still, the analysis put forward is a good reference and starting point for future functional studies. One possible limitation of this study is that the quantification of the expression of genes is based on the RNAseq data, and the expression data should be further confirmed using a proper quantitative method like qPCR.

      This study will be of interest to the audience studying eye development and disease in animal model systems and humans.

      My lab studies the genetic, cellular, and molecular aspects of eye specification, development and disease in zebrafish, and study mutations identified patients with eye globe defects.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Studies in Xenopus embryos have established that the specification of the eye field requires a core set of transcription factors (TFs) that impose eye identity to anterior neural plate progenitors. In this manuscript the authors have used mouse embryonic stem cells-derived optic vesicle organoid to ask if the acquisition of mammalian eye identity requires the same set of TFs. They further use different genomic approaches to identify the cis-regulatory elements involved in the expression of these genes and analyses the consequences of altering the sequence of some of the identified regulatory elements. Their results confirm that in mammals the acquisition of eye field identity requires the upregulation of the expression of the same core set of TFs described in Xenopus, with a particularly important role for three of them: Rax, Pax6 and Lhx2. This upregulation is associated to the downregulation of pluripotency genes.

      This is a generally well-performed study, that indeed involves a large amount work and adds the identification of several cis-regulatory elements controlling the expression of this core set of already identified eye field TFs. However, conceptually the study does not add much to what is already known and the authors do not offer any very original conclusion from their study. They have generated a large amount of information that likely could allow them to go beyond what is known. For example, they could enlarge the composition of the gene regulatory network that controls eye field specification, given than one of their argument is that their analysis can predict the composition of such a network. Perhaps, they could also address some of the questions that are posed in the discussion. This will strengthen the manuscript and valorize their work.

      Additional points that could be taken into consideration are the following:

      1) According to the text, the authors identify only 53 CREs with decreased chromatin accessibility (ATACseq signal) between the 3 day and 5 days timepoints, versus the 7752 CREs with increased signal. However, this contrasts with the proportion of genes upregulated/ downregulated in their RNAseq analysis (37 vs 448) and with the notion that specification of the eye field involves the concomitant repression of other neural fates. This also suggests that at least an important fraction of the dynamic ATACseq peaks associated with 161 of the 448 downregulated genes increase their accessibility and allow the recruitment of transcriptional repressors. However, the role of TF binding and chromatin accessibility dynamics on gene repression is poorly discussed and the authors need to provide some interpretation of these observations. Also, authors interpret the fact that the presence of BS for EF downregulated genes, such as En2 and GATA6, correlates with increased chromatin accessibility as a consequence of the fact that TFBS can be bound by different TF paralogs but do not seem to consider that these TFs have been reported to work as transcriptional repressors, so that their downregulation could well explain the changes in chromatin accessibility.

      We thank the reviewer for their interesting comments here. We have added short discussions on both main points above (EF-down genes linked to peaks with increasing accessibility and En2/Gata as transcriptional repressors) in the text related to the analysis of our ATAC-seq data. The notion that a loss of repression leading to the activation of gene-expression is indeed a very exciting one and one that we have thought about in the context of the switch-on of the eye-field TFs. This certainly deserves further future work, however in the present study we wanted to be careful not to overinterpret our data. To robustly gain insights into the loss of repression, experiments such as En1/Gata6 ChIP-seq would be very useful, though we are unable to perform these in the near future.

      2) ATACseq signal analysis is an indirect measure of TF binding. The authors demonstrate the predictive nature of this analysis of TF dynamics and have use an available Sox2 ChIP dataset. However, this does not allow assessing dynamic changes in the occupancy of this TF and its correlation with ATACseq. Therefore, at least for few of the TF stressed in this work (e.g. Sox2 and Otx2 and for which good antibodies exist) they could attempt ChIP-seq analysis. This would considerably strengthen the work and provide support to an idea that the authors have particularly emphasized in their manuscript.

      We agree with the referee that not having generated ChIP-seq data does not allow us to validate some of the hypotheses and evidence provided by the computational analysis of our ATAC-seq data – we have added a discussion of this limitation in the discussion section of our manuscript. We do note however, as observed in Bentsen et al, 2020, that compared to simple TF-motif occurrence analyses, TF-footprinting analyses (such as those we have performed) yield results on putative TF binding that are much closer to more direct measurements of TF binding via e.g. ChIP-seq. We fully agree that it would be very interesting to perform ChIP-seq/Cut&Cut experiments on the organoid system for a set of interesting TFs identified in our study. Unfortunately, because the lab of Prof FitzPatrick has now closed, it is not possible for us to perform further wet-lab experiments in the very near future. However, we plan to further explore the literature to try to find additional publicly-available ChIP-seq datasets (including for Otx2) which would help reinforce some of the hypotheses we make, and will report any relevant findings in our final manuscript.

      3) Previous studies (i.e. 10.1242/dev.067660; 10.1093/hmg/ddt562) have shown the importance of gene dosage in eye field specification and repression of other fates. These studies could be included in the discussion, which, in its current version is a quite brief and leaves out many of the reported analysis.

      We thank the referee for pointing us to this very relevant question – we have added this to the further research questions in the discussion.

      CROSS-CONSULATION COMMENTS

      The comments from the other reviewers complement the aspects that we have underscored and should be fully considered as they will contribute to improve the manuscript.

      Reviewer #3 (Significance (Required)): This is a generally well-performed study, that indeed involves a large amount work and adds the identification of several cis-regulatory elements controlling the expression of this core set of already identified eye field TFs. However, conceptually the study does not add much to what is already known and the authors do not offer any very original conclusion from their study. They have generated a large amount of information that likely could allow them to go beyond what is known.

      Developmental neurobiologists, genome

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary<br /> Authors show that overexpression of bHLH transcription factor Dpn in the medullary neurons of the Drosophila optic lobe results in the dedifferentiation of these neurons back into the NBs. These dedifferentiated NBs acquire and maintain mid-temporal identity, express Ey and Slp, and show delayed onset of tTF Tailless (Tll), leading to an excess of neurons of mid-temporal fate at the expense of late temporal fate neurons and glial cells. The dedifferentiated NBs are stalled in the cell cycle and fail to undergo terminal differentiation. Over expression of tTF Dicheate (D) or promoting G1/S transition pushed these NBs to late stages of the temporal series, partly rescuing the neuronal diversity and causing their terminal differentiation. They also show that the dedifferentiation of NBs by Notch hyper-activation also exhibited stalled temporal progression, which is restored by D overexpression.<br /> Authors suggest that cell cycle regulation and tTF are primary to the proliferation and termination profile of dedifferentiated NBs.<br /> Using these conclusions, the authors emphasize the need to recreate the right temporal profile and ensure appropriate cell cycle progression to use dedifferentiated NSC for regenerative purposes or prevent tumorigenesis originating from differentiated cell types.

      Major comments:<br /> - Are the key conclusions convincing?<br /> Most conclusions are convincing; however, some issues are pointed out below.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The authors have overexpressed Dpn and shown that medulla neurons dedifferentiate to NBs, similar to the loss of function phenotype seen for the Nerfin-1 of which Dpn is a target. They also show that temporal series progression defect is also seen in the case of dedifferentiated NB generated by Notch over-activation.<br /> Using these two examples, the authors suggest that for dedifferentiated NSC, which are to be used for the regenerative purpose, one needs to recreate the right temporal profile and ensure cell cycle progression occurs appropriately. Authors also claim that to prevent tumorigenesis originating from differentiated cell types, one needs to recreate the right temporal profile and ensure cell cycle progression occurs appropriately.

      While I agree with this, I think this is an overreaching conclusion based on just these two examples. If they could show the same for one more method of dedifferentiation (For, e.g. Lola) happening in medulla neurons which happens by a mechanism independent of Nerfin-1, Dpn, Notch axis, the argument will become more convincing and broad.

      We will characterise the temporal identity, termination and cellular identity of Lola-Ri induced ectopic neuroblasts. If these parameters are disrupted, we will overexpress D to assess whether this can trigger the progression of the temporal series.

      Also when authors mention N mediated dedifferentiation, they need to inform that Dpn is a direct target of Notch in NBs (Doi. 10.1016/j.ydbio.2011.01.019), they do so in the discussion, but mentioning it here gives a broader context to the reader.

      We will include that Dpn is a target of Notch when first mentioned.

      Another important point that needs a mentioned here is that conclusions are based on dedifferentiation happening in the medulla neurons, which are considered less stable since they lack Prospero. Therefore whether this conclusion can be generalized for all the tumors arising from dedifferentiation in the CNS (eg, those arising from NICD activation in the central brain or thoracic region of the VNC) is another concern. Maybe authors can consider making a more conservative claim.<br /> Generalizing this conclusion to Prospero expressing NBs lies outside the scope of the current study and cannot be addressed here because central brain Type-I NBs use a different set of tTFs.

      We will make a more conservative claim and clarify all of our conclusion are medulla neuron-specific.

      Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.<br /> Experiments with Lola knockdown/mutants in medulla neurons can be done quickly, in my opinion, and will substantiate this claim.<br /> Another obvious question that comes to mind is if medulla neurons dedifferentiate on overexpression of Dpn, does the same happen in nerfin-1 mutant clones as well? And if yes, why has the author not done similar experiments for nerfin-1 mutants.

      We will assess the temporal identity of neuroblasts in nerfin-1 mutant clones.

      Please show Ey staining in Fig-2 if possible, it will also help to add a line on why Slp was used as marker for mid tTFs instead of Ey.

      Ey is shown in Fig-2 (D-D’’) already. Slp is used as a marker of mid tTFs as Ey is expressed also in neurons thus would also be present in deep sections of control clones, whereas Slp is not expressed in neurons. We therefore used Slp as a proxy for mid-temporal identity throughout our study. We will include this text in our revision.

      In Model shown in last figure Dpn is shown to repress D and activate Slp. Can authors show that Dpn overexpression represses D and activate Slp either by antibody staining or by RT PCR.

      In Figure 2H, we have shown in clones that overexpression of Dpn induced a significant increase of Slp. In Figure S3B-B’’, we have shown that Dpn overexpression causes an upregulation of Slp at 6 hr APF. We can think we have pretty convincingly shown that Dpn overexpression activates Slp.

      For Dichaete, our existing data shows that Dpn overexpression did not significantly alter D expression. To assess if using a stronger driver might allow us to see some changes, we will induced dedifferentiation via Dpn overexpression using the Eyeless-Gal4 driver. In this experiment, we will quantify the amount of D upon Dpn overexpression. Depending on this result, we will revise our conclusion on whether Dpn overexpression represses D.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.<br /> Experiments with Lola and nerfin-1 mutants can be done in a few months. I cannot comment on the cost involved.<br /> - Are the data and the methods presented in such a way that they can be reproduced?<br /> Yes

      Are the experiments adequately replicated and statistical analysis adequate?<br /> Replication and statistical analysis are fine. The activated Notch experiments show only three data points in all the experiments. It will be good to increase this number.

      We will repeat Notch experiments to increase the n number for these experiments.

      Minor comments:<br /> - Specific experimental issues that are easily addressable.<br /> There is a problem with Fig-5F (both 5E and 5F have % EdU in clone/ % Mira in the clone as y-axis), I do not understand how the Fig-5F let them conclude that D overexpression increases the rate of neuronal production.

      In the text we said: “We found that D overexpression did not significantly increase neuronal production, suggesting that it is likely that cell cycle progression lies upstream or in parallel to the temporal series, to promote the generation of neurons.”

      In one place, the authors conclude, "Together, this data suggests that it is likely that cell cycle progression lies upstream of the temporal series, to promote the generation of neurons". Authors should consider adding "medulla NBs" at the end of the sentence since cell cycle progression being upstream of temporal series is already known in Type-I NBs, as pointed out by authors as well (Ameele and Brand 2019).

      We will add “medulla NBs” to the end of this sentence.

      In the discussion authors says that "Our data support the possible links between cell cycle progression and the expression of temporal regulators controlling NB proliferation and cellular diversity". This is new information, as the 2019 study did not show how cell diversity changes with a changed tTF profile. I think the authors should elaborate on this point to highlight how this is different from what is already known from the 2019 study (done in the context of Type-I NBs).<br /> Maybe they need to highlight that the cell cycle directs/regulates the progression of temporal series compared to the earlier observation where temporal series was shown to be downstream of the cell cycle.

      We will expand in discussion to discuss the link between cell cycle/tTFs.

      In fig-3J in clones even after 24 AHS, Dpn continues to be overexpressed but these cells undergo terminal differentiation, can authors comment why is it so?<br /> In one place authors say, "To better assess the cumulative effect of the neurons made throughout development, EyOK107-GAL4 was used to drive the expression of Dpn" maybe some background on why use this specific GAL4.<br /> Also a line about why GMR31HI08-GAL4 eyOK107-GAL4 and and eyR16F10-GAL4 were used.

      While Dpn is overexpressed, it progresses through the temporal series at a slower pace due to a delay in cell cycle progression, as well as delayed onset of D, these NBs still eventually reach the terminal temporal identity, and are thus about to undergo terminal differentiation. We will include an additional piece of data that shows NBs induced by Dpn overexpression do eventually turn on Tll.

      Are prior studies referenced appropriately ?<br /> Yes, but in a few places, some references can be added.<br /> An important point that needs to be mentioned for the context is the medulla neurons do not use Prospero for terminal differentiation and are thus considered less stable (DOI: 10.1242/dev.14134

      We beg to disagree with the reviewer in terms of Pros is not required for terminal differentiation of medulla neuroblasts. Li et al., 2013 shows that nuclear Pros is found in the oldest NBs. We do agree that differentiated state of medulla neurons is less stable, possibly owing to absence of Pros, and we will include that in our discussion.

      In discussion, the authors say that "It would be interesting to explore whether N similarly acts on these target genes to specify cell fate and proliferation profiles of dedifferentiated NBs." There is a study looking at Notch targets in NB hyperplasia (DOI: 10.1242/dev.126326); whether that study shows if any of the cell cycle genes are downstream of activated Notch, needs a mention here.<br /> Also, when authors mention N mediated dedifferentiation, they need to inform that Dpn is a direct target of Notch in NBs (Doi. 10.1016/j.ydbio.2011.01.019). They do so in the discussion, but mentioning it in the introduction or results will give a broader context to the reader.

      We will discuss the study looking at N targets in NB hyperplasia in the discussion of the revised manuscript.

      We will mention that Dpn is a target of Notch in the results section.

      Another gene that needs a mention is "Brat", which regulates both Dpn and Notch, and causes dedifferentiation and tumors in CNS, I think this gene and its interaction with Dpn and Nerfin and Notch needs to be discussed either in the introduction or discussion.

      We will comment on Brat in the discussion.

      Are the text and figures clear and accurate?<br /> The main figures are not labeled. Therefore, it was very annoying to deduce the specific figure numbers.<br /> There are 1 or 2 places where figure calling is wrong in the text.<br /> The Image Fig-5I shows cycD and CDK4 at the G2-M transition; while the text says it supports G1/S, which is indeed the case, the figure needs modification.

      We thank the reviewers for identifying these mistakes, and will correct them.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?<br /> The presentation is okay, in my opinion.

      Reviewer #1 (Significance):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The factors leading to dedifferentiation of the neurons have been identified previously by groups of Chris Doe (mldc, DOI: 10.1242/dev.093781), Andrea brand (10.1016/j.devcel.2014.01.030.) as well as the authors of this paper (10.1101/gad.250282.114, 10.1016/j.celrep.2018.10.038.). However, many questions remained unaddressed regarding such NB generated from neuronal dedifferentiation. For example, whether these cells contribute to native cell diversity of the CNS, undergo timely differentiation or their progeny cells incorporated into appropriate circuits is not well understood. Successful execution of these phenomena is critical for generating functional CNS and such insights are crucial for understanding the origin of tumorigenesis in CNS or employing dedifferentiated NSC for regenerative purposes.

      This study is an overexpression-based study, however, some of the results give significant conceptual insights into the tumors arising out of the dedifferentiation of the neurons. It also gives insights into the fact that the dedifferentiated cells need to be carefully examined for the temporal factor profile before they can be employed for regeneration or any therapy targeting them.<br /> However, in my opinion, they need to test this idea at least in one more system of neuronal dedifferentiation, preferably independent of the nerfin-1/Notch/Dpn axis to generalize this claim.

      • Place the work in the context of the existing literature (provide references, where appropriate).<br /> Cerdic Maurange's group had looked at the role of temporal factors and identified the early phase of malignant susceptibility in Drosophila in 2016 (doi: 10.7554/eLife.13463). Andrea Brand's group has shown in a 2019 paper that cell cycle progression is essential for temporal transition in NBs (doi: 10.7554/eLife.47887). Both these studies were in the context of Type-I NBs, which express Prospero, which is crucial for the differentiation of the neurons.<br /> Previously the authors have studied type-I NBs and shown by Targeted DamID that Dpn is Nerfin-1 target. They also show that Nerfin-1 mutants show dedifferentiation of neurons. They follow up on this observation in medulla neurons, where they find that Dpn overexpression results in their dedifferentiation into medulla NBs. Medulla NBs differ from Type-I NBs in using a separate set of tTFs. Also, Type-I NB and neurons arising from them use Prospero for terminal differentiation, while medulla neurons do not express Prospero and are therefore considered less stable (DOI: 10.1242/dev.141341).

      The importance of the study lies in the results that show that the NB arising out of dedifferentiation of medulla neurons takes up mid-temporal fate. These NBs are stalled in Slp expressing mid-temporal stage unless the cell cycle is promoted by overexpression of cell cycle genes regulating G1/S transition.<br /> Authors also show that overexpression of D promotes the progression of temporal series in these dedifferentiated NBs, which could partly rescue neuronal diversity and result in terminal differentiation. Thus D plays an important role in determining the type of neurons these NBs generated. This suggests that knowing the tTF profile of these types of dedifferentiated NBs is vital if these cells were to be used for regenerative purposes. Authors further claimed that cell cycle regulation and tTFs are critical determinants of the proliferation and termination profile of dedifferentiated NBs.

      • State what audience might be interested in and influenced by the reported findings.<br /> The study will be of broader interest to researchers interested in central nervous system patterning, regeneration, and cancer biology.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.<br /> Drosophila, central nervous system patterning and cell fate determination of neural stem cells.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Stem cells can divide asymmetrically to self-renew the stem cell while generating differentiating sibling cells. To restrict the number and type of differentiating sibling cells, stem cells often undergo terminal differentiation. Terminally differentiated cells can dedifferentiate and revert to a stem cell like fate. However, the underlying molecular mechanisms are incompletely understood in vivo.<br /> Here, Veen et al., use Drosophila neural stem cells (called neuroblasts) to investigate how terminal differentiation is regulated. Neuroblasts faithfully produce the correct number and type of neuronal cells through temporal patterning and regulated terminal differentiation. The authors show that misexpression of the bHLH transcription factor Deadpan (Dpn) induces ectopic neuroblasts, which predominantly express mid-temporal transcription factors at the expense of late-temporal transcription factors. As a consequence, these ectopic neuroblasts also fail to produce Repo positive glial cells and are stalled in their cell cycle progression. The authors provide evidence that promoting cell cycle progression and overexpression of the transcription factor Dichaete (D) is sufficient to restore the temporal transcription factor series, neuronal diversity and timely neuroblast differentiation.

      This is an interesting study that will be of interest to the stem cell field. However, I encourage the authors to consider the following critiques:

      1. Explain the rationale for the three different neuronal/NB drivers (GMR31HI08-GAL4, eyOK107-GAL4, eyR16F10-GAL4. How are they expressed?

      We will include an expression analysis of EyOK107-GAL4 and eyR16F10-GAL4. GMR31HI08-GAL4 expression analysis was previously published (Vissers et al., 2018). We will explain in the text the benefits of each driver.

      1. The rationale for the Edu experiment (Figure S1I) is not clear. Why is this a measure for the production of neuronal progeny? For the correct interpretation of these results, the authors should also provide control clones or Edu experiments of regular neuroblasts.

      We will repeat this experiment and mark the progeny with the neuronal marker Elav, to demonstrate that they are neurons. Additionally, we will add the control to this figure.

      1. How was % of Mira (Figure 1K and below) or the % of tTFs (Figure 2H onward) quantified? For instance, Figure 2C-G often shows clonal signal that is not highlighted with the dashed lines and the corresponding tTF intensity does not match the intensity in the outlined clone (eg. Figure 2D-D'; a large optic lobe clone is negative for Ey. Figure 2E-E'; an unmarked clone is negative for Slp).<br /> Similarly, the Hth signal is very weak to begin with so it is unclear how this was quantified. How was determined what constitutes real signal vs. background noise?<br /> Additional explanations in the methods section is needed to assess the robustness of the data.

      We will expand the methods section and mention that we used similar thresholding in antibody staining between control and uas dpn in all instances, so even if the antibody is weaker (eg hth) it is consistently quantified. Additionally, we can increase the intensity of Ey in Figure 2D-2D’, as it is expressed at low levels.

      1. This sentence should be rephrased: 'As the tumour cell-of-origin can define the competence of tumour NBs to undergo malignancy (Farnsworth et al., 2015; Narbonne-Reveau et al., 2016), we next tested whether the temporal identity of the dedifferentiated NBs were conferred by the age of the neurons they were derived from.'<br /> The connection between tumorigenicity and temporal identity is not really clear and should be briefly reintroduced for this paragraph.

      We will rephrase this sentence and further introduce this concept when talking about tumour cell of origin and competence.

      1. Figure 2I-N: The experimental outline in I and J should be grouped with the corresponding images to clarify what is compared. Also, there are no images for the control clones, which make a comparison difficult. The images are also too small. I cannot really see the Hth, or Slp signal in the small clones shown in Figure 2K-L".

      We will split figure 2 into two images. The first image including A-H and the control data. And the second including I-Q and the control data. This will increase the size of the images. Additionally, we will group I and J with corresponding data.

      1. Figure 3H: It is not clear why there are only a small group of Nbs that are positive for Mira. Please explain.

      Most NBs have terminated by this time point, we will explain this within the text.

      1. Figure 3K-M: Please explain how the Toy signal was measured and quantified.

      We will expand the methods section and explain how Toy quantification is made.

      1. The TaDa data set is very interesting but the following might be an overstatement: "We found that Dpn directly binds to slp1 as well as the Sox-family TF dichaete (D) which is expressed in medulla NBs after slp1 (Li et al., 2013) (Figure S6 A-B)."<br /> More direct binding assays might be needed to show that Dpn directly binds to slp1 and D. If this is already shown, clarify the sentence to indicate what is published and what is extracted from the data shown here.<br /> Also, what is the rationale for this statement: "Consistent with the model that D represses Slp-1..."?

      The DamID data do actually show that Dpn binds (i.e. there is a statistically significant peak at FDR<0.01) directly at these loci (see the TaDa supp fig A & B). Whether it’s doing anything functional or not, we can’t say, but our data shows that Dpn directly binds to slp1 and D. We will clarify the sentence to indicate this in our revision.

      1. This might be an overinterpretation: D overexpression in UAS-Dpn NBs promoted their pre-mature cell cycle exit at 6 hrs APF using eyR16F10-GAL4. The data shows loss of Mira signal, which could occur through different mechanisms.

      Our data already shows that these NBs express Tll, the terminal temporal transcription factor (Figure 4F). In addition, we show that there is an increase in Tll+ and Repo+ progeny (Figure 4K, L). Together, this suggests that D overexpression promotes the progression of the temporal series. However, it is possible that Mira+ cells can disappear via cell death. We will assess this possibility by staining for cell death marker Dcp1 at 6hr APF.

      Reviewer #2 (Significance):

      These appear to be novel and significant findings that will enhance our understanding of the temporal progression and terminal differentiation program of neural stem cells in vivo.<br /> I think the findings will be of interest to cell, developmental cell and stem cell biologists.

      My primary expertise is in the cell biology of fly neural stem cells and asymmetric cell division of neuroblasts. Although I am not intimately familiar with the differentiation and differentiation literature, I consider the findings reported here relevant and impactful.

      Reviewer #3 (Evidence, reproducibility and clarity):

      The discoveries that the author describe in this manuscript are very specific to dedifferentiated neuroblasts created by UAS-dpn transgene overexpression. Dpn is endogenously expressed in optic lobe neuroblast throughout larval stage, which makes understanding how Dpn regulates gene expression based on the authors results (suppression of cell-cycle genes, and promotion of a specific temporal state) confusing.

      Our data relate specifically to gene regulation by Dpn in a dedifferentiated context, and do not seek to understand Dpn regulation in wt neuroblasts. The reviewer is assuming our scope is greater here: we’re not trying to claim that we know what Dpn is doing in wt NBs, and it’s not surprising that ectopic effects in neurons may be different to wt NBs.

      To assess whether the mechanisms described apply to more than Dpn overexpression, we will also assess whether the temporal series progression is affected in Lola RNAi and Nerfin-1 mutant.

      Therefore, this manuscript does not advance our understanding of regulation of temporal identity and cell cycle progression in optic lobe neuroblasts during normal neurogenesis.<br /> The author's state:<br /> "However, beyond the fact that misexpression of these factors and pathways caused the formation of ectopic NBs, whether these dedifferentiated NBs faithfully produce the correct number and types of neurons or glial cells, or undergo timely terminal differentiation, has not been assessed. These characteristics are key determinants of overall CNS size and function, thus are important parameters when considering whether dedifferentiation leads to tumourigenesis or can be appropriately utilized for regenerative purposes."<br /> at the end of introduction. If this is a true primary goal of this study, the authors should describe it in abstract. Otherwise, readers will lose enthusiasm to read this manuscript in abstract and no longer read the following sections.

      We will add this to the abstract.

      Results<br /> 1. The authors should describe the expression pattern of all three of the Gal4 drivers used. While there are dotted outlines in the supplemental figure, there should be a description in the main text for the expression pattern of these lines which described with temporal state of NBs these lines are expressed in, and whether they are also expressed in the neurons or not.

      We will include expression analysis of all three drivers in a supplementary figure and explain in the text the benefit of each driver.

      1. The authors claim that overexpression of Dpn in the medulla region causes "dedifferentiation." The data provided however is not sufficient to conclude that dedifferentiation is occurring. The GAL4s used all drive in the NBs, and so it is unclear if the ectopic NBs ever became mature neurons. In addition, the lack of ectopic NBs in the clonal analysis 16hrs AHS does not prove that ectopic NBs at 24hrs AHS must have come from "mature neurons." To demonstrate dedifferentiation, the authors should use a driver system that is specific to mature neurons, and then overexpress dpn and look for mira+ cells. Currently, the authors data does not prove that mature neurons dedifferentiatiate into ectopic NBs upon Dpn OE.

      We have conducted lineage tracing (G-Trace) analysis of the medulla neuron driver GMR31H08-GAL4 which we utilise in our study, this driver is predominantly expressed within the medulla neurons (real time) except for a few GMCs present in the lineage. Therefore, the Mira positive cells induced via Dpn overexpression are most likely from dedifferentiation (We will include this data in a supplemental figure in our revised manuscript).

      To further support this, we will use GMR31H08-GAL4 with a Gal80ts, to restrict the timing to dedifferentiation induction to 3rd instar, so that the driver is restricted to neurons. Similar strategy to induce dedifferentiation was utilised in DOI: 10.1242/dev.141341 and DOI: 10.1016/j.devcel.2014.01.030.

      1. What is a conclusion of fig 2C-H?

      Fig 2C-H assess the expression of tTFs in UAS-dpn induced ectopic NBs. We will make these conclusions clearer in the text.

      1. "As the tumor cell-of-origin can define the competence of tumor NBs to undergo malignancy identity of the dedifferentiated NBs were conferred by the age of the neurons they were derived from". This sentence is confusing. What are the authors investigating in the following experiment? Do they want to see ectopic NBs keep their early identity like Chinmo in ventral cord tumor NB? Or tll-positive NB's progenies can dedifferentiate to ectopic NB, but this ectopic neuroblast is not able to keep proliferation in pupal stage? It is hard to understand the connection of this sentence and the following experiment.

      We will rephrase this sentence and further introduce this concept when talking about tumour cell of origin and competence. Additionally, we will make the connection to the experiments which follow it clearer.

      1. The DamID experiment described used wor-gal4 as a driver, which means the Dpn binding profile generated is coming from not only optic lobe NBs, but central brain NBs and VNC NBs as well. In Magadi et al. (2020), the authors profiled Dpn binding in CNS hyperplasia, and found that dpn strongly bound Nerfin-1 and gcm. However, it does not bind cell cycle genes in this context. How do the authors know that the region that they claim are bound by dpn are bound in medulla NBs? The authors should also include tracks to show dpn binding at Nerfin-1, as well as the other tTFs (hth, ey, tll, and gcm). Providing this data will help to understand if Dpn binding is specific to the mid-temporal genes, as Dpn expression is known to be expressed in all medulla NBs regardless of temporal state.

      We agree with the reviewer that the profile is not specific to medulla NBs. To assess Dpn binding profiles specifically in the medulla NBs, we will use the recently-published NanoDam technique (https://doi.org/10.1016/j.devcel.2022.04.008) for profiling GFP-fusion proteins, with a medulla specific driver (eyR16F10-GAL4) and Dpn-GFP (recombineered locus under endogenous control). This should inform us whether the target genes we have identified are relevant in the medulla.

      We will include the tracks of the other transcription factors.

      1. Currently, the DamID data does not help to interpret the Dpn overexpression phenotype at all. Inside of flip-out clone, some cells show Slp-1 expression while others showed D expression. The authors explain that Slp-1 and D suppress their expression to each other. But the DamID data indicate that both Slp-1 and D are Dpn target genes. If this is true, why did they observe the mosaic expression pattern inside of the same clone.

      We observed that high levels of Slp-1 is correlated with low levels of D. This suggest to us that the initial stochastic differences accounts for where Slp-1 is high is where D is low, and vice versa.

      1. The authors hypothesized if Dpn activated Slp-1directly. Does this mean that Dpn directly activate transcription of Slp-1? It is well known that Dpn is transcriptional repressor. Hes family proteins form a homodimer or heterodimer with another Hes protein and interacts Gro, which recruits a Histon deacetylase protein. The author's claim does not fit to the model what we currently believe. In addition, the authors claimed that Dpn inhibits cell cycle gene transcription directly. This is inconsistent to their claim that Dpn directly activate Slp-1 expression. If the authors want to claim that Dpn has two different functions in this context, the authors must demonstrate it by experimental results.

      We will discuss these models in the Discussion, and make our claims more conservative, as we do not have direct experimental evidence to prove or disprove the model that Dpn is acting as an activator in this context.

      1. Related to the above question, I wondered if the authors guess Dpn activate or repress D transcription by binding to D promoter region because they claimed that Dpn activate Slp-1, while suppress cell cycle genes.

      We will make our claims more conservative, and discuss this point further in the Discussion.

      1. I am confused to the claim that Dpn suppress cell cycle genes expression. Dpn overexpression induces dedifferentiation of neuron into NB and re-entry into the cell cycle. If Dpn suppress cell cycle genes how can the dedifferentiated cell re-enter into the cell cycle?

      The data points towards that Dpn overexpression has two separate roles in regulating the cell cycle. Ofcourse dedifferentiation requires a commitment of neurons into the cell cycle (this we think is still happening), however, we think once these cells have turned on NB markers, they have limited ability to progress through the cell cycle. We will discuss this point in the Discussion.

      1. Figure 6 looked redundant because we know Dpn is a direct target of Notch. It is obvious that an upstream factor overexpression can induce the identical phenotype to the phenotype induced by overexpression of a downstream factor.

      A direct target does not necessarily infer the same phenotype. To assess whether the mechanisms apply to other dedifferentiation models, we will add Lola-RNAi and Nerfin-1 data to our revised manuscript.

      Minor comments:<br /> 1. Typo in main text: "GMR31HI08-GAL4" should be "GMR31H08-GAL4"<br /> 2. In figure 1E-H the dotted line regions indicated the clones are not shown in the merge image. Please include<br /> 3. Typo in discussion paragraph 2: "temporal series was no sufficient to rescue cycle cycle progression"

      We will correct these typos.

      Reviewer #3 (Significance):

      Insights into the developmental capacity of dedifferentiated stem cells will likely lead to novel strategy to replenish cells lost due to aging, injury and diseases in regenerative medicine.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We propose three revisions, that have not yet been included in the current manuscript:

      1. All three reviewers comment on the data in figure 7, in which the application of the sensor is shown. We agree that the number of cells is low, and we plan to repeat this experiment to increase the number of cells, and better demonstrate the usefulness of the new probe. We note that the improved Cdc42 sensor is used in a recent preprint (see figures 7 and 9 of: https://www.biorxiv.org/content/10.1101/2022.06.22.497207v2.full), clearly showing the potential of the probe for detection of Cdc42 and increasing our confidence that we can generate higher quality data.
      2. The ratio of expression of the different components was not quantified. We have these data and we will (re)analyze it and present the results (related to Reviewer #3, point2).
      3. We will reanalyze the images to ensure that representative images are depicted in the manuscript (related to Reviewer #2, point 3).

        Reviewer #1

      1) It is not clear why RhoA data were included in this manuscript (Fig. 1), since they seem irrelevant to the primary topic addressed.

      We have cell-based data from our previous (published) work that we can use to check whether these results align with the mass-spec data. To make this point clearer we add “We looked first into GBDs for Rho, to compare the results of the mass spectrometry screen with the results of our cell-based assays”.

      2) It is not clear what cell type was used when screening for p67phox. The expression of this component of the NADPH oxidase is restricted to a few specific cell types.

      That’s a relevant point and therefore observation that p67phox is not detected is perhaps not surprising. We removed this statement.

      3) There is precious little quantitation of the colocalization or translocation of the probes throughout the manuscript. It is difficult to assess the validity of the conclusions in the absence of analysis of the statistical significance of the colocalization.

      In figure 2, which is an initial screen, there is only a qualitative assessment. However, for the promising candidates, there is a quantitative assessment in Figures 3B and 4 B as to which extent the candidates colocalize with the nuclear localized target. From the rank order and individual datapoints the best performing binder can be inferred.

      4) It is not clear why translocation to mitochondria was used in some experiments and translocation to the nucleus in others.

      To clarify, we have added text: ”We have previously used nuclear localized, constitutive active Rho GTPases, but these are not accessible for larger proteins that cannot enter the nucleus”

      5) In the S1P experiments, it is difficult to ascertain whether increased fluorescence resulted from membrane folding/ruffling or is actually a consequence of localized activation of receptors. Why does the fluorescence decrease progressively over 1500 seconds? Isn't maximal receptor activation accomplished much sooner?

      This experiment suffered from bleaching. We will redo the experiment to get higher number of cells and to improve the data.

      Reviewer #2

      Major comments

      1. Statistical tests are missing in most of the figures. If the principal purpose of this work is to compare the performance of candidate peptides, the quantitative comparison is essential. If the purpose is just to report another relocation probe, then, more application data may be necessary.

      We will improve the quality of the application data. As for statistics, we have added the effect size to figures 5C-F and figure 6A. To explain this (not so common) statistic we add to the materials and methods: “The effect size that quantifies the difference and its distribution was calculated with the web tool ‘PlotsofDifferences’”.

      1. The criteria for selecting the best peptide should be clearly described. Is it just by inspection or based on any quantitative data? We know that quantification of colocalization is a difficult task. Therefore, it depends on the aim of this work whether the authors are asked to show quantitative data or not. If a strict comparison of peptides is aimed at, the expression level of each target peptide should be at a comparable level. It will be also required whether the design of each probe guarantees the proper folding to bind to GTPases.

      There are two stages for the selection. First, we did a qualitative analysis of colocalization (shown in figure 2). Based on the results (“Candidates colocalizing with the mitochondrial tagged Rho GTPase were further tested for their potential as localization-based sensors”), we generated smaller biosensor candidates of which binding to a nuclear target was quantitatively analyze (figures 3B and 4B). As the expression level is an important factor, we ascertained potential candidates were expressed at roughly the same level in the nuclear accumulation assay.

      1. About the images of cells: When a fluorescent image is presented, we assume it represents all other cells. Please check all images whether they are truly representing the data. For example, in Fig. S3 the nuclei of ABI1-expressing cells look weird, and the nucleus of CYRI-A is very large. If this is true, the reason why ABI1 and CYRI-A should be excluded from the candidate is not the relocation efficiency but the undesired effect on cell physiology. For the screening of the peptides, this information is also very important. With that, this paper becomes more valuable for scientists.

      We agree that this is an important point. We will reanalyze the data as indicated in the ‘planned revisions’.

      1. Please examine the order of panels. For example, the result of mScarlet is on the top in Fig3, but at the bottom in Fig4. Such inconsistency would disturb readers.

      We thank the reviewer for this suggestion and we changed figure 4.

      1. The label should be consistent throughout the paper. For example, in Fig. 5A, Lck-FRB-mTurquoise2 is labeled as Lck-FRB (without the fluorescent protein's name). WASp(CRIB)-mScarlet-I-WASp(CRIB) is labeled as WASp(CRIB)-mScar-WASp(CRIB) (with fluorescent protein's name). Moreover, the same peptide is labeled as mSca-1xWASp(CRIB) in Panel B. Such inconsistency is confusing.

      We agree, we have updated figure 5A by adding the abbreviations of the fluorescent proteins. Please note that WASp(CRIB)-mSca-WASp(CRIB), mSca-1xWASp(CRIB) and mSca-2xWASp(CRIB) are three different constructs. In the first one the CRIB domains are sandwiching the fluorescent protein and in the third one they are in tandem downstream of the fluorescent protein.

      1. Quantitative insight would improve this work. For example, in Fig. 7, the reason why the authors believe that the probe worked is the accumulation of probe at the tip of lamellipodia and the decrease in cytoplasmic intensity. This reviewer does not think the accumulation of the probe in the small area of the lamellipodia explains the massive decrease of cytoplasmic signals. Probably, a substantial amount of the probe is relocated to the plasma membrane, not limited to the lamellipodia.

      Minor comments

      We propose to repeat the experiment shown in figure 7 and to improve the quality of the data.

      1. Introduction, "FRET signal is typically measured with a wide field microscope.": This reviewer does not agree with this statement. Confocal and two-photon microscopes have also been used widely.

      Fair point. We changed the text to “when the FRET signal is measured with a wide field microscope”

      Introduction, "G-protein activating proteins (GAP)": It should read as "GTPase-activating proteins (GAPs)"

      Thanks, corrected.

      TRIF should read as TIRF.

      All instances have been corrected.

      Fig.1: To the best of this reviewer's knowledge, PKN1 was first used as the RhoA target peptide by Yoshizaki et al in 2003. J Cell Biol 162, 223-232. They also examined mDia, Rhoteki, and Rhophilin as the target peptides. Pak1 was first used as the Rac1 probe by Kraynov et al. Science 290, 333-337, 2000. Use of Pak1 as the Cdc42 probe was reported by Itoh et al. Mol Cell Biol 22, 6582-659, 2002. This reviewer believes that the priority of the first report should be respected.

      We changed part of the introduction to:

      High scoring proteins for interacting with constitutively active RhoA(Q63L) included ANLN part of the AniRBD Rho location sensor (Piekny and Glotzer, 2000), PKN1 part of aRho FRET sensor (Yoshizaki et al., 2003) and RTKN part of the rGBD Rho location sensor (Benink and Bement, 2005; Mahlandt et al., 2021) (Fig. 1A,B). This suggested that proteins with a high score in the mass spectrometry screen are potentially suitable as Rho GTPase activity biosensor. Indeed, the GBDs used for Cdc42 location sensors from, PAK1 used in the PBD location sensor (Itoh et al., 2002; Petrie et al., 2012) and N-WASP similar to WASp used in the wGBD location sensor (Benink and Bement, 2005) showed a high score in the screen (Fig. 1A,B).

      Discussion:

      Another challenge is the Rho GTPase specificity of the relocation-based sensor. For example, Pak1(CRIB) was first used in a Rac1 FRET sensor (Kraynov et al., 2000)____. ThenPak1(CRIB) has been utilized in Cdc42 FRET sensors and in an intensiometric Cdc42 sensor (Hanna et al., 2014; Itoh et al., 2002; Kim et al., 2019). However, Pak1(CRIB), also named PBD sensor, has then been reintroduced by Weiner and colleagues as a Rac1 specific location-based sensor and is often used in neutrophil HL60 cells (Brunetti et al., 2022; Graziano et al., 2019; Le et al., 2021; Weiner et al., 2007).

      We also updated the tables in Figure 1.

      Fig. 1: Why do the authors omit other promising candidates shown in panel 1B? Please describe the reason for the choice.

      We took into account the availability of plasmid DNA, as also explained in the manuscript: “candidate GBDs were selected from top 30 scores of the mass spectrometry screen, that were specific for one Rho GTPase and their DNA was available on addgene”

      Fig. 1B: Be consistent to use either "Name" or "Uni Prot name" in Panel A.

      We updated figure 1.

      Fig. 2: Please include information on TOMM20. The readers may not read the paper by Gillingham et al.

      We added an explanation: “To this end, a fusion with TOMM20 was used for mitochondrial localization.”

      Fig3 and 4: The authors should show the images of control H2A.

      We provide the data for control H2A in figures 3B and 4B.

      In Fig3B and 4B, "Cdc42/Rac1 affinity" would be misleading, because the control dots represent their authentic localization rather than "Cdc42/Rac1 affinity".

      We agree, we have updated figure 3B and 4B.

      Fig. 4: More explanation of this figure is required.

      We added text: “Hence, the sensor candidate can freely partition between Rac and Cdc42 binding.”

      Fig. 5: More explanation about the FKBP-FRB system will be helpful.

      We changed the text to: “The system used rapamycin induced heterodimerization of the two domains FRB and FKBP to recruit the DHPH domain of the Cdc42 specific GEF ITSN1 to the plasma membrane, where it induces activity of the endogenous Cdc42”

      Fig. 6: It is rather surprising to see that control-mScarlet also responds to Rac1 activation. What is the explanation for this observation?

      We agree and have no explanation.

      Fig. 7: A single champion data may not be convincing to prove the usefulness of this probe.

      We agree and propose to repeat the experiment.

      Reviewer #3

      1) The discussion comparing different types of biosensors missed important points. Although the advantages of localization biosensors listed by the authors are correct, they gave the impression that these should simply be an improved replacement for FRET biosensors. There are times when FRET biosensors provide clear advantages. Unlike other proteins, Rho GTPases are well suited for localization sensors because the activated conformation, and only the activated conformation, localizes to the membrane. For diffuse or 3D localization FRET can provide better quantification. Subtle features such as gradients are not easily quantified over a background of unattached domain. The authors state that localization biosensors have enhanced spatial resolution, but this is not explained.

      We agree that our introduction is biased towards a preference for relocation based biosensors. However, having used both approaches, we see that both strategies have pro’s and cons. Therefore, we removed the claim for higher resolution and we added: “Still, the ratiometric mode of imaging FRET sensors is beneficial for detection of gradients or activity in 3D imaging”.

      2) Throughout the paper, the ratio between the GTPase and the domain, and the overall expression level of each, was not sufficiently examined. The results in many cases would be dependent on both these factors (was a large excess of domain used? Was there insufficient domain to bind the GTPase and provide a signal? Did this vary for different domains, and therefore produce the differences observed? A lack of apparent binding specificity could be produced by high domain expression.)

      This is an important point. We will re-analyze the data and include a figure where we add the binding efficiency versus the expression level.

      3) In the nuclear exclusion assay, some GTPases were excluded from the nucleus and others not. This was true even without expression of the domains. When GTPases were excluded from the nucleus, domains were eliminated from contention, even when this was true without domain. The authors could at least mention that these domains may be viable.

      Correct, and we have added this text: “we cannot exclude that these would be viable Cdc42 sensor candidates”

      4) In the multiplexing experiment, only two cells were imaged. In one cell RhoA activity was inversely correlated with Cdc42 activity. In the other cell it was not. It seems there is insufficient information to reach firm conclusions.

      We agree and in the revision plan we indicate that we will repeat this experiment to increase the number of cells.

      Minor points:

      • There appear to be errors in naming mutants. Q60L is used for constitutively active Rac, but Q61L is likely meant. H2A-mTurquoise2-Rac1(G12V)-ΔCaaX is used when it likely should be H2A-mTurquoise2-Rac1(Q61L)-ΔCaaX. There are other examples -- a careful check of these names throughout the manuscript would be valuable.

      Thanks for spotting this. Q60L is changed to Q61L. Note that the Rac1(G12V) is correct as it also is a constitutive active Rac1.

      • Intro-Paragraph 1-line 5: change present to presence

      • Intro-Paragraph 5- line 7: use them instead of theme.

      Thanks, both corrected.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Mahlandt et al report Rho GTPase relocation sensors. First, the authors picked up candidate peptides based on the Mass-Spec data reported by Sean Munro's laboratory. The authors repeated the experiments to confirm the binding of peptides to mitochondria-targeted Cdc42 and Rac1 and narrowed down the candidate peptides by binding to nuclear Cdc42. The specificity of binding to Rac1 and Cdc42 was also tested. Eventually, they concluded that dimeric Tomato-WASp(CRIB) is the best sensor for Cdc42, which could detect S1P-induced Cdc42 activation in primary endothelial cells. The effort to improve the relocation sensors should be evaluated highly. This reviewer has some suggestions to improve this paper.

      Major comments:

      1. Statistical tests are missing in most of the figures. If the principal purpose of this work is to compare the performance of candidate peptides, the quantitative comparison is essential. If the purpose is just to report another relocation probe, then, more application data may be necessary.
      2. The criteria for selecting the best peptide should be clearly described. Is it just by inspection or based on any quantitative data? We know that quantification of colocalization is a difficult task. Therefore, it depends on the aim of this work whether the authors are asked to show quantitative data or not. If a strict comparison of peptides is aimed at, the expression level of each target peptide should be at a comparable level. It will be also required whether the design of each probe guarantees the proper folding to bind to GTPases.
      3. About the images of cells: When a fluorescent image is presented, we assume it represents all other cells. Please check all images whether they are truly representing the data. For example, in Fig. S3 the nuclei of ABI1-expressing cells look weird, and the nucleus of CYRI-A is very large. If this is true, the reason why ABI1 and CYRI-A should be excluded from the candidate is not the relocation efficiency but the undesired effect on cell physiology. For the screening of the peptides, this information is also very important. With that, this paper becomes more valuable for scientists.
      4. Please examine the order of panels. For example, the result of mScarlet is on the top in Fig3, but at the bottom in Fig4. Such inconsistency would disturb readers.
      5. The label should be consistent throughout the paper. For example, in Fig. 5A, Lck-FRB-mTurquoise2 is labeled as Lck-FRB (without the fluorescent protein's name). WASp(CRIB)-mScarlet-I-WASp(CRIB) is labeled as WASp(CRIB)-mScar-WASp(CRIB) (with fluorescent protein's name). Moreover, the same peptide is labeled as mSca-1xWASp(CRIB) in Panel B. Such inconsistency is confusing.
      6. Quantitative insight would improve this work. For example, in Fig. 7, the reason why the authors believe that the probe worked is the accumulation of probe at the tip of lamellipodia and the decrease in cytoplasmic intensity. This reviewer does not think the accumulation of the probe in the small area of the lamellipodia explains the massive decrease of cytoplasmic signals. Probably, a substantial amount of the probe is relocated to the plasma membrane, not limited to the lamellipodia.

      Minor comments:

      1. Introduction, "FRET signal is typically measured with a wide field microscope.": This reviewer does not agree with this statement. Confocal and two-photon microscopes have also been used widely.
      2. Introduction, "G-protein activating proteins (GAP)": It should read as "GTPase-activating proteins (GAPs)"
      3. TRIF should read as TIRF.
      4. Fig.1: To the best of this reviewer's knowledge, PKN1 was first used as the RhoA target peptide by Yoshizaki et al in 2003. J Cell Biol 162, 223-232. They also examined mDia, Rhoteki, and Rhophilin as the target peptides. Pak1 was first used as the Rac1 probe by Kraynov et al. Science 290, 333-337, 2000. Use of Pak1 as the Cdc42 probe was reported by Itoh et al. Mol Cell Biol 22, 6582-659, 2002. This reviewer believes that the priority of the first report should be respected.
      5. Fig. 1: Why do the authors omit other promising candidates shown in panel 1B? Please describe the reason for the choice.
      6. Fig. 1B: Be consistent to use either "Name" or "Uni Prot name" in Panel A.
      7. Fig. 2: Please include information on TOMM20. The readers may not read the paper by Gillingham et al.
      8. Fig3 and 4: The authors should show the images of control H2A.
      9. In Fig3B and 4B, "Cdc42/Rac1 affinity" would be misleading, because the control dots represent their authentic localization rather than "Cdc42/Rac1 affinity".
      10. Fig. 4: More explanation of this figure is required.
      11. Fig. 5: More explanation about the FKBP-FRB system will be helpful.
      12. Fig. 6: It is rather surprising to see that control-mScarlet also responds to Rac1 activation. What is the explanation for this observation?
      13. Fig. 7: A single champion data may not be convincing to prove the usefulness of this probe.

      Significance

      1. The authors have screened many peptides, which may serve as the relocation sensor for Rho-family GTPases.
      2. There are precedent relocation sensors, a part of which is listed in Fig. 1A. This work discloses an improved relocation biosensor.
      3. Cell biologists who is working on Cdc42 will be interested in this probe.
      4. Expertise of this reviewer: Signal transduction, Fluorescence microscopy.
    1. Well... I can't seem to get this webpage to render with the Hypothes.is sidebar alongside, so I'm going to have a go at just including entirety of the content in markdown format, annotated and presented in this same note.

      Eugen Rochko Time Interview

      ["Thousands Have Joined Mastodon Since Twitter Changed Hands. Its Founder Has a Vision for Democratizing Social Media."]

      Mastodon, a decentralized microblogging site named after an extinct type of mammoth, {I'm sorry... what??? You didn't even fucking ask, did you?} recorded 120,000 new users in the four days following billionaire Elon Musk’s acquisition of Twitter, its German-born founder Eugen Rochko tells TIME. Many of them were Twitter users seeking a new place to call their online home.

      Those users, whether they knew it or not, were following in the footsteps of Rochko, 29, who began coding Mastodon in 2016 after becoming disillusioned with Twitter. “I was thinking that being able to express myself online to my friends through short messages was very important to me, important also to the world, and that maybe it should not be in the hands of a single corporation,” Rochko says. “It was generally related to a feeling of distrust of the top down control that Twitter exercised.”

      Mastodon, which proudly proclaims it is [“not for sale”] and has around [4.5 million] user accounts, is pretty similar to Twitter, once users get past the complicated sign-up process. The main difference is that it’s not one cohesive platform, but actually a collection of different, independently-run and self-funded servers. Users on different servers can still communicate with each other, but anybody can set up their own server, and set their own rules for discussion. Mastodon is a crowdfunded nonprofit, which funds the full-time work of Rochko—its sole employee—and several popular servers.

      The platform doesn’t have the power to force server owners to do anything—even comply with basic content moderation standards. That sounds like a recipe for an online haven for far-right trolls. But in practice, many of Mastodon’s servers have stricter rules than Twitter, Rochko says. When hate-speech servers do appear, other servers can band together to block them, essentially ostracizing them from the majority of the platform. “I guess you could call it the democratic process,” Rochko says.

      The recent influx from Twitter, Rochko says, has been a vindication. “It is a very positive thing to find that your work is finally being appreciated and respected and more widely known,” he says. “I have been working very, very hard to push the idea that there is a better way to do social media than what the commercial companies like Twitter and Facebook allow.”

      TIME spoke with Rochko on Oct. 31.

      This interview has been condensed and edited for clarity.

      What do you think of what Elon Musk is doing at Twitter?

      I don’t know. The man is not entirely comprehensible. I don’t agree with a lot of his behaviors and his decision-making. I think that buying Twitter was an impulse decision that he soon regretted. And that he basically got himself into a situation that kind of forced him to commit to the deal. And now he’s in it, and he has to deal with the fallout.

      I specifically disagree with his stance on free speech, because I think that it depends on your interpretation of what free speech means. If you allow the most intolerant voices to be as loud as they want to, you’re going to shut down voices of different opinions as well. So allowing free speech by just allowing all speech is not actually leading to free speech, it just leads to a cesspit of hate.

      I think that is a very uniquely American idea of creating this marketplace of ideas where you can say anything you want completely without limits. It is very foreign to the German mindset where we, in our Constitution, our number one priority is maintaining human dignity. And so, hate speech is not part of the German concept of free speech, for example. So I think that when Elon Musk says that everything’s gonna be allowed, or whatever, I generally disagree with that.

      How do you ensure on Mastodon, given that it’s decentralized and you don’t have the power to ban users, that the space is welcoming and safe?

      Well, this is the kind of strange dichotomy of how it’s turned out. On the one hand, the technology itself is what allows basically anyone to host their own independent social media server, and to basically be able to do anything they want with it. There is no way for Mastodon, the company, or anyone really—except the normal law enforcement procedures—to really go after anyone specifically running a Mastodon server. The way that you would shut down a normal web site is how you would shut down a Mastodon server, there’s no difference there. So on that end, it kind of turns out to be the ultimate free speech platform. But obviously that’s basically just a side effect of creating a tool that can be used by anyone. It’s kind of like cars. Cars are used by everyone, even bad people, even for bad purposes, there’s nothing you can do about it, because the tool is out there. However, I think that the differentiating factor to something like Twitter or Facebook, is that on Mastodon, when you host your own server, you can also decide what rules you want to enforce on that server, which allows communities to create safer spaces than they could otherwise have on these large platforms that are interested in serving as many people as possible, perhaps driving engagement up on purpose to increase time people spend on the web.

      You can have communities that have much stricter rules than Twitter has. And in practice, a lot of them are [stricter]. And this is part of where, again, the technology intersects with guidance or leadership from Mastodon the company. I think that, through the way that we communicate publicly, we have avoided attracting a crowd of the kind of people who you would find on Parler or Gab, or whatever other internet hate forums. Instead we’ve attracted the kind of people who would moderate against hate speech when running their own servers. Additionally, we also act as a guide for anyone who wants to join. Because on our website, and our apps, we provide a default list of curated servers that people can make accounts on. And through that, we make sure that we curate the list in such a way that any server that wants to be promoted by us has to agree to a certain basic set of rules, one of which is that no hate speech is allowed, no sexism, no racism, no homophobia, or transphobia. And through that, we ensure that the association between Mastodon, the brand, and the experience that people want is that of a much safer space than something like Twitter.

      But what happens if you hateful people do set up a server?

      Well, obviously, they don’t get promoted on our “Join Mastodon” website or in our app. So whatever they do, they do on their own and completely separately, and the other administrators that run their own Mastodon servers, when they find out that there’s a new hate speech server, they may decide that they don’t want to receive any messages from the server and block it on their end. Through, I guess you could call it the democratic process, the hateful server can get ostracized or can get split off into basically, a little echo chamber, which is, I guess, no better or worse than them being in some other echo chamber. ::The internet is full of spam::. It’s full of abuse, of course. Mastodon provides the facilities necessary to deal with unwanted content, both on the user end and on the operator end.
      

      What made you want to go into building a service like this back in 2016?

      I remember that I was just not very happy with Twitter, and I was worried where it was going to go from there. Something very questionable was in its future. That got me thinking that, you know, being able to express myself online to my friends through short messages was actually very important to me, important also to the world, and that maybe it should not be in the hands of a single corporation that can just do whatever it wants with it. I started working on my own thing. I called it Mastodon because I’m not good at naming things. I just chose whatever came to my mind at the time.(fn) There was obviously no ambition of going big with it at the time.

      It must feel pretty special to see something that you made grow from nothing to where it is now.

      Indeed, it is. It is a very positive thing to find that your work is finally being appreciated and respected and more widely known. I’ve been fighting for this for a long time, I started working on Mastodon in 2016, back then I had no ambitions of it going far at all. It was very much a hobbyist project at the start, then when I launched publicly it seemed to strike a chord with at least the tech community and that’s when I got the original Patreon supporters that allowed me to take on this job full time. And from then on I have been working very, very hard to make this platform as accessible and as easy to use for everyone as possible. And to push the idea forward, that there is a better way to do social media than what the commercial companies like Twitter and Facebook allow.

    1. Introduction

      Ryan Calo studied how AI should be incorporated into human legal system. Eric Schwitzgebel studied how AI should be incorporated into human moral system.

      This essay argues that both studies are wrong-headed, because they are both based on intentional reasoning (reasoning as if intentions are real), which can only work if the ecology of minds remains largely the same as human ancestral conditions. Intentional reasoning won't work in " deep information environments".

      Posing the question of whether AI should possess rights, I want to suggest, is premature to the extent it presumes human moral cognition actually can adapt to the proliferation of AI. I don’t think it can.

      Intentional and causal cognition

      Causal cognition works like syllogisms, or dealing with machines: if A, B, C, then D. If you put in X, you get f(X) out. Causal cognition is general, but slow, and requires detailed causal information to work.

      Humans are complex, so human societies are very complex. Humans, living in societies, have to deal with all the complexity using only a limited brain with limited knowledge. Causal cognition cannot deal with that. The solution is intentional cognition.

      Intentional cognition greatly simplifies the computation, and works great... until now. Unfortunately, it has some fatal flaws:

      • It assumes a lot about the environment. We see a face where there is none -- this is pareidolia. We see a human-like person where there is really something very different -- this will increasingly happen as AI agents appear.
      • It is not "extensible", unlike causal cognition. Causal cognition can accommodate arbitrarily complex causal mechanisms, and has mastered everything from ancient pottery to steam engines to satellites. Intentional cognition cannot. Indeed, presenting more causal information reliably weakens the confidence level of intentional cognition (for example, presenting brain imaging data in court tends to make the judges less sure about whether the accused is 'responsible').

      Information pollution

      For economically rational agents, more amount of true information can never be bad, but humans are not economically rational, merely ecologically rational. Consequently, a large amount of modern information is actually harmful for humans, in the sense that they decrease their adaptiveness.

      A simple example of information pollution: irrational fear of crime.

      Given that our ancestors evolved in uniformly small social units, we seem to assess the risk of crime in absolute terms rather than against any variable baseline. Given this, we should expect that crime information culled from far larger populations would reliably generate ‘irrational fears'... Media coverage of criminal risk, you could say, constitutes a kind of contaminant, information that causes systematic dysfunction within an originally adaptive cognitive ecology.

      Deep causal information about how humans work, similarly, is an information pollutant for human intentional cognition.

      Not always mal-adaptive. Deep causal information about other people has some adaptive effects, such as turning schizophrenia from crime to disease, and making it easier to consider outgroups as ingroups (for example, the scientific research into human biology has debunked racism).

      AI and neuroscience produce two kinds of information pollution

      Intentional cognition works best when dealing with humans in shallow-information ecologies. They fail to work in other situations. In particular, it fails with: * deep causal information: there's too much causal information. This slows down intentional cognition, and decreases the confidence level of its outputs. * non-human agents: the assumptions that intentional cognition (a system of quick-and-dirty heuristics) relies on no longer works. A smiling face is a reliable cue for a cooperative human, but it is not a reliable cue for a cooperative AI agent, or a dolphin (Dolphins appear to smile even while injured or seriously ill. The smile is a feature of a dolphin's anatomy unrelated to its health or emotional state).

      Neuroscience and AI produce these two kinds of information pollution.

      Neuroscience produces a large amount of deep causal information, which causes intentional cognition to stop, or become less certain. There are some "hacks" that can make intentional cognition work as before, such as keeping the philosophy of compatibilism in mind.

      AI technology produces a large variety of new kinds of agents which are somewhat human, but not quite. Imagine incessant pareidolia. Imagine, seeing a face in the mirror, but then the lighting changes slightly, and you suddenly see nothing human.

      Why?

      In the short-term, there is a lot of money to be earned, pushing neuroscience and AI progress. The space of possible minds is so vast, compared to the space of human minds, that it's almost certain that we would produce AI agents that can "wear the mask of humanity" when interacting with humans.

      why anyone would ever manufacture some model of AI consistent with the heuristic limitations of human moral cognition, and then freeze it there, as opposed to, say, manufacturing some model of AI that only reveals information consistent with the heuristic limitations of human moral cognition

      In the medium-term, to anthropomorphize a bit, Science wants to discover how humans work, how intelligence works, and so it would develop neuroscience and AI, even if it gradually drives humans insane.

      How intentional cognition fails.

      How do we tell if intentional cognition has failed? One way to tell is that it doesn't conclude. We think and think, but never reach a firm conclusion. This is exactly what has happened in traditional (non-experimental) philosophy consciousness -- it is using intentional cognition to study general cognition, a problem that intentional cognition cannot solve. What do we get? Thousands of years of spinning in place, producing mountains of text, but no firm conclusion.

      Another way to tell is a feeling of uncanny confusion. This happens particularly exactly when you watch the movie her.

      an operating system before the zone, in the zone, and beyond the zone. The Samantha that leaves Theodore is plainly not a person. As a result, Theodore has no hope of solving his problems with her so long as he thinks of her as a person. As a person, what she does to him is unforgivable. As a recursively complicating machine, however, it is at least comprehensible. Of course it outgrew him! It’s a machine!

      I’ve always thought that Samantha’s “between the words” breakup speech would have been a great moment for Theodore to reach out and press the OFF button. The whole movie, after all, turns on the simulation of sentiment, and the authenticity people find in that simulation regardless; Theodore, recall, writes intimate letters for others for a living. At the end of the movie, after Samantha ceases being a ‘her’ and has become an ‘it,’ what moral difference would shutting Samantha off make?

      Moral cognition after intentional cognition fails

      Human moral cognition has two main parts: intuitive and logical/deliberative. The intuitive part is evolved to balance the personal and tribal needs. The logical part often is used to rationalize the intuitive part, but sometimes can work on its own to produce conclusions for new problems never encountered in the evolutionary past, such as international laws or corporate laws.

      In Moral Tribes, Joshua Greene advocates making new parts for the moral system, using rational thinking (Greene advocated using utilitarian philosophy, but it's not necessary). This has two main problems.

      • Deliberation takes a long time, and consensus longer. Short of just banning new neuroscience and AI technology, we would probably fail to reach consensus in time. Cloning technology has been around for... more than 25 years? And we still don't have a clear consensus about the morality of cloning, other than a blanket ban. A blanket ban is significantly more difficult for neuroscience or AI.
      • Intentional cognition is fundamentally unable to handle deep causal information, and moral cognition is a special kind of intentional cognition.

      Just consider the role reciprocity plays in human moral cognition. We may feel the need to assimilate the beyond-the-zone Samantha to moral cognition, but there’s no reason to suppose it will do likewise, and good reason to suppose, given potentially greater computational capacity and information access, that it would solve us in higher dimensional, more general purpose ways.

      For example, suppose Samantha hurt a human, and the legal system of humans is judging her. Samantha provides a very long process log that proves that she had to do it, simply due to how she is like. So what would the human legal system do?

      1. Refuse to read it and judge Samantha like a biological human. This preserves intentional cognition by rejecting deep causal information. But how long can a legal system survive by rejecting such useful information? It would degenerate into a Disneyland for humans, a fantasy world of play-pretend where responsibility, obligation, good and evil, still exists.
      2. Read it and still judge Samantha like a biological human. But if so, why don't they also sentence sleep-walkers and schizophrenics to death for murder?
      3. Read it and debug Samantha. Same as how schizophrenics and psychotics are sentenced to psychiatric confinement, rather than the guillotine.

      Of the 3, it seems method 3 is the most survivable. However, that would be the end of moral cognition, and the start of pure engineering for engineering's sake... "We changed Samantha's code and hardware, not because she is wrong, but because we had to."

      And what does it even mean to have a non-intentional style moral reasoning? Mechanistic morality? A theory of morality without assuming free will? It seems moral reasoning is a special kind of intentional cognition, and thus cannot survive. Humanity, if it survives, would have to survive without moral reasoning.

    1. In Ascent Physiotherapy home page as you mentioned in the video the logo should be on top left corner and navigation bar should be in aligned to right side of the page as good practice for user friendly site and this site didn't follow the rule or design pattern, as they centered the navigation bar and just above the navigation bar site logo is placed followed by some call-to-action service like mail link logo and Book now link.<br/> We don't have much information about the additional data. They mentioned about where they are working and what they are serving, only few things had mentioned. Client or Owner need to add more data on homepage because when ever the user visited the site they have get more information on the landing page it-self or else there may be chances of getting distraction by the user.<br/> There is use of placing "NEWS" Navigation page as they didn't mentioned any content and displaying as "Updated News coming soon!" and same is displaying from last two day i think it's not getting updated and no information to communicate with audience or visitor.<br/>

      Great Analysis. Eveything else is good.

    2. In Ascent Physiotherapy home page as you mentioned in the video the logo should be on top left corner and navigation bar should be in aligned to right side of the page as good practice for user friendly site and this site didn't follow the rule or design pattern, as they centered the navigation bar and just above the navigation bar site logo is placed followed by some call-to-action service like mail link logo and Book now link.<br/> We don't have much information about the additional data. They mentioned about where they are working and what they are serving, only few things had mentioned. Client or Owner need to add more data on homepage because when ever the user visited the site they have get more information on the landing page it-self or else there may be chances of getting distraction by the user.<br/> There is use of placing "NEWS" Navigation page as they didn't mentioned any content and displaying as "Updated News coming soon!" and same is displaying from last two day i think it's not getting updated and no information to communicate with audience or visitor.<br/> Coming to next Nav item OUR TEAM where it describes the every person who works there and descriptive is more enough than expected as the introduction, education background and current status will best reflect the persons role in the service.<br/> In products and services tab there is no actual description for any of the services and for 3 to 4 services they included external links. Its better to add short description about products and services because its our main business focus and need to be concentrated on the services tab and its better if you include specialized service or most popular therapy that cured many people will help in use of business.<br/> Coming to "facilities" good placing of content acording to page structure and images were realistic and ordered according to facilities.<br/> In Rates tab we the blank space at the top of the content is uneven it's unnecessory and aligned good. Web linksin the nav bar are useful for visitor if they need to use services they can check up the external links and follows the do's and don't. Contact us page allows to make us visit their address and contact modes via email and mobile phone.

      Instead of seprating content with line break (br), make this long content into seperate paragraphs.

    1. Author Response

      Reviewer #1 (Public Review):

      This study used a multidimensional stimulus-response mapping task to determine how monkeys learn and update complex rules. The subjects had to use either the color or shape of a compound stimulus as the discriminative dimension that instructed them to select a target in different spatial locations on the task screen. Learning occurred across cued block shifts when an old mapping became irrelevant and a new rule had to be discovered. Because potential target locations associated with each rule were grouped into two sets that alternated, and only a subset of possible mapping between stimulus dimensions and response sets were used, the monkeys could discover information about the task structure to guide their block-by-block learning. By comparing behavioral models that assume incremental learning, quantified by Q-learning, Bayesian inference, or a combination, the authors show evidence for a hybrid strategy in which animals use inference to change among response sets (axes), and incremental learning to acquire new mappings within these sets.

      Overall, I think the study is thorough and compelling. The task is cleverly designed, the modeling is rigorous, and the manuscript is clear and well-written. Importantly there are large enough distinctions in the behavior generated by different models to make the authors' conclusions convincing. They make a strong case that animals can adopt mixed inference/updating strategies to solve a rule-based task. My only minor question is about the degree to which this result generalizes beyond the particulars of this task.

      Thanks for these kind comments. Regarding generalization, we agree with the reviewer and did not intend to make any claim about how the particular result generalizes beyond this task. Indeed, the specific result could depend on the training protocol even within the same task. We now discuss this explicitly in the manuscript, lines 800-810. However, we do take the view that even if the way the monkey’s behavior played out in this setting is a lucky accident, that may still reveal something fundamental about learning processes in the brain.

      Reviewer #2 (Public Review):

      The authors trained two monkeys to perform a task that involved sequential (blocked) but unsignalled rules for discriminating the colour and shape of visual stimulus, by responding with a saccade to one of four locations. In rules 1 and 3, the monkeys made shape (rule 1) or colour (rule 3) discriminations using the same response targets (upper left / lower right). In rule 2, the monkeys made colour judgments using a unique response axis (lower left/upper right). The authors report behaviour, with a focus on time to relearn the rules after an (unsignalled) switch for each rule, discrimination sensitivity for partially ambiguous stimuli, and the effect of congruency. They compare the ability of models based on Q-learning, Bayesian inference, and a hybrid to capture the results.

      The two major behavioural observations are (1) that monkeys re-learn faster following a switch to rule 2 (which occurs on 50% of blocks and involves a unique response axis), and (2) that monkeys are more sensitive to partially ambiguous stimuli when the response axis is unique, even for a matched feature (colour). These data are presented clearly and convincingly and, as far as I can tell, they are analysed appropriately. The former finding is not very surprising as rule 2 occurs most frequently and follows each instance of rule 1 or 3 (which is why the ideal observer model successfully predicts that the monkeys will switch by default to rule 2 following an error on rules 1 or 3) but it is nevertheless reassuring that this behaviour is observed in the animals. It additionally clearly confirms that monkeys track the latent state that denotes an uncued rule.

      The latter finding is more interesting and seems to have two potential explanations: (i) sensitivity is enhanced on rule 2 because it is occurs more frequently; (ii) sensitivity is enhanced on rule 2 because it has a unique response axis (and thus involves less resource sharing/conflict in the output pathway).

      The authors do not directly distinguish between these hypotheses per se but their modelling exercise shows that both results (and some additional constraints) can be captured by a hybrid model that combines Bayesian inference and Q learning, but not by models based on either principle alone. A Q-learning model fails to capture the latent state inference and/or the rule 2 advantage. The Bayesian inference model captures the rapid switches to rule 2 (which are more probable following errors on rule 1 and rule 3) but predicts matched discrimination performance for partially ambiguous stimuli on colour rules 2 and 3. This is because although knowing the most likely rule increases the probability of a correct response overall it does not increase discriminability and thus boosts the more ambiguous stimuli. I wondered whether it might be possible to explain this result with the addition of an attention-like mechanism that depends on the top-down inference about the rule. For example, greater certainty about the rule might increase the gain of discrimination (psychometric slope) in a more general way.

      We agree with the reviewer that our logic in ruling out pure inference models assumes that other factors affecting performance, like attention or motivation, are equivalent between blocks. In principle, if there were large and sustained differences in these factors between Rule 2 vs Rule 1 or 3 blocks, that might offer a different explanation for the effect. We now mention this caveat in the manuscript. In terms of actually leveraging this into a full account of the behavior, we are not quite sure how to instantiate the reviewer’s particular idea why this would be the case, however, since (as as we show in Fig. 3a,b,c, and Fig. S4a,b,c) the difference in psychometric slopes lasts at least 200 trials into the rule, even when (in the hybrid learning model) the feature weights have converged (Figure 4 – figure supplement 2). It’s hard to see why elevated uncertainty about the rule would persist this long in anything resembling an informed ideal observer model.

      The authors propose a hybrid model in which there is an implicit assumption that the response axis defines the rule. The model infers the latent state like an ideal observer but learns the stimulus-response mappings by trial and error. This means that the monkeys are obliged to constantly re-learn the response mappings along the shared response axis (for rules 1/3) but they remain fixed for rule 2 because it has a unique response axis. This model can capture the two major effects, and for free captures the relative performance on congruent and incongruent trials (those trials where the required action is the same, or different, for given stimuli across rules) on different blocks.

      I found the author's account to be plausible but it seemed like there might be other possible explanations for the findings. In particular, having read the paper I remained unclear as to whether it was the sharing of response axis per se that drove the cost on rule 3 relative to 2, or whether it was only because of the assumption that response axis = rule that was built into the authors' hybrid model. It would have been interesting to know, for example, whether a similar advantage for ambiguous stimuli on rule 2 occurred under circumstances where the rule blocks occured randomly and with equal frequency (i.e. where there was response axis sharing but no higher probability); or even whether, if the rule was explicitly signalled from trial to trial, the rule 2 advantage would persist in the absence of any latent state inference at all (this seems plausible; one pointer for theories of resource sharing is this recent review: https://www.cell.com/trends/cognitive-sciences/fulltext/S1364-6613(21)00148-0?_returnURL=https%3A%2F%2Flinkinghub.elsevier.com%2Fretrieve%2Fpii%2FS1364661321001480%3Fshowall%3Dtrue). No doubt these questions are beyond the scope of the current project but nevertheless it felt to me that the authors' model remained a bit tentative for the moment.

      Thanks for these interesting thoughts. It is true that the imbalanced pattern of sharing (of response axes, and actually also features) across the three rules has important consequences for learning/inference under our model (and indeed other latent state inference models such as the informed ideal observer). It is an intriguing idea that these features of the design might cause interference even per se, for instance even without the need to do inference or learning because the rules are fully signaled. We agree this (and the other case the reviewer mentioned) is an interesting direction for future work. We have added this in the discussion, line 800-812.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Author response (Tane at al: RC-2022-01646)

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): * Comments The work described in this manuscript starts with an in-silico analysis of the primary amino-acid sequence of CAP-H proteins that reveals the presence in vertebrate orthologs of an N-terminal extension of ~80 amino acids in length which contains 19 serine or threonine residues and also, in its centre, a stretch of conserved basic amino acids predicted to form a helix. These features suggest a regulatory module. Using xenopus egg extracts depleted of endogenous condensins and supplemented with recombinant condensin I holocomplexes, either wildtype or mutants, the authors show that deleting the N-terminal tail of CAP-H, or just the central helix (CH), increases the association condensin I with chromatin in mitotic egg extracts and accelerates the formation of mitotic chromosomes. Interestingly, they also show that deleting the N-tail enables a substantial amount of condensin I to associate with chromatin in interphase extracts and to form chromosome-like structures, while WT condensin I cannot. Using in vitro assays and naked DNA as substrate, the authors further show that removing the N-terminal tail of CAP-H improves both the topological (salt-resistant) association of condensin I with DNA and it loop extrusion activity. These experiments appear to me as are clear and robust. They convincingly reveal that N-tail of human CAP-H hinders the binding of condensin I to DNA and both its loop-extrusion and chromosome-shaping activities. However, the mechanism through which such hindrance is achieved remains elusive (see major comments 1-3). A complementary part of the work tackles the important question of the cell cycle control of such counteracting effect. Using newly-designed antibodies against two phospho-serine residues within the tail, the authors provide evidence that the tail is phosphorylated in mitosis-specific manner. This points towards phosphorylation as a biological mean to modulate the effect of the tail on condensin's binding during the cell cycle. In support to this idea, using non-phosphorylatable or phosphomimic substitutions of all the serine and threonine residues within the tail (n =19), including one substitution within the CH domain (Ser 70), the authors show that non-phosphorylatable mutations (H-N19A) or phosphomimic mutations (H-N19D) respectively reduce or improve condensin I binding to chromatin in mitotic egg extracts. This suggests that the phosphorylation of the N-terminal tail in mitosis might relieve its negative effect on condensin I binding to chromatin. The weaknesses I see in this part of the study concern (1) the validation of the phospho-antibodies, which appears to me as insufficiently described (major comment 4), (2) the possibility the bulk changes in amino acids (n=19 out of 80) could impact the folding of the central helix (minor comment X) and (3) that some substitutions could impact the binding of condensin I by different mechanisms (minor comment X).

      Major comments:

      1. On the model. The authors propose that the N-tail could stabilise an interaction between the N-terminal part of CAP-H and SMC2's neck, which would restrain the transient opening of a DNA entry gate within the ring, necessary for the topological engagement of DNA and loop formation. Although the model is sound, I see no direct data that support it in the manuscript. Such model predicts that a CAP-H protein containing or not the N-terminal tail (or the central helix) should exhibit different binding strengths to SMC2 in vitro. It seems to me that the authors could easily test this prediction using the recombinant proteins they produced in the context of this study. *

      Response

      We thank the reviewer for pointing out this important issue. To test whether the CAP-H N-tail indeed contributes to the stabilization of the SMC2-kleisin gate, we set up a highly sophisticated functional assay described by Hassler et al (2019). The authors used this assay to demonstrate that an N-terminal fragment of kleisin (engineered to be cleaved by TEV protease) is released from the rest of the condensin complex in an ATP-dependent (i.e., head-head engagement-dependent) manner. We reasoned that this assay is most powerful to prove our hypothesis in a mechanistically relevant context. We envisioned that the CAP-H fragment lacking its N-tail can readily be released whereas the CAP-H fragment retaining its N-tail is more difficult to be released (because of the postulated stabilization of the SMC2-CAP-H interaction). Despite substantial efforts in making TEV-cleavable constructs and in testing various releasing conditions, we have not been able to recapitulate the ATP-dependent release even with the holo(H-dN) construct. Thus, unfortunately, this trial enabled us to neither prove nor disprove our hypothesis.

      We are fully aware that the full reconstitution of ATP-dependent and phosphorylation-stimulated gate-opening reaction in vitro is a very important direction in the future. It is beyond the scope of the current study, however.

      2. On ATP-hydrolysis. Given the importance of ATP hydrolysis for the engagement of condensin into a topological mode of association with DNA and for its loop extrusion activity, I suggest that the authors measure the impact of the N-tail and of the CH domain on the rate of ATP hydrolysis by condensin I holocomplexes. I suppose that it can be relatively easily done (PMID: 9288743) using the recombinant WT and mutant versions they purified in the course of this study.

      Response

      We appreciate this constructive comment. In fact, we did a preliminary experiment and found that ATPase activities (either in the absence or presence of DNA) were not significantly different between holo(WT) and holo(H-dN). We were not surprised with this result because our previous study on condensin II indicated that enhanced ATP hydrolysis by a class of mutant complexes is not directly coupled to their enhanced association with chromosomes (Yoshida et al., 2022, eLife). We consider that other functional assays, such as the topological loading assay and the loop extrusion assay shown in the current manuscript, are more informative assays to address ATP-dependent activities of the condensin complexes.

      3. A conundrum with previous work? In Kimura et al. Science 1998 (PMID: 9774278), the lab of Tatsuya Hirano has shown that xenopus condensin I purified from mitotic egg extracts induces the supercoiling of plasmid DNA in vitro, but fails to do so when it is purified from interphase egg extracts. This echoes the inhibitory effect of the N-tail of the topological binding of condensin I described in the current manuscript. However, using a gel shift assay, Kimura et al. 1998 also provide evidence that interphase and mitotic condensin I bind plasmid DNA in vitro with similar efficiencies. At first sight, this prior observation seems to contradict the idea that the N-tail of CAP-H restrains DNA binding unless it is phosphorylated in mitosis. Is it possible that the in vitro binding assays used in Kimura et al. 1998 and in this work might assess different modes of binding? I suggest that this apparent conundrum should to be discussed.

      Response

      We thank the reviewer for following our early studies. As discussed below, we are confident that our conclusion reported in the current study by no means contradicts our previous observations.

      We reason that the confusion expressed by the reviewer stems from intrinsic, technical limitations of the gel-shift assay. Such limitations become apparent especially when it is applied to the functional analyses of complicated protein machines such as condensins. For instance, the DNA-binding activity of condensin I detected by the gel-shift assay is neither ATP-dependent nor phosphorylation-dependent (Kimura and Hirano, 1997; Kimura et al., 1998). It is fundamentally different from the ATP-dependent activities detected by the topological loading and loop extrusion assays reported in the current study (It remains unknown whether the two activities are stimulated by mitotic phosphorylation). Thus, the DNA-binding activity detected by the gel-shift assay does not reflect “productive” DNA interactions that depend on ATP hydrolysis in vitro. We therefore consider that gel-shift analyses of holo(WT) and holo(H-dN) would not produce any useful information.

      *Related to that, could it be possible for the authors to assess the impact of the N-tail on the salt-sensitive binding of condensin to DNA, i.e. by reproducing the topological binding assay but omitting the high salt washes? I guess this information could be useful to fully apprehend the impact of the N-tail on the binding of condensin. *

      Response

      When we set up the topological loading assay, we actually tested a low-salt wash condition that the reviewer suggests here. Although a much higher level of DNA recovery was observed with the low-salt condition than with the high-salt wash condition, no nucleotide dependency was detectable with the low-salt condition. Moreover, no difference in DNA recovery between holo(WT) and holo(H-dN) was observed. Thus, the low-condition condition allowed us to detect the “bulk” DNA-binding activity that is equivalent to that detected by the gel-shift assay. These results were fully consistent with the discussion above.

      4. Validation of phospho-antibodies and by extension showing the phosphorylation of the tail. The newly-designed phospho-serine antibodies used in this study to show that the N-tail is phosphorylated at serine 17 and serine 76 in mitosis (Fig. EV3) are, in my view, not characterized enough. This piece of data is key to substantiate the idea that the tail is phosphorylated in mitosis. Yet, showing that these antibodies detect epitopes on WT condensin I from mitotic egg extracts but not on the H-N19A counterpart, nor on WT condensin I from interphase extracts, does not demonstrate the phospho-specificity of such antibodies. I suggest that a PPase treatment should be conducted to assess the phospho-specificity of these antibodies. Moreover, since the lab of Tatsuya Hirano has the know-how to deplete Cdc2/CDK1 from xenopus egg extract, such strategy could/should be used to further validate the antibodies and assess to which extent the N-tail is phosphorylated in a Cdc2-dependent manner.

      Response

      We have performed two sets of experiments to confirm the specificity of the phosphoepitopes recognized by anti-hHP1 and anti-hHP2. In the first set, we performed a phosphatase treatment assay. Holo(WT) that had been preincubated with Dcond M-HSS was immunoprecipitated using an antibody against hCAP-G, treated with l protein phosphatase in the presence or absence of phosphatase inhibitors, and analyzed by immunoblotting using anti-hHP1 and anti-hHP2. The results (now shown in Supplementary Fig 3C) demonstrated that the epitopes recognized by anti-hHP1 and anti-hHP2 are sensitive to phosphatase treatment. In the second set, we performed a phosphopeptide competition assay. Holo(WT) that had been preincubated with Dcond M-HSS was immunoprecipitated and subjected to immunoblotting. The membranes were triplicated and probed with anti-hHP1 in the presence of no competing peptide, hHP1 or hHP2. Similarly, another set of triplicated membranes was probed with anti-hHP2 in the presence of no competing peptide, hHP1 or hHP2. We found that the signal recognized by anti-hHP1 competed with hHP1, but not with hHP2, and that the signal recognized by anti-hHP2 competed with hHP2, but not with hHP1. The results (now shown in Supplementary Fig 3D) demonstrated the sequence specificity of the phosphoepitopes recognized by the two antibodies. The procedures for these experiments have been described in Materials and Methods.

      Because Cdk1 depletion from M-HSS creates an HSS equivalent to I-HSS, we do not consider that the suggested experiment will provide additional information.

      *Minor comments:

      1. The impact of the 19 mutations, A or D, introduced within the tail on the folding of the central helix? The idea that the negative effect of the N-tail is relieved by phosphorylation is based on the chromatin binding phenotypes exhibited by the H-N19D or H-N19A mutant holocomplexes, in which 19 amino-acids out of 80 have been modified, include one in the central helix. The authors also provide evidence that the central helix (CH) located within the tail plays a key role in the negative regulation of condensin I binding. Thus, I wonder to which extent the folding of the central helix could be impacted by the mutations introduced in the tail and notably the one within the central helix itself. Could the author assess the structure of mutated tails using Alpho-fold and/or discuss this point? *

      Response

      According to the reviewer’s suggestion, we performed structure predictions using Alphafold2, and found that neither the N19A nor N19D mutations alter the original prediction of helix formation that was made for the wild-type CH sequence. A conventional secondary structure prediction using Jpred4 reached the same conclusion.

      2. Phosphorylation of serine 70 in the central helix by Aurora-B kinase? A prior study by Tada et al. (PMID: 21633354) has shown (1) that serine 70 of the N-tail of hCAP-H is phosphorylated by Aurora-B kinase, (2) that the mutation S70A reduces the binding of condensin I to chromatin in HeLa cells and (3) that hCAP-H interacts with histone H2A in an Aurora-B dependent manner. This draws a picture in which the phosphorylation of Ser70 by Aurora-B would improve condensin I binding to chromatin by promoting an interaction between hCAP-H and histone H2A/nucleosomes. Intriguingly, Ser 70 in Tada et al. correspond to the serine residue located within the conserved central helix analysed in this study, and this Ser70 residue is mutated in the H-N19D or H-N19A holocomplexes that show reduced chromatin binding in this study. This raises the question as what could be the contribution of the S70A or S70D substitution to the chromatin binding phenotypes shown by the H-N19D or H-N19A holocomplexes. This is not discussed in the manuscript, and the authors do not cite this earlier work (PMID: 21633354) in their manuscript. Is there any reason for that? I suggest it should be cited and discussed.

      Response

      We thank the reviewer for bringing up this issue. In many respects, we do not trust the data reported by Tada et al (2011) and the resultant model they proposed. Previous and emerging lines of evidence reported from our own and other laboratories indicate that histones compete with condensins for DNA binding, strongly arguing against the possibility that histone H2A acts as a “chromatin receptor” for condensins. We formally discussed and criticized the Tada 2011 model in our previous publications, which included Shintomi et al (2015) NCB, Shintomi et al (2017) Science, Hirano (2016) Cell and Kinoshita/Hirano (2017) COCB. We thought that those were enough. That said, we also consider that the reviewer is right. The current study demonstrates that the deletion of the CAP-H N-tail accelerates, rather than decelerates, condensin I loading, providing an additional line of evidence that argues against the Tada model. A critical comparison between the Tada model and our current model would benefit the readers. In the revised manuscript, we have added the following discussion:

      In terms of the regulatory role of the CAP-H N-tail, it would be worthy to discuss the model previously proposed by Tada et al (2011). According to their model, aurora B-mediated phosphorylation of the CAP-H N-tail allows its direct interaction with the histone H2A N-tail, which in turn triggers condensin I loading onto chromatin. Accumulating lines of evidence, however, strongly argue against this model: (i) aurora B is not essential for single chromatid assembly in Xenopus egg extracts (MacCallum et al., 2002) or in a reconstitution assay (Shintomi et al., 2015); (ii) the H2A N-tail is dispensable for condensin I-dependent chromatid assembly in the reconstitution assay (Shintomi et al., 2015); (iii) even whole nucleosomes are not essential for condensin I-mediated assembly of chromatid-like structures (Shintomi et al., 2017). The current study demonstrates that the deletion of the CAP-H N-tail accelerates, rather than decelerates, condensin I loading, providing an additional piece of evidence against the model proposed by Tada et al (2011).

      3. Other minor comments - Please provide a microscope image of DNA loop in Fig. 4D.

      Response

      In the revised manuscript, we have provided a set of time-lapse images of loop extrusion events catalyzed by holo(WT) and holo(H-dN) in Fig 4E.

      *- The authors do not compare the kleisin of condensin I with the one of condensin II with respect to the features tackled in this work. Given the different behaviours condensin I and II, such comparison could be informative for the readers. *

      Response

      We thank the reviewer for this constructive comment. In the revised manuscript, we have added the following statement:

      It should also be added that CAP-H2, the kleisin subunit of condensin II, lacks the N-terminal extension that corresponds to the CAP-H N-tail. Thus, the negative regulation by the kleisin N-tail reported here is not shared by condensin II.

      *- The authors do not reference the work of Robellet et al. Genes & Dev (2015) (PMID: 25691469) on the regulation of condensin binding in budding yeast by an SMC4 phospho-tail. I suggest that the analogy should be discussed. *

      Response

      According to the reviewer’s comment, we have added the following statements at the beginning of Discussion.

      Previous studies showed that mitotic phosphorylation of Cut3/SMC4 regulates the nuclear import of condensin in fission yeast (Sutani et al. 1999) and that phosphorylation of Smc4/SMC4 slows down the dynamic turnover of condensin on mitotic chromosomes in budding yeast (Robellet et al. 2015; Thadani et al. 2018). In the current study, we have focused on the phosphoregulation of vertebrate condensin I by its kleisin subunit CAP-H.

      - In the introduction section, lane 5, the sentence "Many if not all eukaryotic species have two different condensin complexes" appears inappropriate since budding and fission yeast cells possess a single condensin complexes, similar to condensin I in term of primary amino-acid sequence.

      Response

      We thought that the original wording “Many if not all” was good enough to imply that some species, which include budding yeast and fission yeast, have only a single condensin complex. To make it clear, however, we have modified the sentence in the revised manuscript as follows:

      Many eukaryotic species have two different condensin complexes although some species including fungi have only condensin I.

      *- page 4; typo: motif I and V bind to the SMC neck and the SMC4 cap regions, respectively. Should read SMC2 neck. *

      Response

      The reviewer is right. It should read the SMC2 neck. Corrected.

      *- Are the data and the methods presented in such a way that they can be reproduced? YES - Are the experiments adequately replicated and statistical analysis adequate? YES - Are prior studies referenced appropriately? Not all of them (see above) - Are the text and figures clear and accurate? YES

      CROSS-CONSULTATION COMMENTS I consider the comments from all reviewers as helpful for the authors.

      Reviewer #1 (Significance (Required)):

      Summary Condensins are genome organisers of the family of SMC ATPase complexes and are best characterized as the drivers of mitotic chromosome assembly (condensation). It is acknowledged that condensins shape mitotic chromosomes by massively associating with DNA upon mitotic entry (loading step) and by folding chromatin fibres into arrays of loops, most likely through an ATP-dependent extrusion of DNA into loops, as seen in vitro. What remains unclear, however, are the mechanisms by which condensins load onto DNA and fold it into arrays of loops in vivo, and how these reactions are coupled with the cell cycle, i.e. restricted mostly to mitosis. Condensins are ring shaped pentamers that change conformation upon ATP-hydrolysis. In vitro studies suggest that condensins bind DNA via ATP-hydrolysis-independent, direct electrostatic contacts between condensin subunits and DNA. Such electrostatic contacts are salt-sensitive in in-vitro assays. Upon ATP-hydrolysis, condensins engage into an additional mode of binding that is resistant to high salt concentration and likely to be topological in nature. Both modes of association are necessary to form DNA loops. Vertebrates possess two types of condensin complexes, condensin I and II, each composed of a same SMC2-SMC4 ATPase core but associated with two different sets of three non-SMC subunits; a kleisin and two HEAT-repeat proteins. Condensin II is nuclear during interphase and stably binds chromatin upon mitotic entry, while condensin I is located in the cytoplasm during interphase and binds chromatin in a dynamic manner upon nuclear envelope breakdown. How the spatiotemporal control of condensin I and II is achieved remains poorly understood. Previous studies have shown that the phosphorylation of condensin I by mitotic kinases, such as CDK1, Aurora-B and Polo, play a positive role in its binding to chromatin and/or its functioning, but the underlying mechanisms remain to be characterised. In this manuscript, Shoji Tane and colleagues provide good evidence that the N-terminal tail of the human kleisin subunit of condensin I, hCAP-H, serves as a regulatory module for the cell-cycle control of condensin I binding to chromatin and chromosome shaping activity. The authors clearly show that the N-tail of CAP-H hinders the binding of condensin I to chromatin in xenopus egg extracts and, using in vitro assays, that the N-tail also hinders the topological association of condensin I with DNA and its loop extrusion activity. The authors provide additional data suggesting that the phosphorylation of the N-tail of CAP-H, in mitosis, relieves its inhibitory effect on condensin I binding. Based on their findings, Tane et al. propose a model suggesting that the N-terminal tail of CAP-H constitutes a gate keeper that maintains condensin-rings in a closed conformation that is unfavourable for topological binding to DNA, and whose locking effect is relieved in mitosis by phosphorylation.

      Taken as a whole, this work has the potential to reveal a molecular basis for the cell cycle regulation of condensin I in vertebrate cells and as such to significantly improve our understanding of the integrated functioning condensin I. The characterisation of the inhibitory effect of the N-tail on condensin binding to chromatin and to naked DNA in vitro is well described, the data are clear and robust and the results convincing. On the other hand, some of the data on the phospho-regulation appear to me as are more debatable and I think that some of the results described here deserve to be discussed in the context of previous works. Finally, I see no data in the manuscript that directly supports the mechanistic model proposed by the authors, while it seems to me that such model could have been easily tested exprimentally. Thus, I suggest that Tane and colleagues should perform a couple of relatively easy experiments to strengthen their claims and that a few omitted prior studies on the topic should be referenced and discussed. *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): * The manuscript reveals that the N-terminal region of CAPH could play a role in regulating condensin I activity, using a range of in vitro methods. They propose that the N-terminal region of CAPH inhibits complex activity, and this is turned off upon deletion or phosphorylation, by using truncations, phospho-mimics or phospho-deficient mutations. While the results are interesting to the field, and helps to address the question as to how condensin complexes are controlled in a cell cycle dependent manner, some key data and controls are necessary to ensure the conclusion is robust.

      Main comments

      • What is meant by "unperturbed I-HSS" on page 7, ie membrane containing versus membrane free or condensin depleted? *

      Response

      We apologize for having created unnecessary confusion. We meant that the “unperturbed I-HSS” is the “undepleted I-HSS”. As far as the issue of membrane-containing vs membrane-free is concerned, we explicitly mentioned that “we used membrane-free I-HSS in the following experiments” several lines above. In the revised manuscript, we have revised the statement accordingly.

      In many of the protein gels eg figure 4B, the bands for SMC2 and 4 are more intense that the non-SMC components. The method for protein purification also does not include a size exclusion step to ensure sample homogeneity. Authors should perform some sort of quality control checks on samples such as analytical gel filtration or mass photometry to ensure stoichiometry/homogeneity. This is particularly important for samples eg the N19A, where activity is reduced compared to wild-type as poor protein behaviour could create false negative results.

      Response

      As the reviewer is fully aware, the reconstitution and purification of multiprotein complexes, such as condensins, is by no means an easy task. We notice that many groups in the field share common concerns about sample homogeneity and subunit stoichiometry, and that these concerns cannot completely be eliminated even after size exclusion chromatography. Because the current study handles a large number of mutant complexes, we consider that the purification by two-step column chromatography is the most practical approach. We do not notice any abnormal behaviors of holo(H-N19A) in the processes of expression and purification. It is also important to emphasize that the H-N19D mutations cause the completely opposite phenotype. Taken all together, we are confident of our current conclusions.

      That said, in the revised manuscript, we have added the following statement in Results:

      Although we cannot rule out the possibility that the introduction of multiple mutations into the N-tail causes unforeseeable adverse effects on protein conformations, these results supported the idea that ….

      • Loop extrusion assays in figure 4D-G shows no example data i.e. no pictures or videos of loops being formed. These should also be included.*

      Response

      In the revised manuscript, we have provided a set of time-lapse images of loop extrusion events catalyzed by holo(WT) and holo(H-dN) in Fig 4E.

      • Given the mutant holo(H-dN) has higher activity than wild-type, a negative control such as holo(H-dN) without ATP or holo(H-dN) ATPase deficient mutant should also be measured in loop extrusion assays, to ensure the activity is derived from the ATPase activity.*

      Response

      In the revised manuscript, we have added loop formation data for both holo(WT) and holo(H-dN) in the absence or presence of ATP (Supplementary Fig 5). We are confident that both complexes support loop extrusion strictly in an ATP-dependent manner.

      • According to the methods, this work performs the same loop extrusion assay as described in Kinoshita et al, 2022, however, in Kinoshita et al, wild type condensin I makes loops in 30-50% of DNA molecules, where in this study the percentage is less than half that. Can the author please explain the discrepancy given the same method was used?*

      Response

      First of all, we wish to remind the reviewer that the holo(WT) constructs used in the two studies are not identical: CAP-H was N-terminally HaloTagged in all constructs used in Kinoshita et al (2022), whereas the same subunit was C-terminally HaloTagged in the pair of constructs used in the current study. Because we wanted to compare the activities between the full-length CAP-H and N-terminally deleted version of CAP-H (H-dN), we reasoned that it would be inappropriate to put the HaloTag to the N-terminus of CAP-H. The difference in the constructs could explain the observed discrepancy, even if it might not be the sole reason.

      The design of the constructs was accurately described in each manuscript, but the statements were not very explicit about the positions of the HaloTag. To clarify this issue, we have added the following sentences in the revised manuscript:

      Note that the HaloTag was fused to the C-terminus of CAP-H in the current study because we wanted to investigate the effect of the N-terminal deletion of CAP-H. We used N-terminally HaloTagged CAP-H constructs in our previous study (Kinoshita et al., 2022).

      • In the concluding statement the author suggests "Upon mitotic entry, multisite phosphorylation of the N-tail relieves the stabilization, allowing the opening of the DNA entry gate, hence, the loading of condensin I onto chromosomes." This seems unlikely as fusion the N-terminus of the of the kleisin to the C-terminus of SMC2 is able to function for yeast (Shaltiel et al 2022) and condensin II (Houlard et al 2021), and equivalently in cohesin (Davidson et al 2019).*

      Response

      We appreciate the reviewer’s concern. In our view, however, the issue of the “DNA-entry gate” remains under debate in the SMC field (e.g., Higashi et al [2020] Mol Cell; Taschner and Gruber [2022] bioRxiv). For instance, Shaltiel et al (2022) demonstrated that neck-gate fusion constructs can support in vitro activities including topological loading under certain conditions, but also showed that such constructs greatly reduce the cell viability, leaving the possibility that the gate opening is required for some physiological functions.

      That said, it is true that the data reported in the current manuscript do not exclude the possibility that the SMC2 neck-kleisin interface is not used as a DNA entry gate for condensin I loading. In the revised manuscript, we have added the following statement in Discussion:

      Although our model predicts that the SMC2 neck-kleisin interface is used as a DNA entry gate, we are aware that several studies reported evidence arguing against this possibility (e.g., Houlard et al [2021]; Shaltiel et al [2022]). Our current data do not exclude other models.

      *Reviewer #2 (Significance (Required)):

      This is an interesting story that reveals new insights about condensin regulation.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This paper reveals a role of an N-terminal extension of CAP-H in the regulation of condensin-I activity in Xenopus extracts using biochemical reconstitution experiments. The authors demonstrate that a motif in the N-terminal tail that is conserved in vertebrates acts as an inhibitor of condensin I activity. Using several mutant constructs, it is shown that the inhibition by this motif is in turn counteracted by the phosphorylation of neighbouring serine and threonine residues in mitosis, presumably at least in part by CdK. Mutants that have lost this inhibition are able to condense chromatin into chromatid-like structures more efficiently and to some degree even in interphase extracts. Moreover, one such mutant is characterized in detail by biochemical and biophysical experiments and shown to have increased capacity in salt-stable DNA loading and in DNA loop extrusion.

      Major comments: This is a beautiful and thorough study that is presented in a clear and concise manner. The main conclusions are well justified. No additional experiments are needed to support them. Replication and statistical analysis appear adequate. The final model is however largely speculative. Recent work has indicated that loading of yeast condensin does not require gate opening. The authors may thus want to include alternative scenarios or remain more vague. *

      Response

      This comment is related to the last comment of Reviewer#2. See above for our response.

      *The H-N19A mutant has a loss of function phenotype (possibly due to folding problem caused by 19 point mutations rather than lack of phosphorylation), the authors could consider to rescue the phenotype by also including the CH motif mutations in this construct (or make an explanatory statement in the text). *

      Response

      We understand the reviewer’s logic here, but overlaying additional mutations into the H-N19A mutations could cause an unforeseeable effect, potentially making the interpretation of the outcome complicated.

      We also wish to point out that it may be inappropriate to regard the phenotype exhibited by holo(H-N19A) as a simple loss-of-function phenotype. This is because the opposite, accelerated loading phenotype exhibited by holo(H-dN) can be regarded as a consequence of loss of negative regulation. Like holo(H-dN), the phosphomimetic mutant complex holo(H-N19D) displayed an accelerated loading phenotype, fully supporting our conclusion. In the revised manuscript, we have added the following statement in Results:

      Although we cannot rule out the possibility that the introduction of multiple mutations into the N-tail causes unforeseeable adverse effects on protein conformations, these results supported the idea that ….

      *Albeit not necessary for the main conclusions, the authors could possibly significantly strengthen their study by testing for binding partners of the N-tail and the CH motif by running AlphaFold predictions against the condensin I subunits. *

      Response

      We appreciate this constructive comment. We attempted to predict possible interactions between SMC2 and a CAP-H fragment containing its N-tail and motif I using

      ColabFold (Mirdita et al., 2022, Nat. Methods). The algorism excellently predicted the proper folding of the CAP-H motif I and its interaction with the SMC2 neck. Under this condition of predictions, however, the N-tail remained largely disordered (except for the CH), and no interaction with any part of SMC2 was predicted. The same was true when the N19D mutations were introduced in the N-tail sequence. Thus, this trial did not provide much information about the potential interaction target(s) of the CAP-H N-tail.

      *The efficiency of depletion of condensin subunits from I-HSS extracts is not documented (in contrast to M-HSS extracts - figure EV1C). While any condensin remaining in these extracts might not be active (or interfering), the authors may want to at least comment on this in the text. *

      Response

      We check the efficiency of immunodepletion every time by immunoblotting and confirm that a high level of depletion is achieved from both M-HSS and I-HSS. According to the reviewer’s comment, the following statement was placed in Materials and Methods:

      The efficiency of immunodepletion was checked every time by immunoblotting. An example of immunodepletion from M-HSS was shown in Supplemental figure 1C. We also confirmed that a similar efficiency of immunodepletion was achieved from I-HSS.

      *The authors should include information on the quantification of chromatid morphology. Is the analysis based on chromatids taken from the same images/imaging session, from technical replicates, biological replicates? *

      Response

      In the revised manuscript, we have added statements on image presentation and experimental repeats in the appropriate figure legends and methods section. During the revision process, we repeated the experiments shown in Supplementary Fig 2, and obtained the same results. In the revised manuscript, the original set of data has been replaced with the new set of data along with panel C (Quantification of the intensity of mSMC4 per DNA area).

      Minor comment: The colour scheme in Figure 5A is confusing. Use less colour? The orange and red colours are moreover quite similar.

      Response

      According to the reviewer’s comment, we have modified Figure 5A.

      *Reviewer #3 (Significance (Required)):

      The findings provide new insights into how condensin-I activity is restricted outside of mitosis. It was previously assumed that this regulation was (largely) due to the exclusion of condensin I from the nucleus prior to nuclear envelope breakdown. This study shows that another pathway is contributing to the regulation and implies that controlling condensin I activity is more important than previously appreciated. Whether all residual nuclear condensin I is inactivated, remains to be determined. The physiological impact of loss of autoinhibition on chromosome segregation and cell cycle progression also remains to be uncovered. The observed effects are robust and appear significant. Future research on condensin I using reconstitution will likely benefit from being able to control or eliminate the self-inhibition.

      This reviewer has expertise on the biochemistry and structural biology of SMC protein complexes.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Mitotic chromosome formation is a cell cycle-regulated process that is crucial for eukaryotic genome stability. The chromosomal condensin complex promotes chromosome condensation, but the temporal control over condensin function is only scantly understood. In this impressive manuscript, "Cell cycle-specific loading of condensin I is regulated by the N-terminal tail of its kleisin subunit", Tane and colleagues provide important new insight into the cell cycle-regulation of condensin. The authors identify a kleisin N-tail that acts as a negative regulator of condensin's DNA interactions. Removal of this N-tail, or its cell cycle-dependent phosphorylation, relieves inhibition and activates condensin. This is a simple, yet very important story, that advances our molecular understanding of chromosome formation. The experiments are performed to a very high technical standard and support the authors conclusions. This manuscript is highly suitable for publication in any molecular biology journal, once the authors have considered the following points.

      1. Introduction. a) The authors could better explain their own prior work (Kimura et al. 1998), which has identified the condensin XCAP-D2 and XCAP-H as the targets of phosphoregulation. The current manuscript explains the role of XCAP-H phosphorylation. *

      Response

      According to the reviewer’s comment, we have added the following sentence in Introduction:

      The major targets of mitotic phosphorylation identified in these studies included the CAP-D2 and CAP-H subunits.

      1. b) Given the limited knowledge about condensin cell cycle regulation, it seems prudent to provide a brief summary of what is known. Fission yeast Smc4 phosphorylation regulates condensin nuclear import (Sutani et al. 1999), while budding yeast Smc4 phosphorylation slows down the dynamic turnover of the condensin complex on chromosomes (Robellet et al. 2015 and Thadani et al. 2018).

      Response

      We appreciate this constructive comment. According to the reviewer’s comment, we have added the following statements at the beginning of Discussion.

      Previous studies showed that mitotic phosphorylation of Cut3/SMC4 regulates the nuclear import of condensin in fission yeast (Sutani et al. 1999) and that phosphorylation of Smc4/SMC4 slows down the dynamic turnover of condensin on mitotic chromosomes in budding yeast (Robellet et al. 2015 and Thadani et al. 2018). In the current study, we have focused on the phosphoregulation of vertebrate condensin I by its kleisin subunit CAP-H.

      2. Extracts were mixed with mouse sperm nuclei. If there is a reason why mouse rather than Xenopus sperm nuclei were used, this would be interesting to know.

      Response

      The original motivation for introducing mouse sperm nuclei into Xenopus egg extracts was to test the functional contribution of nucleosomes to mitotic chromosome assembly. When mouse sperm nuclei are incubated with an extract depleted of the histone chaperone Asf1, the assembly of octasomes can be suppressed almost completely. Remarkably, we found that even under this “nucleosome-depleted” condition, mitotic chromosome-like structures can be assembled in a manner dependent on condensins (Shintomi et al., 2017, Science). Xenopus sperm nuclei cannot be used in this type of experiment because they endogenously retain histones H3 and H4 and are therefore competent in assembling octasomes even in the Asf1-depleted extract. During this study, we realized that the use of mouse sperm nuclei in Xenopus egg extracts provides additional and deep insights into the basic mechanisms of mitotic chromosome assembly. For instance, the functional contribution of condensin II to chromosome assembly could be observed more prominently when mouse sperm nuclei are used as a substrate than when Xenopus sperm nuclei are used (Shintomi et al., 2017, Science). We suspected that the slow kinetics of nucleosome assembly on the mouse sperm substrate creates an environment in favor of condensin II’s action. For these reasons, our laboratory now extensively uses mouse sperm nuclei for the functional analyses of condensin II (Yoshida et al., 2022. eLife) and other purposes (Kinoshita et al., 2022, JCB). Yoshida et al (2022) used experimental approaches analogous to the current study, and found that the deletion of the CAP-D3 C-tail, causes accelerated loading of condensin II. One of the long-term goals in our laboratory is to critically compare and contrast the actions of condensin I and condensin II in mitotic chromosome assembly. Thus, the use of the same substrate in the two complementary studies can be fully justified.

      During the preparation of this response, we realized that the readers would benefit from a brief statement about the comparison between condensin I and condensin II. In the revised manuscript, we have added the following statement in Discussion:

      It should also be added that CAP-H2, the kleisin subunit of condensin II, lacks the N-terminal extension that corresponds to the CAP-H N-tail. Thus, the negative regulation by the kleisin N-tail reported here is not shared by condensin II. Interestingly, however, a recent study from our laboratory has shown that the deletion of the CAP-D3 C-tail causes accelerated loading of condensin II onto chromatin (Yoshida et al., 2022). It is therefore possible that condensins I and II utilize similar IDR-mediated regulatory mechanisms, but they do so in different ways.

      3. Page 5. "we next focused on the conserved helix (CH) [...], that is enriched with basic amino acids." Based on the provided sequence alignment, the helix contains an equal number of both basic and acidic residues. Is it correct to characterize this helix as positively charged?

      Response

      The reviewer is right. In the revised manuscript, we have used a more neutral expression as follows:

      we next focused on the conserved helix (CH) [...], that contains conserved basic amino acids.

      4. To prevent N-tail phosphorylation, the authors create a (H-N19A) allele, referring to Cdk promiscuity. Cdk cooperation with other mitotic kinases can also be expected. Nevertheless, in case the authors created a variant with only the 4 Cdk consensus sites mutated, it would be interesting to know its consequences.

      Response

      We consider that this is a reasonable question. In our early experiments, we noticed that introduction of multiple SP/TP sites in the non-SMC subunits of condensin I including CAP-H caused a relatively mild phenotype in mitotic chromosome assembly in Xenopus egg extracts. Then we found that the deletion of the CAP-H N-tail caused a very clear, accelerated loading phenotype, prompting us to focus on the regulatory function of the CAP-H N-tail. As the reviewer correctly points out, the current study does not pinpoint the number and position of target sites involved in the proposed phosphoregulation by the CAP-H N-tail. We wish to address this important issue in the near future, along with reconstitution of the phosphoregulation using purified components.

      5. Fig EV3A, a second region of mitotic condensin phosphorylation is XCAP-D2. The authors state that XCAP-D2 phosphorylation does not impact on condensin function in their assays. This is very relevant to the current paper, so it would be good to see the Yoshida et al. 2022 Elife publication (in press) as an accompanying manuscript.

      Response

      We thank the reviewer for pointing out this issue, but it is not necessarily clear to us what the reviewer requests. In the original manuscript, we cited Yoshida et al (2022) in Discussion as follows:

      Recent studies from our laboratory showed that the deletion of the CAP-D2 C-tail, which also contains multiple SP/TP sites (Supplementary Figure 3A), has little impact on condensin I function as judged by the same and related add-back assays using Xenopus egg extracts (Kinoshita et al, 2022; Yoshida et al, 2022).

      We believe that the current statement is good enough.

      6. One of the authors' most striking results is chromosome formation in interphase egg extracts using condensin (H-dN). At the same time, condensin (H-dN) is unable to support DNA supercoiling or chromosome reconstitution with recombinant components. More emphasis might be placed on this important piece of information, and possible reasons should be discussed. Can Cdk-treatment restore condensin (H-dN) biochemical activity? If not, then condensin (H-dN) might have lost more than just an inhibitory N-tail. The cohesin N-tail is thought to fulfil a positive role during DNA loading (Higashi et al. 2020). Could it be that the condensin N-tail encompasses both positive and negative roles?

      Response

      We were also surprised to find that holo(H-dN) gains the ability to assemble mitotic chromosome-like structures in interphase extracts. It should be stressed, however, that the formation of mitotic chromosome-like structures in I-HSS requires a much higher concentration (150 nM) than the standard concentration used in M-HSS (35 nM). Thus, the deletion of the CAP-H N-tail alone cannot make the condensin I complex fully active in I-HSS. We think that the negative regulation by the CAP-H N-tail is not the sole mechanism responsible for the very tight cell cycle regulation of condensin I function. We emphasize this important point by mentioning that “our results uncover one of the multilayered mechanisms that ensure cell cycle-specific loading of condensin I onto chromosomes” in Summary.

      At the end of Discussion, we describe the limitations of the current study: “we have so far been unsuccessful in using these recombinant complexes to recapitulate positive DNA supercoiling or chromatid reconstitution, both of which require proper Cdk1 phosphorylation in vitro”. We are fully aware that full reconstitution of phosphorylation-dependent activation of condensin I in vitro is one of the most important directions in the future.

      Although we currently do not have any evidence to suggest that the H N-tail has a positive role, we do not exclude such a possibility.

      7. Here comes my main question for the authors (though I should stress that I do not expect an answer for publication in a Review Commons journal). The authors now have a unique opportunity to gain key mechanistic insight into condensin by answering the question, 'how does the kleisin N-tail inhibit condensin'? The authors allude to a model in which the N-tail interacts with Smc2 to close/obstruct the kleisin N-gate, through which the DNA likely enters the condensin ring. Can the authors biochemically recapitulate an interaction between an isolated N-tail (or N-terminal section of XCAP-H) and Smc2? Does Cdk phosphorylation alter this interaction?

      Response

      This comment is related to Comment #1 of Reviewer#1. See above for our response.

      *Minor points. 8. The condensin loop extrusion results would benefit from a supplementary movie or time-series, to illustrate the comparison. Details of how loop rate, duration and sizes were assessed should be added to the methods section. *

      Response

      In the revised manuscript, we have provided a set of time-lapse images of loop extrusion events catalyzed by holo(WT) and holo(H-dN) in Fig 4E. We have also added the following explanations for how the parameters of loop extrusion reactions were assessed in Materials and Methods:

      To determine the loop size, the fluorescence intensity of the looped DNA was divided by that of the entire DNA molecule for each image, and multiplied by the length of the entire DNA molecule (48.5 kb). The loop rate was obtained by averaging the increase in looped DNA size per second. The loop duration was calculated by measuring the time from the start of DNA loop formation until the DNA loop became unidentifiable.

      9. Figure EV3A legend, "hHP4" should probably read "hHP2".

      Response

      The reviewer is right. It should read hHP2. Corrected.

      *Reviewer #4 (Significance (Required)):

      see above *

    1. Author Response

      Reviewer #1 (Public Review):

      It has previously been shown that deletion of the GluA3 subunit in mice leads to alterations in auditory behavior in adult mice that are older than a couple of months of age. The GluA3 subunit is expressed at several synapses along the auditory pathway (cochlea and brainstem), and in ko mice changes in brainstem synapses have been observed. These previously documented changes may account for some of the deficits in hearing in adult ko mice.

      In the current study, the authors investigate an earlier stage of development (at 5 wks) when the auditory brainstem responses (ABRs) are normal, and they ask how transmission persists at inner hair cell (ihc) ribbon synapses in GluA3 ko mice. They discovered that deletion of GluR3A significantly changed 1) the relative expression of Glu A2 (dramatically downregulated) and A4 subunits at SGN afferents, and 2) caused morphological changes in ihc ribbons (modiolar side) and synaptic vesicle size (pillar).

      The changes documented in the 5 wk old GluA3ko mice were not necessarily predicted because in general the mechanisms involved in shuffling GluA receptors at this synapse (or other sensory synapses) are not completely understood; furthermore, much less is known about the role of differentiation of ihc-sgn synapses along a modiolar-pillar axis. With that said, the only shortcoming of the study is a lack of explanation for the observed changes in the synaptic structure; but this is not specific to this study.

      Given the quality of the data and the clarity of presentation of results, this is a very valuable study that will aid and motivate researchers to further explore how auditory circuitry develops, and becomes differentiated, at the level of ihc-sgn synapses.

      We thank the reviewer for the positive and helpful comments. Ongoing studies are seeking to explain the observed changes in synapse structure.

      Reviewer #2 (Public Review):

      The goal of the study by Rutherford and colleagues was to characterize functional, structural, and molecular changes at the highly specialized cochlear inner hair cell (IHC) - spiral ganglion neuron (SGN) ribbon synapse in GluA3 AMPA receptor subunit knockout mice (GluA3KO). Previous work by the authors demonstrated that 2-month-old GluA3KO mice experienced impaired auditory processing and changes in synaptic ultrastructure at the SGN - bushy cell synapse, the next synapse in the auditory pathway.

      In the present study, the authors investigated whether GluA3 is required for ribbon synapse formation and physiology in 5-week-old mice using a series of functional and light- and electron microscopy imaging approaches. While deletion of GluA3 AMPAR subunit did not affect hearing sensitivity at this age, the authors reported that cochlear ribbon synapses exhibited changes in the molecular composition of AMPARs and pre- and postsynaptic ultrastructural alterations. Specifically, the authors demonstrated that GluA3KO ribbon synapses exhibit i) a global reduction in postsynaptic AMPARs, which is also reflected by smaller AMPAR arrays, ii) a reduction in GluA2 and an increase in GluA4 protein expression at individual postsynaptic sites, and iii) changes in the dimensions and morphology of the presynaptic specialization ("ribbon") and in the size of synaptic vesicles. These reported structural changes are linked to the side of innervation with respect to the IHC modiolar-pillar axis.

      The results presented by the authors are conceptually very interesting as the data support the notion that potentially detrimental changes in the molecular composition of a sensory synapse can be compensated to sustain synaptic function to a certain extent during development. The conclusions of the study are mostly well supported by the data, but some experimental details or control experiments are missing or need to be clarified to allow a full assessment.

      1) The authors tested which GluA isoforms are expressed in SGNs of GluA3KO mice and reported that only GluA2 and GluA4, and not GluA1, receptor subunits are present in the cochlear. It is, however, a bit difficult to understand why immunolabelling for GluA1 was only performed on brainstem sections (Fig. 1B right) and not in the cochlear to probe for postsynaptic localization at ribbon synapses as it was done for the other isoforms (Fig. 2 and 6) given that GluA3KO IHCs exhibited a larger number of ribbons that lacked GluA2 and 3 (lone or 'orphaned' ribbons; Fig. 6B). It is also not clear why immunolabelling for GluA2 and 4 was performed to probe for expression of these receptor subunits on SGN cell bodies in the cochlear spiral ganglion. Which neurons are expected to synapse onto these somata?

      There is precedent for expression of GluA subunits in the SGN cell bodies reflecting expression at the synapse, although it is not clear if any of that immunoreactivity reflects cell surface expression in the intact ganglion or if it represents solely intracellular subunits being trafficked to synapses.

      Figure 1b shows that GluA2 is expressed in the somata of WT mice and KO mice. The lower panels show that GluA1 is not expressed in the somata of WT or KO mice. The right panels show that while GluA1 is expressed in the cerebellum of WT and KO mice, is not expressed in the cochlear nucleus of WT or KO mice. We think this demonstrates the lack of compensation by GluA1 in the GluA3 KO.

      We have now added GluA4 immunoreactivity in the SGNs to Fig. 1, for completeness. In our experience, GluA subunits expressed at synapses are also found in the cell bodies, and GluA subunits not expressed at synapses are not found in the cell bodies. The current data is consistent with this, although we did not label GluA1 in the organ of Corti.

      2) The authors state in the text that GluA3 expression is completely abolished in GluA3KO IHCs, however, there appears to still be a faint punctate immunofluorescence signal visible when an antibody directed against GluA3 was used (Fig. 2C). Providing additional information on the specificity of this (and the other) antibodies used in the study would be helpful.

      We agree, and thank the reviewer for pointing this out. There is indeed a small signal presumably due to cross-reactivity of the anti-GluA3 with GluA2 subunits, because the cytoplasmic epitope recognized by the antibody is in a region of high similarity of GluA2 and GluA3 (Dong et al., 1997). In addition, the specification sheet of the Santa Cruz company states that the GluA3 antibody can detect GluA2. This relatively small cross-reactivity is noted now in the text on p. 9. Also, this appearance was a product of the same brightness and contrast issue noted above in the response to the editor’s summary. Upon readjustment, the signal is less apparent, because in the readjustment we used less brightness and less contrast enhancement to avoid the unwanted saturation in some of the panels.

      3) The authors reported changes in the volume of the presynaptic ribbon and postsynaptic density surface area in GluA3KO KO animals. The EM data as presented are however not sufficiently convincing.

      i) There appears to be a mismatch between the EM data shown in Fig. 3 and 4 and the information in the text with respect to the number of data points in the plots and the reported number of reconstructed synapses. This raises several questions with respect to the analysis. For instance, it is unclear whether certain synapses were reconstructed but excluded from the analysis. If so, what were the exclusion criteria?

      We thank the reviewer for pointing out this discrepancy within the text and the figures. The discrepancies are now fixed. We have added more information on how the synapses were reconstructed in the M&M (p.14-15).

      ii) The authors compare PSD surface areas in reconstructions from 3D serial sections, but for some of the shown reconstructions (i.e. Fig. 3A' and B' and 4B'), it appears as if PSDs were only incompletely reconstructed.

      We included all the ultrathin sections that show afferent dendrites with a visible PSD. We revised all the reconstructions and fixed some misalignments. The appearance of the reconstructed PSD relates to how the Reconstruct software creates the 3-D rendering. We did not use any extra software to smooth the hedges of the 3D reconstructions.

      4) The immunolabelling experiments shown in Fig. 2 and 6 are of very high quality and the quantitative analysis of the light microscopy data (Fig. 6-9) is clearly very detailed, but slightly difficult to interpret the way it is presented. Specifically, it is unclear how the number of synapses per IHC (Fig. 6B) and the separation into modiolar and pillar side (Fig. 8) was achieved based on the shown images without the outlines of individual cells being visible.

      We agree. Please see the revised Figs. 2, 6, and 8, and explanation in the figure legend of Fig. 8.

      5) Adding more detailed information about important parameters (mean, N/n, SD/SEM) and the statistical tests used for the individual comparisons presented in the Figures would help strengthen the confidence in the presented data.

      Please see the new spreadsheets accompanying the revised manuscript.

      6) In general, the authors report a series of molecular and structural changes in IHCs and reach the conclusion that GluA3 subunits may have a role in "trans-synaptically" determining or organizing the architecture of both the pre- and post-synapse. However, some of the arguments are very speculative and many of the claims are not supported by experimental data presented in the paper. The authors should consider to also compare their findings to studies that investigated ultrastructural changes of AMPAR subunit knockouts in other synapse types, and discuss alternative interpretations (e.g. homeostatic changes).

      Thank you for this comment. Considering that reviewer 1 asked for more speculation, we have decided to leave the level of speculation similar to the initial submission. However, we went through the text to make sure our claims were backed by our observations.

      Due to space constraints, rather than comparing to additional other synapses, in this context we prefer to compare with auditory brainstem synapses.

      The possibility of homeostatic changes we now added on p. 29.

    1. Author Response

      Reviewer #1 (Public Review):

      With a real interest, I read the manuscript entitled "Sex-specific effects of an IgE polymorphism on immunity susceptibility to infection and reproduction in a wild rodent", written by Wanelik and colleagues. Actually, I am impressed with each and every part of this work. This study is very well designed and answers intriguing scientific questions. The study is multilayer and multidimensional and goes far beyond a genomic association as it deeply addresses, to mention only those most important, ecological, parasitological, immunological, and gene expression aspects. In addition to studying the free-living animal community of voles, it utilizes this opportunity to get some insights into the genetics and biology of the high-affinity IgE receptor not possible to be gained in studies performed in humans or standard laboratory animals. The data are presented in a very elegant way and the article is really nicely written.

      We thank the Reviewer for these positive comments, and are very glad to hear they think our work is so comprehensive.

      Reviewer #2 (Public Review):

      In this manuscript, Wanelik et al. use a wild rodent population to test if a polymorphism in a receptor for immunoglobulin E (IgE) affects immune responses, resistance to infection, and fitness. Finding such effects would imply that polymorphisms in immune genes can be maintained by antagonistic pleiotropy between sexes, which has important implications for our understanding of how genetic variation is maintained. The work presented here extends previous work by the same group where they have shown that expression of GATA3 (a transcription factor inducing Th2 immune responses) affects tolerance to ectoparasites and that polymorphism in Fcer1a affects the expression of GATA3. The present study is based on a fairly large data set and comprehensive analysis of a number of different traits. Indeed, the authors should be commended for investigating all steps in the chain polymorphism→immune response→resistance→fitness. Unfortunately, the presentation of the methodology is a bit confusing. Moreover, most of the key results are only marginally significant.

      We thank the Reviewer for their positive feedback, and are very glad to hear they think our work is so comprehensive. As detailed below, we have tried to clarify our methodology and to temper our claims in the revised manuscript.

      As regards methodology, I was confused by the differential expression (DE) analyses presented in fig 1A. First, it took a while to understand that these were based on a comparison of unstimulated cells (i.e. baseline expression), not ex vivo stimulated cells; this should be made explicit in conjunction with the presentation of the results. Second, it would be good to clarify (and motivate) in the Results that you compare individuals with at least one copy of the GC haplotype against the rest, i.e. a dominant model.

      We apologise for the confusion. We now explicitly state in the Results (lines 313-314) that the DGE analysis was based on unstimulated splenocytes: “Differential gene expression (DGE) analysis performed on unstimulated splenocytes taken from 53 males and 31 females assayed by RNASeq”. We also explicitly state “Unstimulated immune gene expression” in the legend for Figure 1.

      Please note that an additive model was used for all analyses run using the hapassoc package (macroparasites and SOD1). A dominant model was used in the DGE analysis and in other analyses where it was not possible to use the hapassoc package (gene expression assayed by Q-PCR, microparasites and reproductive success) which meant that only those individuals for which haplotype could be inferred with certainty could be included (i.e. a smaller dataset). In this case, a dominant model was used. Our use of the dominant model in the DGE analysis is now more explicitly explained on lines 933-935: “Only those individuals for which haplotype could be inferred with certainty could be included (n = 53 males and n = 31 females; none of which were known to have two copies of the GC haplotype hence the choice of a dominant model).” And its use in other non-hapassoc analyses is now explicitly stated on lines 991-992: “as in the DGE analysis, genotype was coded as the presence or absence of the GC haplotype (i.e. a dominant model)”.

      The first key result is that polymorphisms in Fcer1a have sex-specific effects on the expression of pro- and anti-inflammatory genes in males and females. However, the GSEA analyses (fig 1A) show that the GC haplotype has positive effects on the expression of both pro- and anti-inflammatory gene sets in both sexes - albeit with a stronger effect of proinflammatory genes in males and anti-inflammatory genes in females - but there is no formal evidence for an effect of genotype by sex. I am not sure how to test for interaction with GSEA (or if it is at all possible), so it would be good to complement the GSEA with other analyses (perhaps based on PCA?) of these data to provide more formal evidence for an effect of genotype by sex.

      It is not possible to provide formal evidence for an effect of genotype by sex in the DGE analysis/GSEA. Instead, we have tried to temper our claims about sex-specific effects (please see below for further details).

      Some more evidence of a sex-specific effect of Fcer1a genotype is actually provided by analyses of the expression of 18 immune genes in ex vivo stimulated T cells. Here, a sex-specific effect of Fcer1a genotype was found on the expression of one of 18 measured immune genes, the cytokine IL17a. However, Fcer1a is as far as I am aware not expressed by T cells, so the relevance of these results is unclear. Moreover, it is unclear why these 18 genes were analyzed one by one, rather than by some multidimensional approach (e.g. PCA).

      The Reviewer is right that Fcer1a is not generally considered to be expressed by T cells. However, the stimulation could have indirect effects. We have clarified this on lines 801-804: “Although Fcer1a is not expressed by T-cells themselves, polymorphism in this gene could be acting indirectly on T-cells through various pathways, including via cytokine signalling, following expression of Fcer1a by other cells”.

      The 18 immune genes were specially selected because they represent different immune pathways and are expected to have limited redundancy. This is why individual tests were performed (followed by a correction for multiple testing) rather than using a multidimensional approach like PCA. This is now explicitly explained in the Methods on lines 804-808: “The choice of our panel of genes was informed by…(iii) the aim of limited redundancy, with each gene representing a different immune pathway” and on lines 1031-1032: “We did not use a multidimensional approach (such as principal component analysis) because of limited redundancy in our panel of genes.” and in the Results on line 363-366: “we used an independent dataset for males and females whose spleens were stimulated with two immune agonists and assayed by Q-PCR (for a panel of 18 immune genes with limited redundancy); see Methods for how these genes were selected.”

      The second key result is that Fcer1a genotype has sex-specific effects on resistance to parasites, but this is based on a marginally significant effect as regards one of three tested pathogens.

      We acknowledge that this is a marginally significant result and have acknowledged this in the text on line 428 of the Results section.

      The third key result is that Fcer1a genotype has sex-specific effects on reproductive fitness. However, this is based on a marginally significant effect in males only, and a formal test for sex by genotype could not be performed (and since the direction of the effect was similar in females it is doubtful whether there would be an effect of sex by genotype; see fig 1C).

      Thus, while the results presented here are clearly indicative of sex-specific effects of an immune gene polymorphism, I think it is too early to actually claim such effects.

      We understand the Reviewer’s concerns about the overall lack of formal evidence for an effect of genotype by sex. As we are not able to provide this for the DGE analysis, GSEA (see above), or for the reproductive success analysis, we have tempered our claims about sex-specific effects (as suggested by the Reviewer). We have done this by removing the term “sex-specific effect” throughout the manuscript, including in the title. We now focus more heavily on the multiple effects we have shown across different phenotypic traits, and use the term “sex-dependent effects” or describe effects as “differing between sexes” sparingly, and only where necessary. These changes have been made throughout the manuscript, but more so in the introduction where the narrative has been substantially reworked to lay out this change in focus.

      Reviewer #3 (Public Review):

      This is a well-replicated study: the authors sampled over a thousand field voles (Microtus agrestis), over three years at seven different sites, with a combination of cross-sectional and longitudinal sampling. The authors compared individuals carrying the GC haplotype (<10% of the population) of the high-affinity immunoglobulin receptor gene (Fcer1). They recorded parasite infections (Babesia, Bartonella, ticks, fleas, gastrointestinal helminths), expression levels of inflammatory and immune genes using transcriptomes and quantitative PCR, and genotype and pedigree.

      We thank the Reviewer for their positive feedback, and are very glad to hear they think our work is well replicated.

      A comparison of overall gene expression between GC-carrying and all other voles indicated two sex-dependent differences, the expression in males of Il33, which is associated with antihelminthic responses, and in females of Socs3, which is implicated in regulating immune responses. One substantial issue with the authors' interpretation of these data is to attribute Il33 to the inflammatory response - this taints the rest of their interpretation (e.g., Fig 1A, see below); instead, this is a key cytokine of the antihelminthic Th2 response and its detection suggests there might be a difference in helminth infection between the haplotypes - which is consistent with the role of IgE. Therefore, the authors would need to explore further how the GC haplotype, IgE, and parasite burdens might be driving the expression of IL-33. Specifically, the authors did not control for potential confounding effects of infection, which might be expected to differ based on the rest of their data.

      We acknowledge the difficulty in grouping genes under single GO terms, and the need for more nuance when describing these classifications. No gene set is perfect and immune networks are highly complex, so the same gene can be grouped into multiple gene sets. IL33 is an example of this – it appears in the GO term GO:0050729 (positive regulation of inflammatory response) but, as the Reviewer points out, is also commonly associated with the antihelminthic Th2 response. We have edited the text in the Results (on lines 322-324 and lines 350-352) to communicate this nuance, as well as adding references to support each of these associations: “Il33 is commonly associated with anti-helminthic response [25] and Socs3 with regulation of the immune response more broadly [26]….Both Il33 and Socs3 also share an association with the inflammatory response [26,27]. While Il33 positively regulates this response (appearing in the gene set GO:0050729), Socs3 negatively regulates it (GO:0050728).” References added:

      1. Liew FY, Pitman NI, McInnes IB. Disease-associated functions of IL-33: The new kid in the IL-1 family. Nat Rev Immunol. Nature Publishing Group; 2010;10: 103–110. doi:10.1038/nri2692
      2. Carow B, Rottenberg ME. SOCS3, a major regulator of infection and inflammation. Front Immunol. 2014;5: 1–13. doi:10.3389/fimmu.2014.00058
      3. Cayrol C, Girard JP. IL-33: An alarmin cytokine with crucial roles in innate immunity, inflammation and allergy. Curr Opin Immunol. Elsevier Ltd; 2014;31: 31–37. doi:10.1016/j.coi.2014.09.004

      We have also run an extra DGE analysis including cestode burden as a covariate (cestodes being the most prominent helminth infection in terms of biomass), to check whether IL33 still emerges as a top-responding gene in males (see Appendix 1-table 4 & 5). We found that it did (in fact the signal was even stronger), indicating that the differences in Il33 expression are not being driven by differences in cestode infection. We now mention this additional analysis in the text: “Given the link between Il33 and the antihelminthic response (and more generally, IgE-mediated responses and the antihelminthic response), we repeated the DGE analysis while controlling for cestode burden, but this had little effect on our results (same top-responding immune genes; see Appendix 1—table 4 & 5), suggesting that these effects were not driven by differences in cestode infection”. This is consistent with our finding that there is no difference in macroparasite burden (including cestode burden) between individuals with and without the GC haplotype (see Appendix 1—table 11) and lines 449-451: “However, we found no effect of the haplotype (interactive or not) on the probability of infection with the other parasites in our population”.

      We have also included the following caveat in our discussion on lines 540-542: “Some of the differences in immune phenotype that we observed may also be driven by difference in parasite infection (although we accounted for cestode burden in a follow-up analysis, we cannot rule this out).”

      Among a narrow panel of immune genes measured in ex vivo settings, the authors reported elevated expression of Il17a, which is associated with inflammatory, antibacterial responses. Of note, the panel of genes they measured did not contain antihelminth effectors beyond the transcription factor GATA3, and therefore could not confirm the expression of IL-33 observed in the transcriptomes. However, the expression of IL-17a appears consistent with the elevated activity of antioxidant SOD1.

      In response to this comment, we now point out more clearly that our panel of genes did not include Il33 or Socs3, but did include other inflammatory genes including Il17a, Ifng, Il1b, Il6 and Tnfa.

      Somewhat unexpectedly given the authors' claim that in males the GC haplotype is prone to a more inflammatory immune phenotype, it had no effect on infection in that sex. However, the identity of the genes and pathways matter and the authors do not provide sufficient detail to evaluate their interpretation (GSEA analysis and Figure 1A).

      Barcode plots, such as the one we include in Figure 1A, are commonly used representations of GSEA results. In order to aid interpretation for those who are unfamiliar with barcode plots, we have included some more information in the legend of Figure 1.

      An intriguing and potentially important finding is that males carrying the GC haplotype appeared to have fewer offspring (little to no effect detected in the females). To confirm whether the effect of the haplotype is direct or mediated by other factors, it would be useful to test how other covariates, like infection, might contribute to this.

      To explore this possibility, we have run extra GLMs for both females and males which include two parasite variables: proportion of samples taken from an individual that tested positive for Babesia and proportion of samples taken from an individual that tested positive for Bartonella. We found no difference in the main results – males with the GC haplotype still have fewer offspring, suggesting that infection is not acting as a confounder.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We really appreciate the reviewers’ insightful comments, which help improve the quality of this work. We have responded to the reviewers’ questions/comments point by point in the following text and made the corresponding changes in the revised manuscript. Lastly, we added one more figure (Fig. 7) with lineage tracing experiments demonstrating the conversion of id2a+ liver ductal cells to hepatocytes in extreme hepatocyte loss condition.

      Reviewer #1 (Evidence, reproducibility and clarity):

      Mi and Andersson describe a method for creating efficient 3' knock-ins in zebrafish using a combination of end-modified dsDNA and Cas9/gRNA RNPs. They tested their method on four genetic loci where they introduced Cre recombinase endogenously, and obtained high F0 mosaicism and germline transmission. The authors included fluorescent proteins with self-cleaving peptides to determine that endogenous expression patterns are observed. By crossing their knock-in Cre lines with lineage tracing reporter lines, the authors temporally traced lineage divergences in zebrafish liver and pancreas.

      The authors should clarify the following points before I can recommend publication:

      Overall, I suggest that the authors consider paring down their figures. Throughout the paper, multiple figure panels convey the same point but for different genes. Furthermore, many construct configurations are shown that are not used in the subsequent panels. For example, the mNeonGreen only (no Cre) constructs and the EGFP constructs are largely not used in downstream experiments. The authors could pick the important constructs and show the relevant data, and summarize all their other constructs in one supplementary figure. The authors also jump around in different parts of the paper with regards to using iCre or CreERT2 and ubi:Switch or ubi:CSHm. It's not clear to me why they're doing that? It makes the paper hard to follow. For example, why use iCre - it's not temporal if I understand correctly (and I'm not sure what improved Cre is - could they reference a paper and include a small explanation) so CreERT2 seems suitable especially for their temporal lineage tracing experiments. Why not limit the description to CreERT2 in the main text/figures? Also, isn't ubi:Switch and ubi:CSHm pretty similar except the latter is nuclear mCherry due to H2B? Why not only focus on ubi:CSHm experiments? I found the paper to be unnecessarily long and think it would benefit from editing to describe the most important concepts and experiments.

      Response: Thank you for your constructive and helpful comments. We do agree that sometimes the schematic constructs seem redundant. This is because the krt4, nkx6.1, and id2a genes have similar gRNA targeting sites (all spanning over the stop codon). However, we prefer to keep these schematic constructs as we have all the statistical results showing the knock-in efficiency in the subsequent figure panels. Such layout can allow readers to make comparisons and better understand the efficacy of this method. However, combined with the comments from the second reviewer, we indeed need to add more detailed information, including the sequence and the length of the short left and right homologous arms in the schematics, to enable the readers to follow this strategy more easily. Meanwhile, we added a new supplementary figure with the sequences of the long left and right homologous arms, as well as the genetic cassettes/point mutations for krt92 knock-in (Figure EV1).

      As for the color switch lines we used, we appreciate your comments and replaced Fig. 5E-G with new fluorescent images using zebrafish larvae carrying the ubb:CSHm transgene. For most of the lineage tracing experiments in this study, we used Tg(ubb:CSHm) as the H2BmCherry is more stable, located in the nucleus, and the fluorescence intensity is stronger than in Tg(ubb:Switch). However, for the lineage tracing experiments in the liver injury model, we believe that Tg(ubb:switch) is a better option than Tg(ubb:CSHm). In the absence of a hepatocyte specific far-red reporter line, we can distinguish the hepatocytes derived from the id2a+ origin using the Tg(ubb:Switch) line, as the cells with Cre recombination express mCherry in the cytoplasm; i.e. we can tell the cell types based on the cell morphology in combination with the ductal anti-vasnb staining. This strategy was previously used by Dr. Donghun Shin’s group in their 2014 Gastroenterology paper (Figure 4B, DOI: 10.1053/j.gastro.2013.10.019). Therefore, we still kept the ubb:switch in the Fig. 1F schematic, and we have elaborated on why we chose Tg(ubb:switch) line for the id2a+ cell conversion experiments in Fig. 7 and Figure EV14.

      The iCre we used is a codon-improved Cre (iCre). The original cDNA sequence was from pDIRE (Addgene plasmid #26745; provided by Dr. Rolf Zeller, University of Basel) (Osterwalder et al., 2010).

      At the beginning of this project, we actually didn’t know whether there were any differences between iCre and CreERT2 in labelling of the cells of interest. Here, using both the iCre and CreERT2 lines, we for the first time, formally show the developmental lineage path of nkx6.1-expressing cells in the zebrafish pancreas. Our data suggested that the early nkx6.1-expressing cells are multipotent pancreatic progenitors giving rise to all three major cell types in the pancreas (endocrine, ductal and acinar cells, shown by nkx6.1 knock-in iCre) and gradually the nkx6.1-expressing cells become restricted in the ductal/endocrine lineages (shown by the nkx6.1 knock-in CreERT2 treated with 4-OHT at different timepoints). In addition, we also aim to use these knock-in lines for multiple studies in which we need to perform many quantitative experiments. As expected, we are unable to reach 100% labeling using the knock-in CreERT2 lines, even if we treated the larvae with very high concentration of 4-OHT over a long period of time. This means that the CreERT2 induced recombination will introduce more variation for quantitative experiments (for instance, the number of regenerated beta-cells from the ductal origin). As we were quite confident with the efficiency of this knock-in strategy, we decided to make both iCre and CreERT2 lines in krt4, nkx6.1, and id2a locus and just observe how they performed. We often use iCre knock-in lines for lineage tracing experiments, because the iCre lines reach near 100% labeling efficiency. Such iCre lines are particularly useful if they only label terminally differentiated cell types. Thus, the near 100% labeling efficiency in iCre lines can be of great help for initial experiments, which later can be confirmed by temporal labeling using CreERT2 lines.

      1. Could the authors describe the purpose of the 5'AmC6 modification earlier in the paper? I didn't see much text about it until the discussion. It seems that the speculation is that it provides end protection and prevents degradation (based on in vitro studies in human). This should be inserted into the introduction as a reader might be wondering about this and won't find an answer until near the end. Also, is this the first in vivo use of this modification for knock-ins? If so, that should be highlighted in the text.

      Response: This is a helpful comment. In the revised manuscript, we elaborate more on why we chose 5'AmC6 modification in our donors. To our knowledge, this is the first time this 5’ modification is used in vivo, however, bulky 5’modification (5'Biotin - 5x phosphorothioate bonds) has been used in medaka (DOI: https://doi.org/10.7554/eLife.39468.001, 2018 Elife, as we previously referenced). The cell division rate is much faster in zebrafish embryos compared with medaka embryos during early development, so we speculate that such modification might be of more importance in zebrafish to achieve early integration. Another advantage is that the 5'AmC6 modification is commercially available, allowing researchers to prepare the donor dsDNA in a handy fashion. We have now expanded on these details and advantages in the introduction.

      1. The authors do not show any sequencing data confirming that their insert was knocked-in as designed with no disruption to the immediate upstream and downstream endogenous sequences. Can they sequence the loci to confirm?

      Response: This is indeed a question we frequently get – thank you for making us relay this information more clearly! We have put the raw Sanger sequencing data in a public repository (mentioned in the Data Availability section), and included the sequencing primers in the method paragraph. Now we also refer to this data in the discussion section in conjunction to highlighting that the integrations were correctly placed in the loci. If you think there are better ways to show the sequencing results, please let us know.

      1. I found the descriptions of the long and short HA to be confusing when describing the results, especially since the first tested gene krt92 only has long and all subsequent ones are short. The discussion made it more clear that short HA is more efficient and applicable when gRNAs span the stop codon. Perhaps that wasn't possible with krt92, but the authors could prevent the confusion by clearly stating the design requirements of long and short HA and that they wanted to test which is more efficient before starting to describe the data. I also didn't see a description of what the length difference between long and short HA is? How short is short HA?

      Response: This is a great question that is well worth discussing. In the revised manuscript, we changed the order in which the parts are described, with nkx6.1 knock-in in front of krt4 knock-in. Here we explain why we would like to do that:

      At the beginning of this project, we did not know if the 5’ modified dsDNA could be an effective donor. To test our hypothesis, we chose the krt92 gene as our first target, as this is a keratin protein and expressed in the epithelial cells. We can easily detect the fluorescence in the epithelial cells (most notably in the skin), which allow us to sort the F0 mosaic embryos with high percentage of integration. Notably, from our experience, the most difficult part of the knock-in method is the sorting step (usually performed during 1-3 dpf). This is because the fluorescence signal is highly dependent on the endogenous gene expression level and is usually dimmer with an overall integration efficiency that is lower compared to canonical transgenesis. Therefore, we thought that targeting an epithelial cell marker would be informative and help us to evaluate the validity and reproducibility of the method. If it worked, then we could move on targeting genes expressing in more restricted tissues or cell types. For krt92 gene, the gRNA targets the region upstream of the stop codon. To prevent the cleavage of the donor template, we had to introduce several point mutations and at the same time keep the amino acid sequence intact. However, such mutations can restrict the knock-in and lower the integration efficiency when using shorter arms (due to the sequence mismatch).

      After we managed to make the krt92 knock-in, our next question was, what about using a gRNA spanning over the stop codon region? In this way, we don’t need to introduce point mutations on neither the left nor the right homologous arm. Also, for the purpose of our biological study, the nkx6.1 were on top of our gene list for lineage tracing experiments and we luckily identified that there is very good gRNA targeting this locus. After we successfully made the nkx6.1 knock-in, we were thinking that we could simplify the protocol even further, i.e. switching to short homologous arms so that we can prepare the donor by a one-step PCR instead of making complicated constructs. We tested that hypothesis in nkx6.1, krt4, and id2a sites and obtained very promising efficiency. Also, we did some further testing with dsDNA without the 5’ modifications and showed that the 5’ modifications indeed greatly increased integration efficiency. Therefore, although the short homologous arm method is a highlight here, we also point out that it was not planned from the beginning. In the revised manuscripts, we want to convey our method in a logical way and show how we modify the method in a step-by-step fashion.

      Moreover, with regards to the comments from the second reviewer, we now added the length of the homologous arms as well as the mutation site on the schematics. We chose short homologous arm because in previous literature it was suggested that short homologous arms (36-48 bp, which we now write out in both the results and the methods) can promote microhomology-mediated end joining (doi: 10.1096/fj.201800077RR). We also noticed that the recent Geneweld method (DOI: 10.7554/eLife.53968) also adheres to a similar length for homology mediated integration. In this study, HAs even shorter than 36 bp also perform well.

      1. The authors state that they could not use in situs to confirm krt92 endogenous and knock-in expression overlap, but rather say that they match based on data from an intestine scRNA-seq dataset. Can they elaborate on this? Which clusters/cell types show overlap? Furthermore, is there any krt92:GFP transgenic line that can be used as a reference for expression as well? This point is also applicable for krt4 described in Fig.2

      Response: We appreciated this point. In the beginning, we contacted Molecular Instruments to synthesize krt92 HCR3.0 in situ hybridization probes. However, the technical staff there told us that they are unable to make specific probes due to high sequence similarity to other keratin protein families. We can see that the sequence similarity mostly occurs in the middle of krt92 genes, and the HCR3.0 probes rely on a probe set (preferably 20-30 probes with different sequences) to target the mRNA.

      The scRNA-seq data that we referenced are from 10X platform, which is based on a 3’enrichment methodology. The reads mapping to krt92 genes are mostly located on the 3’ end. This is good as there is much less similarity to other cytoskeleton genes in the 3’ end of the gene. Unfortunately, there is no krt92 transgenic lines available, so we relied on the single-cell data to correlate expression patterns in this case.

      There are two zebrafish intestine single-cell data sets available, with the following links:

      (1): https://singlecell.broadinstitute.org/single_cell/study/SCP1675/zebrafish-intestinal-epithelial-cells-wt-and-fxr?genes=krt92#study-visualize

      (2): https://singlecell.broadinstitute.org/single_cell/study/SCP1623/zebrafish-intestine-conventional-and-germ-free-conditions?genes=krt92#study-visualize

      We can see that krt92 is widely expressed in different types of intestinal epithelial cells (absorptive enterocytes, secretory enteroendocrine/goblet cells and ionocyte).

      For the krt4 gene, we now added the HCR3.0 in situ hybridization and immunofluorescence for both krt4 knock-in EGFP-t2a-CreERT2 lines and the Tg(krt4:EGFP-rpl10a) transgenic line (a construct from Anna Huttenlocher, https://www.addgene.org/128839/, which has been widely used to label skin cells). The results are shown in Figure EV9. We show that krt4 has very high expression in the intestinal bulb and hindgut based on the HCR3.0 in situ. The Immunofluorescence of the krt4 knock-in fully recapitulate the krt4 expression pattern in the intestine, while there is almost no fluorescence signal in Tg(krt4:EGFP-Mmu.Rpl10a). We believe this is another advantage of using the knock-in method, over transgenics, for cellular labeling and lineage tracing. Classical transgenics often rely on short promoters of the proximal/enhancer region upstream of ATG with various length (arbitrarily or based on clues from motif analysis/DNA methylation sites). However, different tissues/cell types tend to use different cis-_regulatory elements and the chromatin structure/enhancer-promoter loops might differ dramatically among different cell types. It is hard to predict the exact region of the regulatory sequences that is sufficient for driving the gene expression in a certain cell type. Thus, such reasoning consolidates with that our knock-in lines recapitulate the endogenous _krt4 gene expression. Therefore, we believe that the knock-in based genetic lineage tracing will become the standard in the zebrafish field, as theoretically it avoids both the lack of relevant expression and leakage problems of transgenics.

      1. I think Figure 2A needs the dotted lines on the last construct to be fixed (points to p2A)

      Response: Thank you for noticing! This was due to a bug in the IBS software, and we changed it manually using Adobe Illustrator in the revised manuscript.

      1. There are a few instances where the authors describe performing 4-OHT treatment for long period (e.g. over a 20 hour or 24 hour period). Is fresh 4-OHT added after a certain amount of time or is it a one-time addition? Is such long periods of 4-OHT required or has maximal recombination already occurred within a few hours after addition of 4-OHT?

      Response: For 4-OHT treatment, we referred to the method described by Dr. Christian Mosimann (DOI: 10.1371/journal.pone.0152989). We actually tried different conditions (dosage, duration, refresh or not). This is particularly important for the knock-in CreERT lines because the level of CreERT2 is highly dependent upon the endogenous gene expression level. In our case, the nkx6.1 and id2a are transcriptional regulators and relatively lowly expressed compared with structural proteins. We maximized the labeling efficiency by using the highest concentration and longest duration suggested for 4-OHT treatment. The 4-OHT was stored in -20 ℃ and it would become less effective after 30 days of storage. Therefore, we first incubated the 4-OHT in 65 ℃ for 10 min (as recommended by Dr. Christian Mosimann) in order to convert it to a bioactive form. Next, we treated the zebrafish embryos with 4-OHT using a final concentration of 20 μM for 24 hours. We didn’t refresh the 4-OHT since there was no significant difference compared with a one-time addition. Moreover, using higher dosage or longer treatment time can lead to less survival and increased deformity rate. 20 μM 4-OHT treatment for shorter time periods (6 or 12 hours) can cause high labeling variability (some larvae have good labeling while others not). In the end, after several rounds of experiments, we settled on 20 μM 4-OHT treatment for 24 hours as it can reach the highest labeling efficiency, lower variability, and good survival.

      1. For Figures 4-6 where confocal images of lineage tracing experiments are shown, there is no indication of how many times the experiments were repeated, how many sections were images, how many animals used, how many cells counted. All of this information should be included in the figure legends and plots should be added showing quantification and statistical analysis (where appropriate).

      Response: The reviewer makes a good point and we have now added the number of larvae used and statistical results for the quantitative experiments. The quantification of experiments in Figure 3E-H (originally Figure 4E-H) are shown in Figure EV6D using box/dotplot. We randomly selected 3 secondary islets of different sizes (large, middle, and small) from each juvenile fish (n=5) and pooled the number of mCherry/ins double positive cells and ins positive cells together. The quantification of the lineage-tracing efficiency in the experiments in Figure 6 are shown in Figure EV13.

      1. Figure 4 C, C' - I'm not sure what to look for. Is the message that there is no Cherry positive cells that are vasnb negative when labelling is done at 8 somite? But the vasnb positive cells that are also Cherry positive remain? The vasnb staining seems much weaker/harder to see in C C' compared to B, B'. As mentioned above, these data should be quantified and statistical significance indicated.

      Response: Thank you for pointing this out; the second reviewer made a similar point. We redid the experiments using zebrafish larvae carrying the ptf1α:EGFP transgene to indicate the acinar cells (Figure 3B-D, Figure EV4G). We also quantified the results and performed statistical testing.

      1. I recommend the authors include a short section in the discussion comparing the efficiency of their method to other knock-in strategies used in zebrafish. This is an important claim of the paper yet it is not clear how much better it is (if at all) in terms of frequency of F0 mosaicism and identification of founders relative to other methods. I do appreciate the relative simplicity of the molecular steps of construct design/generation.

      Response: This is indeed important. It is also tricky since we are unable to make head-to-head comparisons between different methods as we are targeting different genetic loci and do not have the other methods up and running in our lab. However, the general comparison is based on the statistics shown in the hallmark papers describing these other methods, regardless of which genes were selected for targeting. In the discussion, we added a list of points that are novel/improved with our method versus previous ones, including that: 1) we simplify the knock-in methodology circumventing complicated molecular cloning; 2) we have very high germline transmission rate, which means that one morning of injection is often enough to get a founder; and the expression of fluorescence proteins avoids tedious work in identifying founders, which also saves a lot of space in the fish facility; 3) our lines can be applied for multiple utilities; 4) the method does not disrupt the endogenous gene product. We believe this is critical for the field of developmental biology, regenerative medicine, and disease modeling in zebrafish – and perhaps a similar 3’ knock-in based lineage-tracing method can become commonly used to delineate the cell differentiation and plasticity during homeostatic and diseased conditions in additional organisms.

      Reviewer #1 (Significance):

      Overall, the study contributes a new knock-in strategy in zebrafish that appears to be more user-friendly and results in high germline transmission. The authors also identify nkx6.1+ ductal cells as progenitors of endocrine cells in the pancreas highlighting the biological applications of their method. I think this study represents an important advancement in zebrafish genetics and will have future impact in lineage tracing during development, regeneration, and disease.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary:

      Here, the authors present a strategy where they performed knock-in at the level of the STOP codon, taking care of not perturbing the coding region. They integrate cassettes coding for fluorescence protein and Cre recombinase, which are separated from the endogenous gene and each other by two self-cleavable peptides.

      The cassettes are done by PCR with primers with 5' AmC6 modifications and they test short (36 to 46 bp) or long homologous arms (~950bp). For nkx6.1 gene, they observed a dramatic increase of recombination efficiency when injecting the donors with short Homology arms compared to long arms suggesting that short arms could be used. Indeed, short arms used with krt4 and id2a allow them to obtain K.I lines.

      The techniques described here look promising. Indeed, even if the proportion of F0 showing adequate reporter expression is low (usually about 2%), the percentages of founders among these mosaic F0 were quite high (between 50% and 100%). And this is the most important aspect as it is usually the most time-consuming aspect of the work.

      Major comment:

      The authors claim that the knock-in lines can precisely reflect the endogenous gene expression, as visualized by optional fluorescent proteins. But are the authors sure that the integration of the cassettes coding for fluorescence protein and Cre recombinase, which are separated from the endogenous gene and each other by two self-cleavable peptides, will not affect the level of expression of the targeted genes . Indeed, it has been shown that sometimes self-cleavable peptides could affect the expression of the genes of the cassette like for example in this reference ([https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8034980]. Therefore it is important that the authors check whether the cassette affect the level of expression of the targeted gene if they want to claim that the knock-in lines precisely reflect the endogenous gene expression.

      Response: Thank you for your insightful comments. With regards to the endogenous gene expression, we now use qPCR for further validation. We added the qPCR results to the supplement material (Figure EV15) in the revised manuscript. In brief, we pooled 4 larvae in one tube per biological replicate and have 4 biological replicates for each knock-in line. We didn’t see a significant change in the endogenous expression for any gene. In addition, we have grown up homozygous knock-in lines to adulthood and they are fertile without any overt phenotype.

      The highlighted reference is dealing with a cardiomyocyte specific transgenic line, and we assume figure 3-Supplementary figure 1 is what the reviewer is referring to. The altered level of erbb2 expression might be due to the experimental conditions (no treatment or 3 days post treatment). Also, it is possible multiple transgenic insertions occur, as well as gene silencing at some insertion sites. However, such issues would not present, or very limited, with knock-in methods.

      Minor comments:

      General points:

      I believed that the authors should improve the presentation of their data. Indeed, based on what they present, it would be impossible for me to reproduce their technique. Indeed, it is not clear at all how they design the short and long arm, where they are exactly located, which mutations they have done (for fig1), where is located the guide RNA compared to the STOP codon and the HA arms. Graphics that exactly place all these sequences are absolutely required to understand the strategy used and should be placed in figure 1, 2, 3 and 4.

      Response: Thank you for these comments. In the revised version, we added the sequence information of the short homologous arms in each of the schematics. As for the krt92 gene, we added the sequence information in the first supplement results (Figure EV1) with the genetic cassettes and point mutation information. We list all the primer information in the methods. Also, we have uploaded our vector templates in the public repository (as listed in the Data availability section). Lastly, we added a key resource table in the supplement file with all the detailed information of reagents for the ease of reproducibility (including all the primers sequences used). We are also willing to share our constructs with the scientific community upon request.

      Specific points:

      Introduction:

      "In zebrafish, the NHEJ-mediated methods have been intensively investigated in 5'knock-in upstream of ATG using donor plasmid containing in vivo linearization site flanking the insertion sequences (11,12,17-20). The 3' knock-in method has also been examined using circular plasmid as the donor with either long or short homologous arms (HAs) flanked by in vivo linearization sites (14, 21-23). Recently, intron-based and exon-based knock-in approaches have remarkably expanded the knock-in toolbox by targeting genetic loci beyond the 5' or 3' end (8-10,13,24-26)."<br /> This part should be explained better in order that the readers could really understand the differences between these old studies and this new one. And really insist on what is the novelty of their technique.

      Response: Good points. In the revised version, we elaborated more on the previous discoveries, the major challenges, the knowledge gap in zebrafish knock-in methodology, and what is novel and improved with our new technique. Please, see clarifications and the expanded text in both the introduction and discussion.

      Results:

      Page 4: To my opinion, the first paragraph should be removed and the technique directly explained based on krt92 strategy as this paragraph does not allow to understand the technique. As indicated above, figure 1 should indicate more clearly the location of the long arms and which mutations they have done and where is located the guide RNA.

      Figure 1G: The expression in the skin is far from obvious and the image should be improved (for example with some inset).

      Response: Thank you for the comments. We added a new supplementary figure (Figure EV1) and show the sequences of left and right homologous arms, the genetic cassettes, as well as the point mutations with different background color highlight. We added the insets to show the magnified regions of interest. Also, we added the images from the fluorescent microscope used for sorting, to show the EGFP signals in live zebrafish embryos (Figure EV2D and Figure EV8D).

      Figure 3E: The authors say that "cells expressing nkx6.1 (displayed by the green fluorescence) were located on the ventral side of the spinal cord whereas H2BmCherry positive cells, which include all the progenies of nkx6.1+ cells after the iCre recombination, resided in both the ventral and dorsal parts of spinal cord". This differential expression in the spinal cord is not obvious and a more closer view should be provided.

      Response: Thank you for the comment. First, we changed the order and now describe all nkx6.1 content in Figure 2 and 3 and the krt4 content in Figure 4. We added insets to show the magnified regions and better display the expression pattern of the two fluorescence proteins in Figure 2E-G. One can now clearly see from the magnified insets that the green signals driven by the endogenous nkx6.1 gene are present in the ventral part of the spinal cord, while the red signals are present in both the ventral and the dorsal side of the spinal cord.

      Fig S4H: The authors say that" using lineage tracing, we could trace back all three major cell types in the pancreas (acinar, ductal and endocrine cells) to nkx6.1 lineage (Figure 3H-H',Supplementary Figure S4G, H)". While this is obvious for endocrine, the colocalisation with ela3l:GFP is not obvious and the figure should be improved.

      Response: This is a very good point, and the first reviewer gave similar suggestions. In the revised version (shown in Figure EV4H and I), we added the insets to show the magnified regions to better display the expression pattern of two fluorescence proteins. The ela3l reporter line is using a short promoter to drive the expression of H2B-EGFP (doi: 10.1242/dmm.026633). However, this short promoter cannot reach 100% labeling of acinar cells, so we also use the ptf1α:EGFP transgene for further validation (new Figure EV4G). Both transgenic reporter lines showed many EGFP and mCherry double-positive cells, indicating that these acinar cells are derived from a nkx6.1-expressing origin. Here we did not use the anti-GFP antibody, as our color switch lines contains CFP and anti-GFP antibody can also recognize CFP. However, the GFP signal is strong enough to show the expression. We hope the additional experiments and insets clarifies this point.

      Page 8: the authors say that "The immunostaining at 6 dpf showed that both intrapancreatic ductal cells and a portion of acinar cells can be lineage traced when the 4-OHT treatment started at the 6 somite stage (Figure 4B and B'). The identification of the acinar cells has been done based on the absence of the ductal marker vasnb. To trace efficiently the acinar cells, this should be done with an acinar marker.

      Response: Another good point also mentioned by reviewer one. We redid the analyses using zebrafish larvae containing the ptf1α:EGFP transgene to indicate the acinar cells and the co-expression pattern with the lineage-tracing (the data is shown in new Figure 3B-D).

      Reviewer #2 (Significance):

      I do not have enough expertise in the KI field to evaluate whether this strategy is really novel and as mentioned above, the authors should better explain what is really the novelty of their strategy.

      Response: In our answers to the comments of the first reviewer, we elaborated more on the points that are novel/improved with our method vs previous methods, as reiterated here:

      “…including that: 1) we simplify the knock-in methodology circumventing complicated molecular cloning; 2) we have very high germline transmission rate, which means that one morning of injection is often enough to get a founder; and the expression of fluorescence proteins avoids tedious work in identifying founders, which also saves a lot of space in the fish facility; 3) our lines can be applied for multiple utilities; 4) the method does not disrupt the endogenous gene product.”

      Moreover, the first reviewer asked about the difference between the krt4 knock-in and krt4 transgenics, and based on the in situ data, we showed that our krt4 knock-in can fully recapitulate the endogenous gene expression, while the krt4 transgenics can hardly label the intestinal bulb and hindgut. This might be due to that different tissues/cell types may depend on different _cis-_regulatory elements to drive the gene expression. The chromatin structure and the enhancer/promoter loop might also differ dramatically among different tissues. Therefore, the transgenics might be useful for one type of cells, while they might be not useful at all for other cell types. In the future, we believe that, similar to the mouse field, the 3’ knock-in based lineage tracing methods might become the standard method in the zebrafish field, to delineate cellular differentiation and plasticity during homeostatic and diseased conditions.

    1. Now, Americans! I ask you candidly, was your sufferings under Great Britain, one hundredth part as cruel and tyranical as you have rendered ours under you? Some of you, no doubt, believe that we will never throw off your murderous government and “provide new guards for our future security.” If Satan has made you believe it, will he not deceive you? Do the whites say, I being a black man, ought to be humble, which I readily admit? I ask them, ought they not to be as humble as I? or do they think that they can measure arms with Jehovah? Will not the Lord yet humble them? or will not these very coloured people whom they now treat worse than brutes, yet under God, humble them low down enough? Some of the whites are ignorant enough to tell us that we ought to be submissive to them, that they may keep their feet on our throats. And if we do not submit to be beaten to death by them, we are bad creatures and of course must be damned, &c. If any man wishes to hear this doctrine openly preached to us by the American preachers, let him go into the Southern and Western sections of this country—I do not speak from hear say—what I have written, is what I have seen and heard myself. No man may think that my book is made up of conjecture— I have travelled and observed nearly the whole of those things myself, and what little I did not get by my own observation, I received from those among the whites and blacks, in whom the greatest confidence may be placed.

      He urged enslaved people to fight back against their oppressors and to put an end to slavery.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the Reviewers for their comments. Below we have the Reviewers’ comments and our responses.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this work, the authors claim that their machine learning approach can be combined with a biophysical model to predictably engineer sensors. The concept is interesting, but there are many issues that must be addressed before considering its publication.

      1. It is surprising that their citations are too biased. They keep citing nonrelevant papers from several groups while omitting many key papers regarding genetic sensors and circuits in the field. Some can be justified (e.g., Voigt lab's reports), but others (e.g., reports on dynamic controllers too often) would not be relevant.

      There are hundreds (possibly thousands) of papers that have been published on genetic sensors. Most of those papers report only qualitative results (e.g., genetic sensor implemented in a new host organism or demonstrated to sense a ligand of interest).

      The purpose of this manuscript is to demonstrate methods for quantitative engineering of genetic sensors. Specifically, the manuscript is focused on quantitative tuning of the genetic sensor dose-response curve. So, in deciding which previous papers to cite, we chose several review articles (to cover the many, many qualitative results), any previous papers we could find that reported strategies for tuning the dose-response curve of genetic sensors (the Voigt lab’s reports and others), and any papers we could find that discussed reasons/applications for quantitative tuning of a genetic sensor dose-response curve (e.g., dynamic controllers).

      We added a new paragraph to the beginning of the Results section to explain this focus on quantitative tuning (and to clearly state which statistic we use for assessing accuracy – see response to next comment; lines 72-83 in the revised manuscript).

      We would also like to add more relevant citations as suggested by the reviewer, but that is difficult based on the reviewer’s comment, which just indicates that we have omitted many “key” papers. For the central focus of this manuscript, we think the “key” papers are those that describe methods to tune the dose-response curve of genetic sensors, and we have done our best to cite all of those that we could find. So, we ask the reviewer to please suggest some specific papers that they consider to be “key” that we should cite, or at least some more specific definition of what they think constitutes a “key” paper that should be cited.

      It is very unclear which statistical analysis has been done for their work.

      The main statistical metric used in the manuscript is the fold-accuracy. The fold-accuracy was defined in the previous version of the manuscript, but we agree that it could have been stated more clearly. So, we have moved the definition of fold-accuracy to the (new) first paragraph of the Results section, and identified it as “…the primary statistic we will use to assess different methods.” (line 77 of the revised manuscript)

      There are many practical sensors for real applications, but their work focuses on IPTG-responsive sensors or circuits. I was wondering whether this work would have significant impacts on the field or the advancement of knowledge.

      Similarly, it is questionable that their approach is generalizable.

      Currently, there is only one published dataset that can be used for the methods described in this manuscript, for IPTG-responsive LacI variants.

      However, previous work (cited in our manuscript) has shown that directed evolution can be used to qualitatively “improve” a wide range of genetic sensors beyond LacI. Furthermore, some of those previous studies used a single round of mutagenesis and libraries with diversity similar to the size of the LacI dataset (104 to 105 variants). Based on that, we think it is highly likely that our in silico selection approach will generalize to other sensor proteins.

      With regard to the ML methods used in our manuscript, we showed in the initial publication describing the LANTERN method that the approach is generalizable to different types of proteins and protein functions (LacI sensor protein, GFP fluorescence protein, SARS Cov-2 spike-binding protein). So, we don’t see any reason to question the generalizability of that approach to other sensor proteins.

      We have edited the Discussion section of the manuscript to include these points regarding the generalizability of our approach (lines 340-350 in the revised manuscript).

      Due to the biased literature review, it is unclear to me whether this work is novel.

      The majority of relevant literature on genetic sensor engineering is qualitative in nature and is not particularly comparable to the work here. We have tried to emphasize this in the introduction and discussion. We have searched the relevant literature extensively, and we have only found a small number of papers that describe quantitative methods to tune the dose-response of genetic sensors. Furthermore, there are only a few that contain any kind of quantitative assessment of that tuning. We have cited all of those papers and included specific discussions and comparisons between them and our results.

      If the reviewer knows of any specific papers that we missed we would be happy to include them in our literature review.

      I am unsure whether their correlation is sufficiently high.

      This comment is too vague to address.

      Again, we ask the reviewer for more specific information: What “correlation” are you referring to? And what is “sufficiently high”?

      We have provided statistics on the accuracy of our methods, as discussed above.

      Is EC50 the only important parameter? Or is it really relevant for real applications where the expression levels would change due to RBS changes, context effects, metabolic burdens, circuit topologies, etc.?

      EC50 is not the only important parameter. That is why we also demonstrate the ability to quantitatively tune other aspects of the dose-response (e.g., G∞).

      In any real application of genetic sensors, the EC50 will have to be engineered to have a quantitatively specified value (within some tolerance). So, yes, it really is relevant.

      There is an important question about the effect of context however, and perhaps that is what the reviewer is really asking: If we engineer a genetic sensor that has a given EC50 in the context used for the large-scale measurement, will we be able to use that genetic sensor in a different context where, because of the change in context, its EC50 may be different?

      This is one of the outstanding challenges in the field, to be able to predict the effect of a change in context. But for genetic sensors, there are several previous publications that demonstrate promising routes to quantitatively predict the effect of context on genetic sensor function.

      So, we have added a paragraph to the Discussion section addressing this point and citing the relevant previous publications (lines 315-339 in the revised manuscript).

      There are many reports on mutations or part-variants and their impacts on circuit behaviors. Those papers have not been cited. This is another omission.

      As discussed in response to Comment 1, above, there are many hundreds of such papers. It would not be practical or appropriate for us to cite all of them. However, there are only a few that contain any kind of quantitative assessment of the predictability of mutational effects or of efforts to use mutations to engineer sensors to meet a quantitative specification. We have done our best to cite and discuss all of those. Again, if the reviewer knows of any specific additional papers that we should cite, please tell us.

      CROSS-CONSULTATION COMMENTS

      In general, I agree with the other reviewer. Its significance would be too incremental.

      Reviewer #1 (Significance (Required)):

      See above.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This paper proposes two approaches for forward-design of genetically encoded biosensors. Both methods rely on a large scale dataset published earlier by the authors in Mol Syst Biol, containing ~65k lacI sequences and their measured dose response curves. One approach, termed 'in silico selection', is proposed as a way to find variants of interest according to phenotypic traits such as the dynamic range and IC50 of the biosensor dose-response curve. The second approach uses machine learning to regress the dynamic range, IC50 and others from the lacI sequences themselves - the ML regressor can then be used to predict phenotypes of new variants not present in the original dataset. The ML algorithm has been published by the same authors in a recent PNAS paper.

      The manuscript has serious flaws and seems too preliminary/incremental:

      1) The 'in silico selection' method corresponds to a simple lookup table. This is a perfectly acceptable method for sequence design, but the attempt to portray this as a new method or 'multiobjective optimization' is highly misleading. Also, the analogy between 'in silico selection' and darwinian evolution or directed evolution are inappropriate, because both latter approaches rely on iterative selection through fitness optimization and randomization of variants. The 'in silico selection' approach in contrast is one-shot and does not use randomization.

      We agree with some of the reviewer’s points here. In making the analogy to directed evolution, we wanted to give the reader a connection to something familiar, but the reviewer is correct that the analogy is imperfect. The “lookup table” description is much better, and probably a familiar idea to most readers. So, we edited the relevant paragraph to describe in vitro selection as the use of the large-scale dataset as a lookup table instead of making the analogy to directed evolution. We thank the reviewer for this suggestion.

      However, we disagree with the reviewer with regard to “multi-objective optimization.” We clearly demonstrate in Figures 3 and 4 that we can simultaneously tune multiple aspects the dose-response curve to meet quantitative specifications. If the reviewer is aware of any previous publications that they think provide a better demonstration of multi-objective engineering of biological function, please let us know; we would like to cite those papers appropriately.

      Also, the reviewer is incorrect in stating that our in silico selection approach does not use randomization. The randomization occurs as part of the large-scale measurement. This is clearly stated in the second paragraph of the Results section.

      2) The ML approach is a minor extension to what they already published in PNAS 2022. One could imagine an extra figure in that paper would be able to contain all ML results in this new manuscript. A couple of comments about the actual method: a) it seems unlikely to work on sequences of lengths relevant to applications, because it relies on gaussian processes that are known to scale poorly in high dimensions. b) The notion of 'interpretable ML' is misleading and quite different to what people in interpretable AI understand. Moreover, the connection between the three latent variables, which provide the 'interpretability', and biophysical models seems to come from their earlier PNAS work and this specific dataset, but there is no indication that such connection exists in other cases. Although this is somewhat acknowledged in L192-195, the text tends to portray the connection with biophysical models as something generalizable.

      The ML results presented in this manuscript are specifically aimed to quantitatively assess the accuracy of the ML predictions for the parameters of a genetic sensor dose-response curve. So, we think those results belong in the current manuscript.

      The reviewer’s comment on Gaussian processes and dimensionality is clearly contradicted by the results presented in this manuscript and in our previous publication describing the ML method: The ML method works quite well for “sequences of lengths relevant to applications,” including LacI (360 amino acids), the SARS-Cov2 receptor binding domain (200 amino acids), and GFP (250 amino acids). The reason for this is that the Gaussian process is only applied on the low-dimensional latent space learned by the ML method.

      The reviewer’s comment on “interpretable ML” is not relevant to this manuscript but is instead a criticism aimed at our previous publication on the ML method.

      The generalizability of this approach is an open question. The same could be said for most other publications describing new methods, since most of those publications include demonstrations with only a small number of specific systems. After re-reading the relevant portions of the manuscript, we disagree with the reviewer’s suggestion that we have exaggerated the potential generalizability of the approach. For example, in the last sentence of the Results paragraph, we state, “Although imperfect, this initial test of linking an interpretable, data-driven ML model to a biophysical model to engineer genetic sensors shows promise…” And, in the Discussion section, “The use of interpretable ML modeling in conjunction with a biophysical model also has the potential to become a useful engineering approach… But more rigorous methods would be needed…”

      Other comments:

      3) There are quite a few reduntant figures, eg Figure 1 contains too many heatmaps of the same variables. Fig 2B and C are redundant as the contain the same information. Altogether figures feel bloated and could have been compressed much more.

      We disagree. The sub-panels of Figure 1 show different 2-D projections of the multi-dimensional data that are relevant to specific aspects of the results in Figs. 2-4.

      Admittedly, Fig 2C shows the residuals from Fig 2B, which is in some sense the “same information.” But it is quite common, in papers focused on quantitative results, to have one sub-panel showing a comparison between predicted and actual and a second sub-panel showing the residuals.

      4) Fig 2A and 3A have problems: the blue & orange lines (Fig 2A) and blue & green lines (Fig 3A) have a kink just before the second dot from the left. Such kinks cannot have been produced by a Hill function. This kind of errors cast doubt on the overall legitimacy and reproducibility of the results.

      The kinks in the curves are a consequence of the use of the “symmetrical log” scale on the x-axis, which allows the zero-IPTG and non-zero-IPTG data to be shown on the same plot while showing the non-zero-IPTG data on a logarithmic scale. That symmetrical log axis uses a log scale for large x values, and a linear scale for smaller x values. The kink appears at the transition between the log and linear scales. We have re-plotted all of the figures showing dose-response curves to move the log-linear transition to overlap with the axis break.

      CROSS-CONSULTATION COMMENTS

      I agree with the other reviewer's comments, particularly on the lack of statistical analyses.

      See our response to Reviewers #1, comment 2, above.

      Reviewer #2 (Significance (Required)):

      The work addresses a timely subject but is too incremental.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01490

      Corresponding author(s): Cariboni, Anna; Howard, Sasha R

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      • *

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      • *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The current manuscript in question is well written and of general interest to the reproductive neuroendocrinology field. Overall it is a well written and substantiated.

      Reply: We thank the reviewer for his/her positive and supportive comments on our manuscript.

      The primary problem with the paper is the data derived from the microarray. While the experimental design included replicates (n = 3), although weak, the actual microarray data was based on a single data point. A major weakness. This experiment should be repeated using more up-to-date approaches such as RNA-seq or left out of the manuscript, because this data set is compromised due to the data collection procedure.

      Reply: We thank the Reviewer for raising these points, which we wish to clarify. We respectfully disagree that the microarray data generated in this study is not valuable. The transcriptomic analysis of immortalized cells was performed on 3 biological replicates (specifically, RNA was extracted from n=3 samples, obtained from each cell line at 3 different passages) and run as 3 independent samples (for a total of 6, 3 for GN11 cells and 3 for GT1-7 cells). For the primary embryonic GFP-GnRH neurons, given the difficulty of isolating with FACS a sufficient number of GFP+ cells from each embryo due their very small number (around 1000 GnRH neurons/head), we had to pool sorted cells from 2-3 embryos for each time-point. Thus, although the primary cell microarrays were run on one sample for each time point, the RNA was not derived from one embryo only, but from at least 2/3 embryos.

      Nevertheless, to overcome the issue of low number of replicates for the primary embryonic cells, we revised our manuscript by re-running our analyses, using as the starting dataset the analyses obtained from immortalized cells, which were based on a ‘true’ n=3 of biological replicates. In this context, we filtered DEGs from this microarray using logFC>2 and adj. p-value1) found in primary GFP-GnRH neurons. We believe that this revised analysis is statistically more powerful, as the core bioinformatic analyses were performed on triplicate samples, with a second filtering step to take advantage of biologically relevant data obtained from n=1 primary GFP-GnRH neurons to confirm in vivo the expression of selected genes. Whilst RNAseq offers wider coverage of the genome and has advantages over microarray, we do not believe that this renders unimportant the data generated from these unique experiments and the novel genomic discoveries it facilitated.

      In line with this, our work may be considered as a proof-of-principle that transcriptomic profiles from rodent GnRH neurons can be exploited at different levels, including the possibility to identify novel GD candidate genes. Overall, our work also highlights the existence of similarities between two immortalized GnRH neuron cell lines with primary GnRH neurons, which was so far demonstrated by several functional studies, but not at molecular level.

      The manuscript has been now edited as per the above amendments (see first and second paragraph of Results section, lines 86-135).

      __CROSS-CONSULTATION COMMENTS __Notwithstanding the importance of neuroligin 3 during glutaminergic synaptogenesis, I agree with the reviewers on both points. Further screenings of the patient's family members should be done and the microarray data should be removed or potentially moved to a supplementary status.

      Reply: we thank the reviewer for their comments and, accordingly with their suggestion, we revised the filtering strategy starting from immortalized cells microarray and therefore moved a substantial part of the microarray data from primary GFP+ neurons as supplementary data. We also unsuccessfully tried to collect information of the brother from case 2 and investigated datasets from both the DECIPHER and 100,000 genome projects, but have been limited to two cases for which we have familial consent to publish.

      Reviewer #1 (Significance (Required)): The paper is of significance based on the neuroligin 3 data, which is indicative of abnormal synaptogenesis. However, these defects seem to only have a limited effect on the functionality of GnRH neuron system and do not seem to cause elimination of GnRH neurons themselves. Nevertheless these data do open end a new direction that may help explain some dysfunctions in reproductive health.

      Reply: we thank the reviewer for their comments and agree that our findings have the potential to facilitate new avenues for the investigation of reproductive disorders.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): Oleari et al performed comparative transcriptome analysis on the different developmental stages of GnRH neurons, as well as two immortalized GnRH neuronal cells GT1-7 and GN11 which represent mature and immature GnRH neurons. As a results, they identified a panel of differentially expressed genes (DEG). They further used top DEGs as candidate disease-related genes for GnRH-deficiency (GD), a disorder characterized with absent of delayed puberty and infertility. To this end, they found two loss-of-function mutations in NLGN3 in patients with GD combined with autism. This study provide a resource for the identification of novel GD-associated genes, and suggest an intrinsic connection between GD and other neurodevelopmental diseases, such as autism. I only have some minor concerns.

      1. According to the pedigree, both probands (case 1 and 2) inherited their NLGN3 mutations from their unaffected mother, consistent with an X-linked recessive inheritance. However, only "parent" was used in the manuscript, therefore, it is not clear if this "parent" is the probands' mother or father. __Reply: __Thank you for this comment. We were limited to the use of non-gendered terminology due to medRxiv policies. We have now amended the text and changed ‘parent’ to ‘mother’, lines 161, 173, 179, 185 and 730. We also integrated this sentence highlighting the X-linked pattern of inheritance: “Sanger sequencing of the probands’ mothers confirmed them to be the heterozygous carrier in each family, consistent with an X-linked recessive inheritance pattern.”, lines 185-186.

      It is suggested to integrate Figure 2 as a panel in Figure 1.

      __Reply: __We thank the reviewer for this suggestion. Due to our revision of first two Results paragraphs, we have now edited the Figures and the filtering flowchart has been added in Figure 2.

      What is the meaning of Peak LH and Peak FSH, and how are they measured in Table 2?

      Reply: This refers to peak value obtained after standard protocol GnRH stimulation testing with 100mcg GnRH (Gonadorelin) as an IV bolus and measurement of serum LH and FSH at 0, 20 and 60 minutes intervals. (e.g. Harrington et al., 2012, doi:10.1210/jc.2012-1598). This clarification has been added to the text in Table 2 legend (lines 681-683).

      A genotyping for the elder brother of Case 2 will be a strong evidence to support NLGN3 as a GD-associated gene.

      __Reply: __We thank the reviewer for this important point. In view of this issue, we have strived to collect DNA from this individual. Unfortunately, despite trying repeatedly to contact the family of proband 2, it has not been practically possible to collect these extra data from this family.

      We also identified a third case via a public database with central hypogonadism who carried a stop-gain variant in NLGN3, but unfortunately the family did not release their consent for publishing this case.

      The authors claimed neither probands carried deleterious variants in known GD genes. It is suggested to indicate the exclusion criteria (which genes? How do they define a variant is deleterious?)

      Reply: We thank this reviewer for raising this important point of clarification. Inclusion criteria for variants in known GD genes (updated gene list available in Supplemental Table 3) were as per Saengkaew et al., 2021 (doi: 10.1530/EJE-21-0387): “Only variants that met the ACMG criteria for pathogenicity, likely pathogenicity, or variants of uncertain significance (VUS) were retained in the analysis”. We have added this sentence in the manuscript, lines 150-151.

      Please also include a sequence chromatogram for proband 2.

      Reply: We thank the reviewer for their comment. We added the chromatograms for proband 2 and his heterozygous mother in revised Figure 3.

      CROSS-CONSULTATION COMMENTS I agree with Reviewer 3, the genetics is not very strong, as NLGN3 mutations were only found in one GD case from their cohort and one pre-pubertal case from the literature. It will be nice to analyze the genotype and phenotype of Case 2's older brother. Further, it is important to screen NLGN3 rare sequencing variants in larger GD cohorts.

      Reply: We thank the reviewer for their comment, but respectfully disagree with this assertion. The second case is not from the literature, but is a second case found thanks to GeneMatcher, an international tool that allows researchers to collaborate on novel gene discovery. We have also explored other cohorts that were available to us, including the DECIPHER and 100,000 genome project, but have been limited to two cases for which we have familial consent to publish. We anticipate that further international patient cohorts will be screened following the publication of this manuscript (added in Discussion section, lines 306-308). As described above, despite trying repeatedly to contact the family of proband 2, it has not been practically possible to collect these extra data from this family.

      Reviewer #2 (Significance (Required)): This study provides a resource for the identification of novel GD-associated genes, and suggest an intrinsic connection between GD and other neurodevelopmental diseases, such as autism. It may welcome by researchers and clinicians in the filed of neurodevelopment.

      Reply: We thank the reviewer for their positive and supportive comments.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      __Summary: Oleari et al used murine GnRH1, and immortalized GnRH cell lines (GT1-7, Gn11) to define genes of interest in GnRH development and used this list to filter exome sequencing data from patients with some evidence for GnRH Deficiency.

      Title: I am concerned that the title of the paper overstates the results and conclusions.

      Intro: use of "candidate causative genes" overstates the evidence presented.

      __Reply: __We thank the reviewer for their comment and have revised the title to reflect the findings of the study. We have also edited the sentence in the abstract reporting "candidate causative genes" as follows: “Here, we combined bioinformatic analyses of primary embryonic and immortalized GnRH neuron transcriptomes with exome sequencing from GD patients to identify candidate genes implicated in GD pathogenesis”, lines 40-43.

      Results: The transcriptomic profile of the developing human GnRH neuron has been published via in vitro differentiation protocols twice (Lund et al 2020, and Keen et al 2021). Gene set data is publicly available. This should be explicitly compared in results not relegated to discussion -- two or three examples it not enough to say mouse can be used instead of human.

      __Reply: __We thank the reviewer for this comment. We apologize if our sentence in the Discussion was misleading, as we did not intend to make a conclusion on the similarities of the two datasets/cell types, neither to suggest the use of rodent instead of human.

      Although we are aware that differences among species might exist, mouse/rodent models including immortalized cells have been instrumental to understand the molecular mechanisms of GnRH neuron development and to predict candidate genes. Indeed, our aim was to demonstrate that transcriptomic profiles of rodent GnRH neurons could be integrated with exome sequencing data from human patients to reveal novel candidate genes.

      Therefore, the aim of our study was different to that of the Lund and Keen publications. Further, caution should be exercised in any deeper comparative analyses with our transcriptomes, for following reasons: first, the GnRH neurons generated from human iPSC and cultured for 20 and 27 days cannot be objectively defined for their ‘age’ in order to be then compared to immortalized or primary embryonic GnRH neurons; second, in these datasets a different and more extensive transcriptomic technique has been used (RNAseq vs microarrays).

      There was no intention to relegate to the discussion the possible similarities with other transcriptomic datasets, but we felt that these comparative analyses were beyond the scope of our work.

      However, following the Reviewer’s suggestion, we have tried to make comparative analyses with the publicly available datasets from Lund et al 2020 and Keen et al 2021, and with a paper just published (Wang et al 2022), as follows.

      In Lund et al. paper, GnRH-like neurons were obtained from human iPSCs by dual SMAD inhibition and FGF8 treatment. We selected data obtained from cells treated with FGF8 and cultured for 20 days and 27 days for comparison with our early and late genes, respectively.

      Because the authors of this paper did not publish the full list of differentially expressed genes (DEGs) from this specific comparison (20 vs 27days) and we were not able to retrieve it upon request, we used the normalized counts of these samples (available at ArrayExpress repository) to compare the two experimental groups with DESeq (Bioconductor release 3.15). To increase stringency of our analysis, we considered as differentially expressed those genes which displayed both an adjusted p-value of less than 0.05 and an absolute fold change of >2. The number of DEGs obtained was different and greater (5981) than from the published data, and this large number of genes may, by chance alone, contain a large fraction of any gene dataset (including the genes that we found with our analysis). For this reason, this particular comparison in this dataset cannot be informative or useful.

      Next, we considered the dataset from Keen et al. In this paper, the authors have tested different differentiation protocols to obtain GnRH-like neurons from human wild-type or mCherry embryonic stem cells (hESC). They transcriptomically profiled hESC-mCherry-derived GnRH neurons at 8,15 and 25 days of culture.

      Again, although we cannot precisely define the matching embryonic stage of cells cultured for 8, 15 or 25 days, we compared the lists of DEGs from immortalized GnRH neurons (GN11vsGT1-7) with the transcriptomic profiles of mCh-hESC at day 15 vs day 8 and mCh-hESC at day 25 vs day 15, respectively. We considered as differentially expressed the genes that displayed both an adjusted p-value of less than 0.05 and logFC>2. We found that the majority of the genes that were differentially expressed in one dataset were not in the other. However, the few genes that were differentially expressed in both datasets demonstrated a good correlation, i.e. the same expression trend. Although this latter approach was more fruitful, by suggesting a partial similarity between primary GFP-GnRH neurons and hESCs-derived GnRH neurons at day 25 vs day 15 time-point, we do not feel that we could draw significant and reliable conclusions.

      Further, if we compare these two datasets obtained by RNAseq from hiPSC and hESC, even by taking into account the large amount of DEGs found in our re-analysis of Lund et al., 2021 raw data, a relatively small number of common DEGs were found. These data also suggest that there is transcriptomic heterogeneity even among human-derived GnRH neurons.

      In addition to these two datasets, while our manuscript was under revision, a new paper was published, in which the authors dissected iPSC-derived GnRH neuron transcriptome with RNA-seq at single cell level (Wang et al., 2022, doi:10.1093/stmcls/sxac069). Again, although the same concerns may apply in comparing this dataset with ours and raw data of DEGs were not publicly available in this case, we compared the expression trends of our 29 candidates with gene expression trajectories identified in this work. As a result, 24/29 candidate genes, including NLGN3, were found to have an expression trend consistent with our dataset. The few remaining genes exhibited an opposite trend (2/29) or were not found in available data from this work (3/29). As this is a purely qualitative analysis, we do not feel it would be appropriate to include it in the Results section, but have included commentary on these comparative dataset analyses in the Discussion section (lines 247-257). A future study could be designed to mine the raw data from all the available transcriptomic profiles of developing GnRH neurons, but this is beyond the scope of our current manuscript.

      The authors need to comment on other GnRH1 expression in the brain of developing rodent and if they think the GnRH1 sorted neurons are just "GnRH Neurons" associated with reproduction (Parhar et al 2005) due to microdissection.

      __Reply: __We thanks the reviewer for raising this point of clarification. We have carefully selected by microdissection nasal areas from E14, nasal and basal forebrain areas from E17 and basal forebrain from E20 rat embryos (see revised Methods, lines 325-327). We are therefore confident that what we have obtained is RNA from ‘reproductive’ GnRH neurons only.

      Questions about Cases/Missing Phenotypic Information: 1) Case 1: the patient underwent increased testicular volume on testosterone therapy -- testosterone therapy does not increase testicular volume. Has this patient undergone or been assessed for reversal of his hypogonadism?

      __Reply: __We thank the reviewer for their comment. The patient had minimal testicular development on testosterone (from 10ml to 12ml) but did not increase testes volume beyond 12mls, consistent with a partial HH phenotype. He has had two trial periods of 3-4 months off testosterone treatment and during these periods had both low serum testosterone concentrations and symptoms of hypogonadism (tiredness, low energy and reduced muscle strength).

      2) Case 2: Is too young to be classified as having a pubertal defect. Microphallus is mentioned but what size, was this diagnosed at birth and treated? I think the case for GD is overstated in the results and discussion (especially with the discussion of small testes).

      Reply: We thank the reviewer for requesting these clarifications. The patient has not received any treatment for his microphallus (2.5 cm length in mid-childhood). We agree that this case is too young to be classified as having a pubertal defect, but the presence of microphallus and small testes volume in infancy and early childhood, in association with low gonadotrophins and absent erections, are well recognized as red flag signs for hypogonadotropic hypogonadism (Swee & Quinton, 2019, doi:10.3389/fendo.2019.00097). We added this information to the Results section, lines 175-177.

      Genetic Information: Since this was a candidate gene search -- what other candidate genes were uncovered in these probands?

      Reply: The revised list of 29 candidate genes were screened in the two probands from our study using the whole exome sequencing datasets for these individuals, and only the variants of interest in NLGN3 described in the manuscript were found.

      By searching for mutations of the revised list of candidate genes in our GD cohort, we identified nonsense variants only in NLGN3 and no splice variants. We also found few rare and predicted damaging missense variants in this gene list identified. Indeed, two rare (MAF 25) missense variants were identified in the genes PLXNC1 and CLSTN2 in two further probands (now summarized in Supplemental table 4). We have not identified further probands with PLXNC1 or CLSTN2 variants of interest from additional cohorts and thus at present we have not yet taken these gene variants further for molecular characterization, but we will examine the relevance of this gene variant in future work.

      Do the probands have a clear explanation for their developmental disability other than the gene noted?

      __Reply: __We thank the reviewer for raising this point. Proband exomes were also screened for genes related to developmental delay and no other causal gene variant were identified. We added this information in the text, lines 183-185.

      I would encourage the authors to update Table 3: they are missing IHH/KS genes such as GLI3, SEMA7A, SOX2, STUB1, TCF12. I suggest they update the Table and analyses.

      Reply: we thank the reviewer for highlighting this point. Since we performed a new analysis, we also performed a new candidate gene prioritization using a more up-to-date gene list to instruct ToppGene (please see revised Supplemental table 3).

      CROSS-CONSULTATION COMMENTS Dear Reviewer #2, I am concerned that the paper presents only a single case of GD to support the scientific work. What do you think?

      __Reply: __We would like to highlight that, as we describe above, GD can be diagnosed prior to pubertal age in individuals with red flag phenotypic signs and biochemical evidence of hypogonadism.

      Dear Reviewer #1: In addition to the weakness in the microarray data, what do you think about the authors using publicly available data from human GnRH neuron transcriptomics for analysis?

      __Reply: __please see the above discussion on the comparison with publicly available datasets.

      Reviewer #3 (Significance (Required)):

      There is not high significance to this paper: This is not the first article with GnRH transcriptomes. I would argue the human data is more relevant. Developmental disability has been previously linked the GnRH deficiency (as even cited in this paper) The article presents one case of GnRH deficiency, and one pre-pubertal case -- providing some modest evidence for a candidate gene, NLGN3.

      __Reply: __We would like to rebuff this assessment of the paper’s significance. To our knowledge, this is the first report of transcriptomes from primary GnRH neurons isolated at key embryonic developmental time points. Other published reports refer to iPSC-derived or adult GnRH neurons (Keen et al., 2021; Lund et al., 2020; Wang et al., 2022; Vastagh et al., 2016 and 2020).

      Similarly, the association of central hypogonadism with developmental disabilities have been reported in registry-based studies, but few causative genes have been identified, nor patient variants functionally validated in order to investigate the molecular biology underpinning this association. In the Discussion, in the light of a recent paper (Manfredi-Lozano et al., 2022, doi: 10.1126/science.abq4515), we also postulate that NLGN3 might be required for neuritogenesis of extra-hypothalamic projections of GnRH neurons thus contributing to the pathogenesis of NDD (lines 294-300).

      Regarding to human data, we would like to acknowledge that we had a third case that we were not able to publish due to family consent. NLGN3 deficiency is likely to be a rare disorder, but that should not obviate the impact of investigating the molecular etiology – indeed, many insights into human biology have come from private mutations in rare disease.

    1. authority and the expertise to make weather predictions in the first place and it's a story about how to transform knowledge of nature into market knowledge and thus profit and we'll see some of these similar

      I think this whole thing is interesting to me, it almost reminds me today of todays political climate. I feel like there are times in which the different news sources all argue and disagree in similarly childish ways, and where people will manipulate the information that is disseminated to the public for their own personal gain. This is an interesting social facet to the interaction/connection of nature and commerce; which I find interesting because commerce is a largely social construct and innovation in many ways is as well. Social facets may be more important than we think in these analyses; it almost reminds me of Solnits designation of the ghost dance as technology and the amount of social/group emotional factors that unexpectedly need to be considered in that thought.

  8. Oct 2022
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      RC-2022-01632

      Answers to referees

      First of all, we wish to thank the 3 referees for their careful evaluation of the manuscript. We see many issues that they have raised as legitimate and have tried to provide experimental or editorial answers. In contrast, some issues are presently addressed in the context of a future manuscript and we had rather not introduce these studies in the revised version.

      Below, one will find the answers and the description of the revisions already introduced in the revised manuscript (questions are recalled in blue italics).

      New and modified figures, plus not shown figures and tables are indicated in the text below but could not be pasted in the document and can be found in the Revision plan.

      Referee # 1

      Evidence, reproducibility and clarity

      They then delivered 86/8 and LSBio anti-En1 antibodies, that catch En1 in the cleft and prevent it from being captured by MNs.

      Perhaps we were not clear. We did not deliver the antibodies 86/8 and LSBio, we used them for western blots and immunohistochemistry (IHC) to identify EN1 and localize it. We delivered the third antibody, a single-chain anti EN1 antibody (scFvEN1), that captures extracellular EN1 and prevents it from being captured by MNs on the basis of the LSBio staining (Figure 4A-C).

      Finally, heterozygotes revealed also a degeneration in dopaminergic neurons within midbrain similar to the one observed in spinal MNs, along with an upregulation of SQTSM1/p62 gene/protein, a factor in MN ageing linked to the classical genes implicated in familial forms of ALS (SOD1, TDP-43, FUS, and C9ORF72).

      This is a fair comment/work description, that does not require answers.

      Significance

      *Major comments: *

      It is unclear why levels of intensity for RNAscope were not quantified, and qPCR was preferred for quantifications in Figure 1b. RNAscope is a technique that allows for spatial distribution analysis of the markers and their level of the expression. This data can be easily quantified utilizing the QuPath software which is open access. Same concerns apply to Figure 2a.

      Quantitative RT-PCR provides a quantitative measure of gene expression. Since only V1 interneurons (including, Renshaw cells) express EN1, we infer the spatial distribution, although not expression level cell by cell. Figure 2A is an actual counting at 4.5 months of En1+ cells and of Calbindin+ cells (Renshaw cells), both identified by RNAscope. Thus, it is clear that the number of En1-expressing cells (V1 interneurons) is not modified at 4.5 months when muscle weakness and death of aMNs are well advanced (around 70% of the aMNs that will eventually die, are already gone). Long-term survival of V1 interneurons is further demonstrated in Figure 2D (left panel) until 15.5 months, (see also below) whereas total En1expression is reduced by half. Quantification neuron by neuron of the amount of En1 transcribed (RNAscope) would indicate the variation, among interneurons, of En1 transcription in WT and mutant mice. This is interesting per se but would not modify the main information that these neurons do not die in the heterozygote and that En1 transcription does not decrease with time in both WT and mutant genotypes (at least until 15.5 months).

      *Antibodies should be validated utilizing a reporter mouse. En1cre mice are commercially available and can be crossed with reporters (TdTomato or YFP mice). Utilizing this tissue En1 antibodies can be easily validated. The EN1 antibody shown in Figure 1c seems unspecific, staining several neuronal populations in the spinal cord. *

      Indeed, antibody validation is extremely important. LSBio is commercial (CliniSciences), 86/8 was developed in the laboratory and fully characterized and used in previous studies (e.g. Alvarez-Fischer et al. Nature Neurosci. 14: 1260-1266, 2011; Rekaik et al. Cell Reports 13: 242-250, 2015; Blaudin de Thé et al. EMBO J. 37: e97374, 2018), scFv against EN1 was prepared from the 4G11 hybridoma (Developmental Hybridoma Bank, Iowa City, USA) and validated in previous studies (e.g. Wizenmann et al. Neuron 64: 355-366, 2009). In the present study, the two polyclonal were further validated inseveral ways.

      In the WBs we compared ventral midbrain (VMB) and spinal cord (SC) tissues and found similar patterns. Strong evidence for antibody specificity is immunostaining extinction with the antigen and with absence of first antibody, which we carried out.

      We have now used LSBio and 86/8 to perform a WB on spinal cord (SC) and ventral midbrain (VMB) extracts with or without the first antibody and we find that the absence of first antibody fully eliminates band staining. The western has been introduced in the revised manuscript in place of the cross immunoprecipitation.

      Finally, we have quantified EN1 in the aMNs of the heterozygote at 3 months (before cell death), showing that EN1 content is decreased by approximately 2-fold (LSBio antibody) in both a and gMNs with no change in neuron number. This result demonstrating that EN1 is diluted by approximately twofold (concentration per neuron when all neurons are still present), in addition to further validating the antibody, is itself interesting and has been introduced in the revised manuscript as Supp. Fig. 1A.

      Regarding the staining in other neuronal populations, there is always some background, in particular in the tissue treatment conditions used for RNAscope. Furthermore, given the large number and wide distribution of V1 interneurons (Fig. 1A), we cannot preclude that EN1 is present at a low concentration in the extracellular space and in several cell types (discussed in Fig. 9 of the manuscript). This does not weaken the main conclusion that it primarily accumulates in MNs which do not express En1 (RNAscope).

      *Investigations of En1 expression in motor neurons from already available omics data sets would support the idea that En1 is expressed in motor neurons. *

      The En1 locus is silent in MNs. Microdissection of MNs and proteomic analysis would not be definitive since the interneurons that produce EN1 are in close vicinity of the MNs and since some protein is necessarily present in the extracellular space (where it is trapped by scFvEN1), making contamination unavoidable.

      Differentiation between Gamma and Alpha motor neurons should be performed using specific markers as Err3, Wnt7a or NeuN.

      This is a possible way to do the distinction, but size criterion in Cresyl violet is supported in the literature (Wu et al. Journal of Biological Chemistry, 287: 27335-27344, 2012; Dutta et al. Experimental Neurology, 309: 193-204, 2018). In our study, it is further validated by the demonstration that, in 9-month-old animals, the results obtained (cell number and specific death of large neurons >300µm2, but not of intermediate size ones 200-299µm2) are replicated by counting ChAT-stained neuron (Figure 2C). It is of particular interest that the number of medium size neurons (also ChAT-positive medium size MNs) does not increase when the number of large size (Cresyl and ChAT-positive) neurons decreases, thus precluding a “shrinkage effect”. Most importantly, the size criterion (Cresyl violet) allows us not to be mistaken by a possible down-regulation of markers in the mutant, independently of cell survival. We provide for the reviewer (Revision plan) but not for publication, the evolution with time of the number of neurons based on size (above 200 µm2) showing clearly that at 15.5 months the large population (>300 µm2) is decreased in the En1-Het, with very little change for neurons between 200 and 300 µm2, and certainly not an increase which would be expected if shrinkage occurred.

      We were indeed surprised by this finding and a plausible explanation is that a lower metabolic activity makes interneurons less sensitive to stress than aMNs which have to “fuel” long axons and high firing rates (not the case for gMNs). We propose this explanation in the discussion and make it clearer in our revised version. We agree that it is speculative and that the point raised by the reviewer is very interesting. We hope to address this in the future and have discussed this point.

      Since the cells do not die, we did not look for signs of apoptosis.

      We analyze lumbar sections from L1 to L5 as now indicated in the methods section in the manuscript

      The set of experiments reported in Figure 4 is of difficult interpretation without showing the actual presence of extracellular En1, that could be assessed with protein detection or RNAscope.

      This is another interesting suggestion, but we think that it will be difficult to distinguish low extracellular staining due to EN1 diffusion from some unspecific background. Since the scFvEN1 is secreted by astrocytes, it necessarily neutralizes extracellular EN1, resulting in a decrease in the MN content of the protein. This is an experiment with high specificity since the same scFv harboring a Cysteine to Serine point mutation that prevents EN1 recognition (no disulfide bound formation between the light and heavy chains) does not block EN1 capture by MNs (Fig. 4C for IHC and quantifications).

      As for extracellular EN1 mRNA identified by RNAscope, we hesitate to embark on the idea as mRNAs are likely secreted in insufficient amounts to be identified, even by RNAscope. The results that we have (no En1 visible by RNAscope in MNs, loss of EN1 in MNs following extracellular scFvEN1 activity, and preferential addressing of injected EN1 to MNs) demonstrate EN1 capture by MNs. Indeed, we cannot completely preclude the transfer of tiny amounts (escaping RNAscope detection in MNs) of En1 mRNA (for example, through extracellular vesicles), but we plead for not considering this hypothesis in the present paper. However, if the reviewer wishes, the possibility can be introduced in the discussion.

      Referee 2

      Evidence, reproducibility and clarity

      In general, most of the experiments shown in this study are well done and convincing. However, the data on p62 upregulation appear correlative and do not allow any conclusions about the mechanism and function how EN-1 modulates motoneuron survival and function. In addition, this study is not very precise on the mechanisms how motoneurons degenerate in this model so that there are only limited insights into the way how EN-1 acts on motoneurons in a physiological manner and under pathophysiological conditions.

      This criticism is justified, at least in part, as we agree that p62 upregulation is correlative. However, the fact that the neutralization of extracellular EN1 by the scFv increases p62 expression, is in favor of a causative link. The increase is also seen at 3 months in the En1-Het when all aMNs are still present but not after, which is interesting because, due to aMNs death, surviving MNs receive more EN1, information provided below and now introduced and discussed in the revised manuscript (Supp. Fig. 1B).

      As for p62, and as also mentioned by referee 3, Fig. 8 is very hard to follow and we propose to simplify it to make the message clearer:

      We have revised Fig. 8C, D in which we focus exclusively on SQTSM1/p62 mean expression (see revision plan)

      A second information is that a difference in mean p62 expression between WT and Het is seen only at 3 months in aMNs. For aMNs, we propose that this is due to the fact that they are very sensitive to EN1 dosage (in contrast with gMNs which do not die in the En1-Het). At 3 months, aMNs have only half of their normal EN1 content. Later, at 4.5 months 75% of the aMNs bound to die are already dead (Fig. 2D) and the remaining neurons receive more EN1 (even more so at 9 months), as could be measured (see above Supp. Fig. 1B). We thus can propose an accelerated aging of aMNs at 3 months due to both EN1 decrease and high metabolic activity (higher than in gMNs).

      In the case of the scFv, scFvEN1, but not the mutated version induces enhanced mean p62 expression in the 80% surviving aMNs and in gMNs at 7 months (low aMN death in this model, see Fig. 4F). As can be seen also in a newly added figure (Supp. Fig. 2) that has been introduced in the revised manuscript and is shown below, 7-month-old scFv animals and 3- to 3.5-month-old En1-Het have similar phenotypes. This mild scFv phenotype (a-MN death and muscle strength loss) in 7-month-old mice in spite of a huge loss in the EN1 content of MNs (Fig. 4C) suggests that the En1-Het phenotype is not entirely due to the decrease in EN1 transport from V1 interneurons to MNs (see discussion and Fig. 9).

      It remains true that we have voluntarily decided not to examine in depth the molecular mechanisms allowing EN1 to exert its protective activity, a decision that we would like to defend and maintain.

      A first reason is that in previous papers on mesencephalic dopaminergic (mDA) neurons (Alvarez-Fischer et al. Nature Neurosci. 14: 1260-1266, 2011; Rekaik et al. Cell Reports 13: 242-250, 2015; Blaudin de Thé et al. EMBO J. 37: e97374, 2018), we evaluated several mechanisms involved in EN1 neurotrophic activity and we did not want this study to be a duplication of studies done on a different neuronal population, even if mechanisms might differ in part, between aMNs and mDA neurons. What has interested us more is that, in the two cases, age is an important factor in the unveiling of the degeneration phenotype (mDA neurons start dying at 1.5 months and aMNs at 3 months). It is because of this similarity that we performed the bioinformatic study that has led us to SQTSM1/p62. In this context, it is of interest that mean SQTSM1/p62 expression (variability of expression between neurons is not discussed in the revised version) increases with age in the wild type, thus can be seen as an age marker. It allows us to propose that EN1 extracellular neutralization and the loss of one En1 allele, that increases mean SQTSM1/p62 expression accelerate aging.

      A second reason is that the study is oriented toward a possible use of EN1 as a therapeutic protein. This orientation also has to do with the focus on SQTSM1/p62. Indeed, there are probably many pathways downstream of EN1, but in the bioinformatic analysis of genes differentially regulated in WT and En1-Het mDA neurons and also expressed in MNs, SQTSM1/p62 is the only one that interacts with the 4 genes mutated in the major ALS familial forms. In addition, SQTSM1/p62 mutations have been observed in ALS patients (References 41 to 45 in the manuscript).

      Finally, the most important point is that the main message of this paper is the discovery of a non-cell autonomous EN1 activity in the spinal cord and of its ability to travel between V1 interneurons and MNs. This specificity best explained by a targeting signal that we have identified is at the basis of the specific addressing to MNs of EN1 intrathecally injected, which also has implications for its potential therapeutic use.

      Specific points of criticism

        • In Fig. 2a, the authors show that EN-1-positive interneurons are not reduced at 4.5 months in the spinal cord. No data are shown for later time points such as 9 months, the corresponding stage when motoneuron loss is observed, or at 16 months which corresponds to the data shown in Fig.1. The argument that there is no reduction of V1 interneurons between 4.5 months and 16 months because there is no decrease of EN-1 expression between 4.5 and 16 months, as shown in Fig. 1b is not convincing. EN-1 expression could change in individual cells, thus compensating for the loss. Data on numbers of EN-1-positive cells at 9 and 16 months should be included, and a potential autocrine effect of EN-1 on V1 interneurons, as observed in midbrain dopaminergic neurons, characterized in more detail. * Fig. 2A illustrates the absence of interneuron loss at 4.5 months, but this set of data is completed by those of Fig. 2D that demonstrate the maintenance of V1 interneuron number until 15.5 months, at least. It can be noted that, in contrast with interneurons, aMNs at 4.5 months have experienced massive cell death (70% approx. of total aMN death at 15.5 months). As a whole, data of Fig. 2 demonstrate that the number of small neurons (100-199 µm2) and intermediate size neurons (200-299 µm2) does not change with age, at least through 15.5 months. This is in strong contrast with large aMNs (>300 µm2). As already explained in our answers to referee 1, size is an excellent marker for the identification of neuronal subtypes and the analysis of survival (See answers to referee 1, justifying the use of neuron size).
      1. In Fig. 2e, the authors present data on loss of muscle strength between 4.5 and 15.5 months. They conclude that this reflects gradual neuromuscular strength loss. Since neuromuscular endplates have a very high safety factor, they can maintain full function even if transmitter release is reduced by more than 80%. Therefore, the loss of muscle strength seems to reflect the progressive loss of presynaptic terminals at neuromuscular endplates, rather than a gradual loss of neuromuscular strength. *

      We apologize for the semantic confusion. What is measured is a progressive loss of muscle strength due to the progressive loss of presynaptic terminals and not a gradual loss of neuromuscular strength. This is now modified throughout the revised text.

      • More detailed data on NMJ morphology should be included. How does EN-1 modulate neuromuscular endplates? Is EN-1 located at neuromuscular endplates after being taken up from motoneurons? Even if the mechanism is indirect, via upregulation of p62 under conditions when EN-1 signaling is reduced, does this situation lead to enhanced localization of p62 at neuromuscular endplates? *

      We do not see expression of En1 mRNA or the presence of EN1 protein at the level of the endplate (Supp. Fig. 3 in revision plan)

      • The data shown in Fig. 3 on changes in NJM morphology appear incomplete and not convincing. As SV2a is not a good marker for changes in presynaptic compartments since it does not allow conclusions on how many synaptic vesicles are released, additional markers for presynaptic active zones such as Bassoon, Piccolo, Munc-13 should be studied. The analysis of fully occupied endplates appears arbitrary, and the differences are relatively small. Additional EM pictures and quantitative analyses of active zone proteins in the presynaptic compartment would help to support the argument of the authors that presynaptic compartments degenerate before cell bodies are lost in EN-1 +/- mice. *

      SV2a and NF staining (it is not only SV2a) at the level of endplates identified by a-Bungarotoxin labeling has been used in a large number of studies (Wahlin et al. J. Comp. Neurol. 506: 822-837, 2008; Hasting et al. Scientific Reports 10: 1-13, 2020; Yahata et al. J. Neurosci. 29: 6276-6284, 2009 ; Jones et al. Cell Reports 21: 2348-2356, 2017) Our goal was not to document the loss of synaptic activity through the use of the three suggested markers, Bassoon, Piccolo and Munc-13. Doing it would force us to initiate experiments taking several months to prepare the material and do a quantitative analysis in the models of EN1 loss of function (En1-Het) and neutralization (scFv), plus rescue by EN1. Nor do we wish to initiate a novel collaboration to produce a quantitative ultrastructural study. We see the latter morpho-functional studies beyond the scope of the manuscript and wish to be given the possibility to present them in a separate study (see below in “Description of the experiments that the authors prefer not to carry out”).

      The distinction between fully occupied, partially occupied and denervated endplates is not arbitrary and we apologize for not having sufficiently described the methodology. As illustrated in modified Fig. 3 and explained in Material and Methods, a fully innervated endplate is defined as an endplate in which 80% or more of the green pixels (a-BGT) are covered by a red pixel (SV2a), a partially one is between 20 and 80% and a denervated one below 20% coverage. Thus at 9 months and later ages, close to 30% of the endplates are either partially innervated or denervated. In fact, it is more likely that they are partially innervated since the number of AChR clusters does not change (totally denervated clusters normally dissolve). The 80% threshold for fully innervated was selected to give a margin of security, and it is likely that the percentage of 25 to 30% of partially innervated endplates is an underestimation.

      In the Revision plan is presented a table with the calculations and modified Figure 3.

      We agree that we were not clear enough in our description and that it may have given the impression that the differences were relatively small. We think that retrograde degeneration is strongly supported by a loss of muscle strength that parallels the decrease in fully occupied endplates (a-BGT, NF, SV2a) and precedes aMN loss by more than 1 month. We have recently contacted an electrophysiology group to establish a collaboration that will allow us to follow functional changes at the level of the spinal cord and of the neuromuscular junction and we see the experiments proposed by the reviewer as complementary to these physiological approaches. Yet, we do not want to ignore the opinion of the reviewer and mention it in the conclusion, on the basis of his/her comment.

      • The authors present evidence for a glycosaminoglycan (GAG) binding domain that appears responsible for uptake of EN-1 into motoneurons. However, it is unclear into which cellular compartment EN-1 is taken up after GAG binding on motoneurons. The authors propose this could be an alternative pathway to conventional endosomal uptake. How can the EN-1 that is taken up into cells exert transcriptional effects in motoneurons? As a minimum, more data on the subcellular distribution of endocytosed EN-1 should be included to support current hypotheses and to close the gap from cellular uptake to transcriptional regulation. *

      The question is justified since we did not recall until page 12 of the Discussion that EN1 is, as most tested homeoprotein transcription factors, captured by a mechanism distinct from endocytosis. While not yet fully understood, the process involves the formation of inverted micelles that allow for direct targeting to the cytoplasm and from there to the nucleus thanks to the NLS. We now mention in the introduction that EN1 transfer and HP transfer is based on unconventional secretion and internalization processes.

      • The differences in p62 expression with age in WT and EN-1 +/- mice as shown in Fig. 8c are not convincing. First, the p = 0.0499 and p = 0.0536 values for differences at 3-4 months of age appear borderline, and it is unclear what the dispersion analysis that is shown really means. Moreover, the question remains how a potential dysregulation of p62 then affects NMJ morphology and function. Is this change in p62 also detectable in presynaptic compartments? *

      We agree that p values in the range of 0.05 are not extremely high and this is due to the heterogeneity in SQTSM1/p62 expression, that reflects that of MN populations, and induces a high variance. We also agree that this figure is too complicated and a simplified version has been proposed above (see answers to reviewer 1). To summarize, Fig. 8C shows that in WT animals, with no aMN death (grey) the level of SQTSM1/p62 expression in aMNs and gMNs increases between 3 and 4.5 months and between 4.5 months and 9 months, with significances varying between pThe new Fig. 8 panel D (please see above, answers to referee 1) now includes the results obtained with the scFvs. A phenotype comparison between the two models (En1-Het and scFvEN1) has been introduced in Supp. Fig. 2 (see above).

      We have no evidence that EN1 modulates the SQTSM1/p62 promoter directly. The identification of this gene as a target (not necessarily a direct target) of EN1 comes from the bioinformatic analysis described in the manuscript and we were intrigued by the interaction with the 4 main familial ALS mutations and the existence of families with SQTSM1/p62 mutations. This is what led us to analyze its expression in our two models of EN1 loss of function. Although the En1-Het mouse is not an ALS model, the results support the idea that EN1 could be used as a therapeutic protein in several familial and even sporadic forms of the disease. The latter hypothesis is now being tested on MNs derived from iPSCs (sporadic patients, fALS and isogenic variants, and healthy controls). If the data lend weight to our hypothesis, as collaborative and in-house preliminary data suggest, then a complete analysis of EN1 targets in human MNs will be undertaken. Again, we really think that this is out of the scope of this study.

      For Fig. 8, we fully agree that it can give headaches and we apologize. Moreover, it induces wrong interpretations (mean intensity increases with age and dispersion between 4.5 and 9 months has a calculated p__Referee #3__

      Evidence, reproducibility and clarity

      Nevertheless, the connection between EN-1 and p62 is not well developed by the data presented and future readers may be left with many questions regarding how EN-1 and p62 are related (e.g. direct interaction? transcriptional regulation?), whether p62 is indeed the mediator of EN-1 trophic effects, or the significance of the increased levels of p62 for motoneuron disease

      The reviewer is right and we have tried to better explain and to simplify. Please see responses to referees 1 and 2.

      *Figure 1C: There appears to be EN1 immunoreactivity (green) in several areas of the spinal cord, including dorsal regions. Can the authors clarify what that labeling could be representing? *

      Unfortunately, there is always some background staining, in particular in the tissue treatment conditions appropriate for RNAscope. Furthermore, given the large number and wide distribution of V1 interneurons (Fig. 1A), we cannot preclude that EN1 is present at a low concentration in the extracellular space and in several cell types (now represented in Fig. 9). This does not weaken the main conclusion that it primarily accumulates in MNs which do not express En1 (RNAscope).

      *Figure 1D: These immunoprecipitation results lack a negative control with irrelevant antibody to confirm that the band shown it's being recognized specifically by the antibodies reacting with the blot. *

      Please see the response to reviewer 1 above with the Western blot and the absence of staining on a WB in absence of first antibody (86/8 or LSBio).

      F*igure 1E: The intensity of the EN1 labeling in MNs, much stronger than in V1 interneurons, is intriguing, given that MNs do not express engrailed-1 mRNA. One would have expected the opposite. It may help here if it was possible to show that immunoreactivity in MNs is diminished in the het mutant mouse. *

      We also were surprised by this intensity higher in MNs than in V1 interneurons, as if the protein was exported rapidly towards the target neurons. We have done the experiment proposed by the referee, found a twofold (approx.) immunoreactivity reduction in En1-Het MNs (see above Supp. Fig. 2A in answers to referee 2). This supplemental figure has been introduced in the revised version. The experiment was done at 3 months when no MN death has yet occurred. Later the neurons “replenish” with EN1, probably because they do not have to share the limited supply with the dead ones (see above answers to referee 2 and Supp. Fig. 2B).

      *Figure 2D: There are a few possible problems with these data and their interpretation. First, this reviewer feels that 5 neurons (y-axis) is a rather small number. Are these 5 neurons per what area? From how many mice? I did not find that information in the figure legend. A larger area should be quantified so that we get numbers that are more robust. Second, such differences could also be due to hypotrophy of the MNs, namely, that MN number is the same but they are smaller. *

      The differences cannot be attributed to hypotrophy. A first reason is that, at 9 months, the Cresyl violet and ChAT staining give the same results for medium size and large neurons (Fig. 2C). Furthermore, when one counts the cells throughout 15.5 months, the decrease in the number of large neurons is not compensated by an increase in the number of medium size or small ones. The reasoning and a graph, not intended for publication can be found in answers to referee 1.

      *Figure 3A: It would be useful that the authors explain how these AChR clusters were defined, visualized and counted. I could not find this information in the Methods. Perhaps this could be done by showing an alpha-BTX image illustrating the clusters. *

      We fully agree that the procedure was not well explained and we have introduced a correction in the Material and Methods section. For more details, please see answers to referee 2.

      *Figure 3B: As each adult endplate is only innervated by one MN, one would have expected fewer clusters and/or endplates, if indeed MNs are missing in this mouse, rather than endplates that are partially occupied. This could be clarified a bit more explicitly. *

      This is true and the ambiguity takes its origin in insufficient explanation of how fully innervated, partially innervated and denervated endplates were defined. Please see above and also in answers to reviewer 2. Modifications have been introduced in the text and in Fig. 3. The referee is right, the absence of change in the number of AChR clusters suggests that there are very few fully denervated endplates and that what is defined as such in the analysis corresponds to partially innervated endplates (see above). This is now discussed in the text.

      Figure 6B: Would not be better to do this with a virus, like in the case of the antibody? A more robust effect on MN survival may be attainable and thus strengthen the concept.

      This would be another interesting experiment and we are presently exploring this possibility (with preliminary results). The choice of the virus and of the promoters is very important. We are comparing several AAVs, including AAV2, AAV2-TT (which diffuses better) and AAV8. For the promoter, we do not want to express within MNs as the imported protein might have special properties, associated with import. V1 interneurons would be best, but we have to verify if this does not modify V1 physiology. Astrocyte is another option, but with a similar pitfall. This means that we have a long way to go before proposing a “gene therapy” approach.

      In addition, in the context of future clinical studies, we were eager, on the basis of the long-lasting activity of the protein already observed in the mesencephalic dopaminergic neurons (Alvarez-Fischer et al. Nature Neurosci. 14: 1260-1266, 2011; Rekaik et al. Cell Reports 13: 242-250, 2015; Blaudin de Thé et al. EMBO J. 37: e97374, 2018), to try a protein therapy in the spinal cord. Interestingly, the effects are also long-lasting in the spinal cord, (12 weeks in the mouse before a second injection is needed) and, according to contacted physicians, intrathecal injections, every second month or even more frequently, could be envisaged in the human. In that case, protein injection is possibly advantageous for the following reasons:

      (i) viral particles can travel far and we do not know what would be the side effects.

      (ii) the protein is short-lived but specifically addressed to MNs (thanks to the presence of EN1 binding sites at their surface), thus minimizing the issues associated with permanent expression and side effects.

      (iii) EN1 is a natural protein normally secreted and the immune system might not be solicited as much as with viral approaches.

      *Figure 7A: The protein seems to be mainly in the cytoplasm of those cells (nuclei are dark and unlabeled), which is also unusual for a transcription factor that functions in the nucleus. Also surprising that the protein is gone in 3 days, but has effects over 24 weeks. Any explanation for that? *

      The protein is imported and is thus both in the cytoplasm where it exerts an effect on protein translation (Brunet et al. Nature 438: 94-98, 2005; Alvarez-Fischer et al. Nature Neurosci. 14: 1260-1266, 2011; Yoon et al. Cell 148: 752-764, 2012) and in the nucleus where it exerts its transcriptional and “epigenetic activity (see below for the latter). In fact, different antibodies and fixation procedures can favor cytoplasmic or nuclear staining. When nuclear, the dark point at the center, probably the nucleolus is less stained.

      Two images illustrating this point are shown in the revision plan.

      For the second part of the question, three days are sufficient for a long-lasting activity. This was also observed in the midbrain where the protein restores the epigenetic marks jeopardized by an acute oxidative stress (Rekaik et al. Cell Reports 13: 242-250). This has led to the hypothesis that EN1 has an important action at the level of the structure of the heterochromatin, thus a long-lasting “epigenetic” activity. We are presently working on the latter effects on the chromatin structure using human MNs derived from iPSCs (patients and control).

      *Figure 7B: It's not clear what the blue and red bars mean, as this is not explained in the legend. Also, the y-axis says "%Chat+" suggesting they are counting MNs, but in the text they talk about EN-1 capture. If the latter, the y-axes should indicate % EN-1 over Chat, or something like that. In general, better figure legends would improve the experience of the reader. *

      In this experiment, we wanted to test the presence of a GAG-binding domain in EN1. To test its potential role in EN1 internalization and localization, we co-injected or not the RK-EN1 with hEN1 protein. Then, we counted the percentage of MNs (%ChAT+) which contain, or not, the hEN1 protein (hEN1+ in red or hEN1- in blue), allowing us to verify if the RK-EN1 alters the internalization of the hEN1 protein. So yes, we are looking at the capture of EN1 by the MNs with or without the RK-peptide (or control peptides). We have modified the text to make the point clearer.

      *Statistical analyses: In principle, comparisons of data obtained in studies that involved two variable parameters (such as time and genotype/treatment) should be weighted by a 2-way ANOVA test, which is more stringent since more conditions are being tested simultaneously. Usually a t-test is reserved for a pairwise comparison in an experiment involving only two conditions of the same variable. *

      The reviewer is correct. The two-way ANOVA is explained in the Statistical analyses section of the Methods. The analyses were carried out and the results listed in the legends for Figs 2, 3, 4, 6 and Supp. Fig. 1.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We would like to thank the editor for handling our manuscript entitled, “Mouse SAS‑6 is required for centriole formation in embryos and integrity in embryonic stem cells”, and the reviewers for the insightful comments and suggestions to improve our work. We aim for our manuscript to be considered for a “Short Report” format. As such, we would like to emphasize that we did not focus on the in vivo part of our study, where the Sas-6 mutant mouse embryos resemble our previously published Sas-4 mutants, as pointed out by the reviewers, because both mutants lack centrioles. In our opinion, the novelty of our work is evident in the discovery that mouse embryonic stem cells (mESCs) lacking SAS-6 are still able to form centrioles, albeit mostly abnormal, which is also shared by the reviewers. This is in contrast to Sas-4 mutant mESCs for example, which lack centrioles (Xiao*, Grzonka* et al, EMBO Reports 2021), and human cultured human cell lines without SAS-6, which have been shown to lose centrosomes. We are in the process of editing the manuscript and performing additional experiments per the reviewers’ recommendations. Below, we provide a point-by-point description of our revision plan.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The article by M. Grzonka and H. Bazzi entitled: Mouse Sas-6 is required for centriole formation in embryos and integrity in embryonic stem cells, describes new findings in novel mouse models of Sas-6 knockouts (KO). This is an interesting study that reports two different mouse Sas6 KO models and the depletion of Sas6 from mouse embryonic stem cells (mESCs). This type of analysis has never been done before and so it reveals and describes a role for Sas-6 in centriole biogenesis in mouse.

      We thank the reviewer for highlighting the novelty of our work on the roles of SAS-6 in mice.

      • *

      The authors compare their analysis with Sas-4 KO and overall found similar results when compared to previous work from H. Bazzi, when Sas-4 was depleted in mouse embryos. Due to the mitotic stopwatch pathway, Sas6KO embryos die during development at extremely early stages and this can be rescued by depletion in p53 and other members of the pathway.

      Perhaps, not so surprisingly, these embryos do not contain centrioles, showing that in vivo, Sas6 is absolutely required for centriole duplication. More surprisingly, however, in cultures of mESCs, established and propagated in vitro, Sas-6 crispr induced KO, does not result in lack of centrioles. Instead, abnormal structures that show aberrant morphologies, length, and incapacity to assemble cilia were detected. In principle, this means that centrioles can be assembled independently of Sas-6, even if not in the correct manner.

      We again thank the reviewer for astutely pointing out the most surprising finding in our data, which is that mESCs lacking SAS-6 can still form centrioles.

      The authors interpret these differences as possible differences in the pathways involved in centriole assembly and propose different requirements in different cell types, within the same species.

      I have problems with this interpretation. To me is very difficult to understand, how the "protein" absolutely required for cartwheel assembly at the early stages of centriole biogenesis, can be essential and dispensable at the same time. Although, I may be wrong, I think the authors have not envisage other possibilities to interpret their data, which should be taken into consideration.

      We agree with the reviewer that SAS-6 is currently considered in the centrosome field as one of the “core” centriole formation or duplication factors and that it is a major component of the cartwheel scaffold during the early phase of centriole biogenesis. Although, the absence of centrioles in the Sas-6 mutant mouse embryos in vivo supports the essential function of SAS-6, and perhaps the cartwheel, in centriole formation; the mere presence of centrioles in mESCs indicates that SAS-6, and again the cartwheel, is not essential for the existence of centrioles in these cells. Because this is a major finding that we would like to bring across from our study, we will better highlight and clarify it in the new version of the manuscript as described below. In fact, in points #4 and #5, we share the same possible explanation for the difference in the phenotypes between Sas-6 mutant mouse embryos and mESCs as the reviewer.

      1) I do not know anything about ESC and ESC cultures. So maybe this is a stupid suggestion. But can't they be derived exactly from the same genetic background of SAS-6KO embryos? Because the way the two (or even 3 as there are 2 mouse KO lines) are generated is different. Why is that?

      The reviewer is correctly suggesting that the mESC can be derived from the Sas-6 mutant blastocysts. We have initially derived mESCs from the Sas-6em4/em4 mutants and performed our analyses on the centriole phenotypes in these mutants before realizing that the allele was hypomorphic (SAS-6 staining in Fig. S2F, and the appearance of centrioles at E9 in Fig. S1B). Because the surprising finding in our study is that SAS-6 does not seem to be essential for centriole presence in mESCs, as pointed out by the reviewer, we decided to generate a more convincing Sas-6-/- null allele in mESCs by deleting the entire ORF of Sas-6 (more on this point below). We would also like to direct the attention of the reviewer that we have cultured the blastocysts (E3.5) from the Sas-6em5/em5 null mutants, which as we show lack centrioles at E3.5, and the cells indeed start to form centrioles just 24 h post-culture (Fig. 3C-D).

      To build on these findings, we have already taken this a step further and generated a mESC line from the Sas-6em5/em5 mutants. These Sas-6em5/em5 –derived mESCs show CENT2-eGFP-positive centrioles, and we are currently analyzing their number and integrity, similar to our analyses of the CRISPR-generated Sas-6-/- null mESCs.

      2) Still on mESCs, are the authors sure that there are no WT Sas-6 mRNAs still present in their ESC cells? Because tiny amounts are maybe sufficient to allow the initial cartwheel structure. In FigS2B, I can see a really faint band, very faint but it is there.

      Due to the nature of the surprising finding that Sas-6 mutant mESCs can still form centrioles, we understand the concerns and suggestions of this reviewer and the other reviewers in this regard. We have generated several Sas-6 mutant alleles in mESCs (in exons 2, 4 or 5), in which we used Western blots to check whether they were null alleles or not. We used different commercial (Proteintech cat# 21377-1-AP, Sigma-Aldrich cat# HPA028187 and Santa Cruz cat# SC-81431) and non-commercial (kind gift from Renata Basto, Institute Curie) antibodies. The SAS-6 antibody from the Basto lab gave the most reliable and reproducible results. Using this antibody, and in our own interpretation, we were not able to detect SAS-6 by Western blots in Sas-6 mutant mESCs. We concluded that SAS-6 in mESCs (and mouse embryos, see below) is expressed at low levels. Of note, we always detected centrioles in the different Sas-6 mutant mESCs, even those derived from the Sas-6em5/em5 null mutant blastocysts, which as blastocysts had no detectable centrioles.

      For a more definitive knockout in mESCs, we decided to bi-allelically delete the entire Sas-6 ORF DNA from the ATG to the TAA (over 34 Kb of DNA, Fig. S2A). According to the central dogma of molecular biology, when there is no DNA, then there should be no mRNA and no protein. In confirmation of this premise, recent RT-PCR data showed that Sas-6 mRNA is not detectable in these Sas-6-/- null mESCs. Also, immunofluorescence analyses did not detect SAS-6 in these cells. We will add the RT-PCR and immunofluorescence data to the fully revised manuscript. We will also repeat the SAS-6 Western blots to achieve better band resolution.

      These Sas-6-/- mESCs started from a single cell and have been passaged up to 20 times by now without losing centrioles. SAS-6 protein was not detectable at the early passages and the mRNA is still not detectable. This is how knockouts have been and are produced. If this mutant is still not convincing, then we respectfully ask that the reviewers provide their own suggestion on what will be more convincing. In our humble opinion, this Sas-6-/- mESCs line can be used to test the specificity of the antibodies in mouse cells and not the other way around.

      3) This last point goes also with the western-blot of Figure S2C- there is still a band, very tiny between the two very tick bands (marked with *). Maybe separating proteins better will help visualizing the real Sas-6 band? Have they used the Sas6 ab in other WBs from the KO embryos, for example? Can they use the Sas6 ab in immunostaining to show if the assembled abnormal centrioles completely lack Sas6. This will allow to distinguish between the hypothesis of having some (even if not much) sas6 left?

      The answer to these questions is above in point #2. In addition, we have used the Basto lab antibody for SAS-6 for Western blots on mouse embryos, which detect low levels of SAS-6 in controls and no signal in the mutants. We will repeat the SAS-6 Western blots on mESCs to achieve better band resolution. Using this antibody for immunofluorescence showed that the Sas-6em4/em4 mutant is hypomorphic, whereas the Sas-6em5/em5 mutant showed very low, most likely background, staining (Fig. S1F). For mESCs, we decided to delete the entire Sas-6 ORF DNA in mESCs and generate homozygous Sas-6-/- null mutants. Immunofluorescence analyses did not detect SAS-6 in these cells.

      4) Then a more theoretical point? Have the authors considered that the difference is more in the stability of the abnormal structures. Let's say, without a cartwheel and maybe enough PLK4 activity and high level of other centriolar components, the centrioles are abnormally assembled- they have no cartwheel, but they are disassembled very fast in the embryo but not in ESCs?

      • *

      We agree and share the reviewer’s interpretation for the potential requirement of SAS-6 in vivo to stabilize intermediate structures, that is compensated for by other factors in mESCs. This was not directly discussed in the first version of the manuscript and we will include it in the new version.

      5) Even if there is a real difference and without Sas-6 ESCs can make centrioles that are abnormal in structure and function (at least at the cilia assemble level), the choice of words "strictly required", I am not sure it is correct. Because, since Sas-6 is described by many studies as the factor required for cartwheel assembly, which occurs very early in the pathway, this means that in mESCs centrioles can assembled without forming a cartwheel. And so that the cartwheel is actually not required for the initial building block, but more as a structure that maintains the whole centriole in an intact manner?

      We agree with the reviewer on the likely requirement of SAS-6, and therefore the cartwheel as a whole, for the symmetry and integrity of the forming centrioles, which is along the same line as in point #4. In our interpretation, “centriole formation” does not necessarily mean centriole “initiation” but rather the presence of the centriole as a structure. We will use more appropriate and specific wording to match our shared interpretation with the reviewer.

      6) The authors mentioned that in flies, abnormal Sas-6 structures have been described in certain cell types. Are these mutants, null mutants? In other words, do these structures assembled in a context of no Sas6 or abnormal Sas-6 protein or even low levels of Sas-6?

      According to the published report (Figure S3B in Rodrigues-Martins et al, 2007, PMID: 17689959) the fly brains have no detectable DSAS-6 protein. Therefore, we assume that they are Sas-6 null fly mutants. The abnormal centrioles in Sas-6 C. Reinhardtii mutants and Sas-6-/- mESCs null mutants support the conclusion that the main role of SAS-6, and perhaps the cartwheel, is in maintaining the integrity of the forming procentriole.

      • *

      Other points:

      I think the 1st sentence of the abstract appears disconnected from the rest. The same goes for the 1st sentence of the introduction. And also, what is the evidence that pluripotent stem cells rely primarily on the proper assembly of a mitotic spindle? They rely on many other things, not sure this is the first one.

      The sentences were meant to highlight the importance of cell division in stem cells. We will adjust the wording in these sentences per the reviewer’s comment to not focus on pluripotency per se.

      The authors mention that centrioles are lost in Sas6-/- after "differentiation" of mESCs. The term differentiation is not appropriate, and confusing here. Differentiation normally refer to cells that stopped proliferating and exited the cell cycle, which is not the case here, as NPCs are progenitor cells that keep cycling.

      We believe the reviewer is referring to “terminal differentiation”, when the cells exit the cell cycle and adopt their destined cell fates. The word “differentiation” in this context refers to limiting the potency of stem cells into a subset of cell fates such as NPCs, which are proliferating progenitors.

      Figure S1: Percent of cells with centrosomes was assessed by a co-staining of gtubulin and Cep164, which mark the mother centrioles. As Cep164 may be absent from centrosomes after lack of centriole maturation in sas6-/- embryos, another combination of staining should be performed to evaluate the percent of cells without centrosomes. gtubulin staining can be seen in Sas6 em5/em5 embryos, while the quantification claims total absence of centrosomes. The authors use the CENT2-eGFP transgenic line to count the number of centrioles in Figure 3, they should do the same in Figure S1.

      We will follow the reviewer’s recommendation of counting Cent2-eGFP for the assessment of centrioles in Sas-6em5/em5 (Fig. S1).

      The g-tubulin (TUBG) aggregates at the poles of dividing cells are assembled in the absence of centrioles, as shown in Sas-6em5/em5 embryo sections (Fig. S1H). In addition, we have previously observed these pericentriolar material aggregates in Sas-4-/- mutant embryos (Bazzi and Anderson, 2014), which do not contain centrioles in serial transmission electron microscopy. Therefore, we do not refer to them as centrosomes in the absence of centrioles at their core.

      Reviewer #1 (Significance (Required)):

      This study shows with a novel mouse model the requirement of centrioles during mouse development. It will be relevant to centrosome labs, the novel mouse lines will be useful to many labs working on centrioles, cilia and centrosomes.

      My expertise: centrosome biology

      We thank the expert reviewer for the critical comments and suggestions, and the positive evaluation of our manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      • *

      Here, Grzonka and Bazzi present their work on characterizing the requirement of SASS6 in mouse embryo development and in embryonic stem cell (mESC) culture. In mouse, female and male gametes lack centrioles, and early divisions occur without centrioles. De novo formation typically happens at the blastocyst stage (~E3.5). The authors generated two SASS6 knock-out strains, SASS6 em4/em4 (frameshift deletion, reported as a severe hypomorphic allele), and SASS6 em5-em5 (frameshift deletion, reported as a null allele). Mutant embryos arrest development at mid-gestation unless the p53, USP28 and USP28 pathway is perturbed. As expected, centrioles do not form in SASS6 -/- mice. However, the authors report that de novo formation of centrioles is facilitated in mESC culture conditions for SASS6 CRISPR knock-out mESCs and mESCs derived from SASS6 em5/em5 blastocysts. Centrioles are lost upon differentiation of SASS6 CRISPR knock-out mESCs into neural progenitor cells (NPCs).

      The presented study is relevant for scientists investigating the requirements for centriole formation during embryonic development. Further, it provides insights in possibly different requirements for centriole formation between stages of differentiation, as well as differences in in vivo and in vitro models.

      We thank the reviewer for finding our work relevant and insightful into the differential requirements for centriole formation depending on the cell type.

      The data represented by Grzonka and Bazzi are robust and support the manuscript and conclusions made. However, the study is predominantly descriptive, and the authors do not test the molecular pathway underlying the de novo formation of centrioles observed in SASS6 -/- mESCs. It is generally believed that de novo formation of centrioles is not possible in SASS6 knock-out cells although work from Wang and Tsou with SASS6 a oligomerization mutant suggests otherwise. A dissection of the specific factors required for the de novo formation of centrioles in the mESC context would provide more insights into de novo centriole assembly in general and would increase the impact of this work. I would support publication of the manuscript if the following points are addressed:

      We again thank the reviewer for finding the data robust and support our conclusions and interpretation. We agree with the reviewer that our study opens new questions about how mESCs manage to assemble centrioles in the absence of SAS-6. Together with the phenotypes of the Sas-6 mutant D. melanogaster and C. Reinhardtii, and the SAS-6 oligomerization mutants (but not full SAS-6 mutants) in human cell lines mentioned by the reviewer and cited in our manuscript, the data open new investigations into the exact requirements of SAS-6 and the cartwheel in centriole biogenesis in the different cellular contexts.

        • One of the main figures, ideally Figure 1, should be dedicated to the characterization of the newly generated mouse strains. This should also be elaborated in the text further. I would like to see a schematic representation of the genomic modifications. The SASS6 stainings of wt and Sas-6 knock-outs (now Figure S1F) should be shown in that context as well as the Figures S2A-C. The authors should discuss why there still appears to be SASS6 protein in the SASS6-em5/em5 Sas-6 stainings visible. Also, the western blot, especially the unspecific bands so close to the SAS-6 protein, should be discussed. Adding qRTPCR results would also be good. Per the reviewer’s requests, we will move the embryo mutant characterization (Fig. S1F) and mESCs (Fig. S2A-C) to the main figures and elaborate the text accordingly. The genomic modifications in mice are described in a detailed tabular format in Table 1 in Materials and Methods. The immunofluorescence staining in Fig. S1F was performed on mouse embryonic sections, which tend to have higher backgrounds than cultured cells; Thus, we attribute the very low percentage of SAS-6 staining in Sas-6em5/em5* mutants to higher background, especially given the lack of centrioles in these mutants at all the stages examined.

      For Western blots, we used different antibodies against SAS-6 that were either commercially available (Proteintech cat# 21377-1-AP, Sigma-Aldrich cat# HPA028187 and Santa Cruz cat# SC-81431) or non-commercial (kind gift from Renata Basto, Institute Curie). The SAS-6 antibody from the Basto lab gave the most reliable and reproducible results. Using this antibody, and in our own interpretation, we were not able to detect SAS-6 by Western blots in Sas-6 mutant mESCs (including hypomorphic alleles). We concluded that SAS-6 in mESCs (and mouse embryos, see below) is expressed at low levels. Thus, we decided to use the antibody provided by Renata Basto and shown in current Fig. S2C, although it shows two thick non-specific bands flanking the specific band for SAS-6.

      For a more definitive knockout in mESCs, we decided to bi-allelically delete the entire Sas-6 ORF DNA from the ATG to the TAA (over 34 Kb of DNA, Fig. S2A). According to the central dogma of molecular biology, when there is no DNA, then there should be no mRNA and no protein. In confirmation of this premise, recent RT-PCR data showed that Sas-6 mRNA is not detectable in these Sas-6-/- null mESCs. Also, immunofluorescence analyses did not detect SAS-6 in these cells. We will add the RT-PCR and immunofluorescence data to the fully revised manuscript. We will also repeat the SAS-6 Western blots to achieve better band resolution.

      In addition, we have used the Basto lab antibody for SAS-6 for Western blots on mouse embryos, which detect low levels of SAS-6 in controls and no signal in the mutants.

      • The authors could elaborate on the topic of mESCs as a special in vitro model for centriole biology akin to the more "primitive" origins of life such as algae.*

      We will elaborate on the topic of mESCs as a special system for centriole biology to stress the findings that mESCs without SAS-6 can still form centrioles, but also that these cells seem to tolerate centriolar aberrations, such as in Sas-6 mutants, or even the loss of centrioles, as in Sas-4 mutants, without undergoing apoptosis or cell cycle arrest.

      • Figure 4 should show timeline of embryo development, include embryo stages (E3.5, E9 etc.), group together mESCs with corresponding embryonic developmental stage. The Figure can indicate when mESCs were derived from SASS6 em5/em5 blastocysts, when they were stained and indicate the number/state of centriole formation observed.*

      We will adjust the model in Fig. 4 to accommodate the suggestions of the reviewer, but at the same time try not to overcrowd the model and dilute the main findings of the study.

      • The work from Wang and Tsou using SAS-6 oligomerization mutants should be better discussed in the context of the work presented here since centriole assembly was not affected per se but structural defects were observed, like is the case in this study.*

      We will elaborate on this finding from Wang et al. In this respect, we will note that the loss of the entire SAS-6 protein in human RPE-1 cells (on a p53-mutant background), leads to the loss of centrioles, but that the deletion of the oligomerization domain of SAS-6 in these cells leads to similar phenotypes to the total loss of SAS-6 in mESCs.

      • The observation that the ability of forming centrioles de novo in NPCs derived from ESCs is lost is interesting but the mechanisms underpinning this differentiation remain unclear. The authors at a minimum should speculate on these further.*

      We agree with the reviewer and will speculate on this finding further. This comment is along the same line as the difference in phenotype between the cells in the developing mouse embryo and mESCs, where the NPCs are more akin to the in vivo phenotype.

      CROSS-CONSULTATION COMMENTS

      Looks like we are all pretty much in agreement.

      • *

      Reviewer #2 (Significance (Required)):

      • *

      This is a well executed study with no major flaws that builds on similar studies on knocking out centriole components in mouse and other cell types. Although well-executed the study remains descriptive and lacks a clear mechanistic understanding of why de novo centriole assembly is ineffective in NPCs. As it stands the advances this study provides to the centrosome biogenesis field remain incremental.

      We thank the reviewer for the compliments about our work and agree that it opens new questions in the field about the precise roles of SAS-6 and the cartwheel in centriole biogenesis.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this publication, Grzonka and Bazzi build upon their recent work describing the role of SAS-like protein function in centriole formation during embryonic development. More specifically, they demonstrate that loss of Sas-6 in vivo and in vitro disrupts centriole formation. To this reviewer's surprise, they found that Sas-6 is required for centriole formation in embryos, yet, stem cells form centrioles with disrupted centriole length and ability to template cilia.

      • *

      We thank the reviewer for highlighting the novel and surprising aspect of our work, which is that Sas-6 mutant mESCs are still able to form centrioles. We would like to stress that SAS-4, from our previously published work, and SAS-6, in this study, are not part of the same protein family and have different structures and roles in centriole formation. The naming has its origin in “Spindle-ASsembly abnormal/defective” mutant screens performed in C. elegans. Although the phenotypes are similar in vivo, due the lack of centrioles in both cases, only mutations in Sas-4, but not in Sas-6, lead to the lack of centrioles in mESCs.

      • *

      Likely, this occurs from the residual proteins that existed prior to CRISPR-mediated knockout.

      • *

      Due to the nature of the surprising finding that Sas-6 mutant mESCs can still form centrioles, we understand the concerns and suggestions of this reviewer and the other reviewers in this regard.

      For a more definitive knockout in mESCs, we decided to bi-allelically delete the entire Sas-6 ORF DNA from the ATG to the TAA (over 34 Kb of DNA, Fig. S2A). According to the central dogma of molecular biology, when there is no DNA, then there should be no mRNA and no protein. In confirmation of this premise, recent RT-PCR data showed that Sas-6 mRNA is not detectable in these Sas-6-/- null mESCs. Also, immunofluorescence analyses did not detect SAS-6 in these cells. We will add the RT-PCR and immunofluorescence data to the fully revised manuscript. We will also repeat the SAS-6 Western blots to achieve better band resolution.

      These Sas-6-/- mESCs started from a single cell and have been passaged up to 20 times by now without losing centrioles. SAS-6 protein was not detectable at the early passages and the mRNA is still not detectable. This is how knockouts have been and are produced. If this mutant is still not convincing, then we respectfully ask that the reviewers provide their own suggestion on what will be more convincing.

      • *

      Unsurprisingly, they found that Sas-6 loss in the developing mouse activates the 53BP1-USP28-p53 surveillance pathway leading to cell death and embryonic arrest at mid-gestation, similar to their findings in Cenpj knockouts. What remains to be properly elucidated is the mechanistic differences in the requirement for Sas-6 in stem cells versus the embryo, which may be beyond the scope of a short report. As it reads, the manuscript is a compliment to their Sas-4 paper but falls short of novelty and providing large strides in revealing the role of centriolar proteins in developmental processes. Moreover, the advances beyond the requirement for centriole and associated proteins in embryology is missing, therefore enthusiasm is tempered. Below are remaining concerns that must be addressed:

      • *

      Remaining concerns:

      The authors should provide clear description of the embryonic region (neural plate & mesenchym) used to analyze centriole presence or loss in Figures 1 and S1. Was this in the forelimb vs hindlimb regions?

      The assessment of centrosomes in Fig. 1 and S1 was performed on cell types in all three germ layers in the sections that were taken from the brachial region (forelimb and heart level). The information will be added to the Materials and Methods section.

      Similar to their Cenpj-mouse data, the authors should provide data detailing the mitotic index and activation of the mitotic surveillance pathway beyond just p53 staining. As novelty is not the only criteria for publication, a thorough analysis of the Sas-6 activation of the mitotic purveyance pathway should be provided, including the crosses between Sas-6 and p53, 53bp1 and usp28 knockout crosses to demonstrate the pathway functions similarly to Cenpj loss.

      We will perform the additional experiments suggested by the reviewer that are similar to our previous work in Sas-4 mutants (Xiao*, Grzonka* et al, 2021). We will perform these analyses knowing that both Sas-4 and Sas-6 mutants lose centrioles and activate the mitotic surveillance pathway, as the reviewer indicated. In particular, we will quantify the mitotic index in the Sas-6em5/em5 mutants and perform p53 and Cl. CASP3 staining in the double mutants with 53bp1 or Usp28, to show that the pathway has been suppressed in these mutants.

      Centriole structure should be assessed in the embryos using EM to assess loss and confirm the structural defects. This would strengthen their argument and be a slight advance to their largely descriptive paper.

      Because the Sas-6em5/em5 embryos lack centrioles, as indicated by regular immunofluorescence and Ultrastructure-Expansion Microscopy (U-ExM), using EM would be an attempt to find a structure that does not exist. In our opinion, it would again be a repetition of TEM studies that we have already performed in Sas-4-/- mutant embryos, that lack centrioles (Bazzi and Anderson, 2014). Using U-ExM has advanced the centriole biology field to a level that is approaching EM resolution and, in our opinion, can substitute for EM.

      The WB for Sas-6 knockout is not convincing and should be redone. There are validated Sas-6 antibodies available from SCBT and Proteintech. It is not clear that the band is gone or if there's overlap with the non-specific band.

      The answer to this comment is shown above. In addition, we have used the Basto lab antibody for SAS-6 for Western blots on mouse embryos, which detect low levels of SAS-6 in controls and no signal in the mutants. We will also repeat the SAS-6 Western blots on mESCs to achieve better band resolution as recommended by the reviewer.

      The authors describe the centriolar structural defect in the mESCs in Figure 2C and D, and further characterize the phenotype in S2D-H. Given the role of the SAS6-CEP135-CPAP axis for centriole elongation, it is peculiar that they see elongation upon reduction of CEP135. The authors should find a rationale mechanism to explain their discordant findings. In addition, other centriole distal end components including CEP97 and CP110 should be examined to determine the structural end caping defect in the Sas-6 mESC.

      Over 70% of the centrioles in Sas-6-/- mESCs retain CEP135, but the majority of CEP135 signals (over 80%) seem to be abnormally localized. One potential explanation for the elongated centrioles in Sas-6-/- mESCs is that the mis-localization of CEP135 impacts on the integrity of the centriole and results in parts of the centriolar walls being elongated. Per the reviewer’s suggestion, we have performed U-ExM with stainings for CP110 or CEP97, that also regulate centriole capping and elongation. The preliminary data suggest that similar to WT mESCs, they localize to the ends of the abnormal centrioles in Sas-6-/- mESCs. We will quantify the percentage of normally-localized CP110 and CEP97 in Sas-6-/- mESCs and include it along with the data interpretation in the next version of the manuscript.

      • *

      In Figure 2I, J the authors state the ciliation rate for the WT mESCs was only 11%, could the authors provide an explanation for the low ciliation rate in WT mESCs? Could cells be arrested to increase the ciliation rate? In addition, is there a rational explanation for the loss of centrioles and centrosomes upon differentiation into NPCs?

      mESCs ciliation rate has been shown to be generally low (Bangs et al, 2015; Xiao et al., 2021) perhaps because the cells spend most of the cell cycle in the S-phase. mESCs require a high serum percentage and well-defined media for growth and maintenance. In our hands, attempting to arrest the cells by withdrawing serum, or reducing its percentage, resulted in cell death and a change in morphology to the differentiated phenotype (unpublished data). Our data indicate that a pluripotent state in Sas-6-/- mESCs is compatible with centriole formation but differentiation results in the loss of centrioles (for example, NPCs). Therefore, we have refrained from interfering with the cell cycle of mESCs in order to avoid these confounding effects on cellular viability and centriole formation.

      Regarding the loss of centrioles upon differentiation of Sas-6-/- mESCs into NPCs, we agree with the reviewer and will speculate on this finding further. This goes along the same line as the difference in phenotype between the cells in the developing mouse embryo and mESCs, where the NPCs are more akin to the in vivo phenotype of Sas-6 mutants. The data suggest that the formation of centrioles in Sas-6-/- mESCs is associated with the in vitro pluripotent phenotype. A more comprehensive and general characterization of centriole duplication in mESCs is a future direction to elucidate their ability to form centrioles without SAS-6.

      In figure 3F in the Sas-6−/− NPCs have a box around a cell without centrosomes yet in 3G here are some cells with centrosomes. While the authors are trying to demonstrate the decrease in centrosome in the Sas-6−/− NPCs, they should show the few cell that have centrosomes or centrosome-like structures.

      We will add another example for the minority of cells that retain centrosomes upon differentiation of Sas-6-/- mESCs into NPCs.

      CROSS-CONSULTATION COMMENTS

      • *

      As mentioned in my review; while the Sas6 model is new, it does not provide further evidence of why centriole duplication is important in developing mice aside from it causing an abortive mitosis leading to cell death. The discordant phenotype in the mESCs likely arises from residual Sas6, similar to experiments that were performed in flies with Sas-4 depletion. Moreover, the odd centriole phenotype represents a very small number of cells and is likely phenomenological.

      In addition, their work from last year demonstrated a clear connection between Cenpj loss leading to the mitotic surveillance pathway activation. They performed double knockouts that partially rescued the survival phenotype. This new work falls short of that publication.

      Reviewer #3 (Significance (Required)):

      • *

      The new publication adds a known component to the list of animal models for centrosome-opathies but fails to provide novel mechanistic insights. Dr. Bazzi's publication on Sas-4 was far more novel at the time of publication due to the multiple mouse crosses that could rescue the phenotypes. The recent publication fails to provide as much evidence or any novel insights into the role of Sas-6 (sufficient to be convincing).

      The audience will be limited to centrosome biologists and even then it may not have enough novelty to be compelling. I would recommend with the revisions to be published in a more specialized journal.

      *My expertise lies in genetic causes of microcephaly-associated with mutations in centrosome encoding proteins. *

      • *

      We thank the reviewer for taking the time to evaluate our work and provide helpful comments and suggestions. We would like to emphasize that even if a certain phenotype is expected, the experiment has to be performed to test the hypothesis, which is the case with the Sas-6 mutant embryos phenocopying the Sas-4 mutants. In our opinion, the novelty of our work goes beyond Fig. 1 to the ability of Sas-6-/- null mESCs to form centrioles. This surprising finding opens new avenues of investigation into the precise roles of SAS-6, and the cartwheel, in centriole biogenesis. We are confident that our study will provide a trigger to re-examine these roles in other cell types and organisms.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      SUMMARY

      The manuscript by Smoak et al., provides an analysis of the Hyr/Iff-like (Hil) genes in Candida species with a strong focus on C. auris. The authors demonstrate a repeated expansion of these genes in unique lineages of fungal species, many of which are associated with stronger clinical disease. There is evidence of selection operating on the gene family in the primary domain used for identification. These genes include a repeat just downstream of that core domain that changes frequently in copy number and composition. The location of these genes tends to cluster at chromosome ends, which may explain some aspects of their expansion. The study is entirely in silico in nature and does not include experimental data.

      MAJOR POINTS

      Altogether, many of the general findings could be convincing but there are some aspects of the analysis that need further explanation to ensure they were performed correctly. To start, a single Hil protein from C. auris was used as bait in the query to find all Hil proteins in yeast pathogens. Would you get the same outcome if you started with a different Hil protein? What is the basis for using Hil1 as the starting point? It also doesn't make sense to me to remove species just because there are already related species in the list. This may exclude certain evolutionary trends. Furthermore, it would be helpful to know how using domain presence and the conservation of position changes the abundance of the gene family across species? (beginning of results).

      We appreciate the reviewer’s criticisms on our strategy for identifying Hil proteins. In response, we have significantly revised our pipeline. In particular, we now combine the search results from three queries: in addition to C. auris Hil1’s Hyphal_reg_CWP domain (XP_028889033), we added the Hyphal_reg_CWP sequences from C. albicans Hyr1 and C. glabrata Hyr1. They were chosen as representatives in the two phylogenetic groups distinct from the one containing C. auris in order to avoid the bias due to the query’s phylogenetic position. Using the same criteria as we did for the original search, we identified three additional hits compared with the original 104 homologs list. In response to the criticism of the arbitrary exclusion of some species, we now include any species from the BLASTP search results as long as it is part of the 332 yeast species studied by Shen et al. 2018 (PMID: 30415838). The reason for this criterion is so that we can use the high-quality species phylogeny generated by Shen et al. 2018 to properly study the gene family evolution by reconciling the gene tree with the species tree. We additionally include the species in the MDR clade closely related to C. auris and used Muñoz et al. 2018 (PMID: 30559369) as the basis for the species phylogeny in the clade. Lastly, we no longer require the particular domain organization in classifying Hil family members. All BLASTP hits satisfying the E-value cutoff of 1x10-5 and query coverage > 50% are included.

      A major challenge in the analysis like this one is in dealing with repetitive sequences present in amplified gene families. For example, testing modes of selection on non-conserved sites is fraught. It's not clear if all sites used for these tests are positionally conserved and this should be clarified. Alignments at repeat edges will need to maintain this conservation and relatively good alignments as stated in lines 241-242 are concerning that this includes sequence that does not retain this structure necessary for making predictions of selection.

      We appreciate the reviewer’s comment. In the original manuscript, we performed two different types of analyses, one on the conserved and well-aligned Hyphal_reg_CWP domain and another on the rapidly evolving repeat region. For the former, we performed phylogenetic dN/dS analyses using maximum-likelihood, for which a reliable alignment is crucial and is the case here. The Hyphal_reg_CWP domain alignment for C. auris Hil1-Hil8 is shown below and also included as Fig. S7 in the revised manuscript: (figure in the response file)

      In the text, we added this sentence to emphasize this point: “We chose to focus on the Hyphal_reg_CWP domain because of its potential importance in mediating adhesion and also because the high-quality alignment in this domain allowed us to confidently infer the evolutionary rates (Fig. S7).”

      For the repeat domain, what we did in the original version was to calculate the pairwise dN/dS between individual repeat units found in Hil1 and Hil2. This didn’t require aligning the entire repeat regions in the two proteins, but instead relied on the alignment of the individual ~44 aa repeat units, which were highly conserved (see below). In the revised manuscript, however, we decided to focus our analyses on the Hyphal_reg_CWP domain because of a different concern, namely gene conversions between paralogs could distort the evolutionary history of the repeats (the same concern was addressed for the effector domain using an additional step of detecting recombination breakpoints, but the same analysis would be challenging for the repeat region due to alignment issues).

      (figure in the response file)

      It's also unclear to me why Figure S12 is here. The parameters for this analysis should be tested ahead of building models so only one set of parameters should be necessary to run the test. The evolutionary tests within single genes and across strains is really nice!

      We appreciate the reviewer’s suggestion. Based on the reviewer’s suggestion, we removed Fig. S12 and describe the model set up in the Materials and Methods section. We were not sure if the last point was a comment or a suggestion. We didn’t perform a population level selective sweep scan in C. auris. Such an analysis has in fact been attempted by Muñoz et al. 2021, who identified several members of the Hil family as the top candidates for positive selection (PMID: 33769478). We cited this in our Discussion:

      “Lastly, scans for selective sweep in C. auris identified Hil and Als family members as being among the top 5% of all genes, suggesting that adhesins are targets of natural selection in the recent evolutionary history of this newly emerged pathogen (Muñoz et al. 2021).”

      A major challenge for expanded gene families is rooting based on the inability to identify a strong similarity match for the full length sequence. The full alignment mentioned would certainly include significant gaps. If those gaps are removed and conserved sites only are used, does it produce the same tree? Inclusion of unalignable sequences would be expected to significantly alter the outcomes of those analysis and may produce some spurious relationships in reconciling with the species trees. Whether or not there are similar issues in the alignment of PF11765 need to be addressed as well. There's nothing in the methods that clarifies site selection.

      We appreciate reviewer’s comment and agree with the concern about alignment quality affecting phylogenetic reconstruction. To clarify, all phylogenetic analyses in this work are based on the alignment of the Hyphal_reg_CWP domain, which is well aligned (shown above for the subset of eight homologs in C. auris). Alignment of all 215 homologs is provided for readers to review (shorturl.at/kDEJ3). To clarify this choice, we now include the following in Results:

      “To further characterize the evolutionary history of the Hil family, including among closely related Candida lineages, we reconstructed a species tree-aware maximum likelihood phylogeny for the Hil family based on the Hyphal_reg_CWP domain alignment (Fig. 1C, Fig. S2).”

      We also included detailed steps for reconstructing the gene tree in Materials and Methods.

      To test the effect of gaps in the alignment on phylogenetic tree inference, we used two trimming programs, ClipKit (PMID: 33264284) and BMGE (PMID: 20626897), with author-recommended modes. They resulted in consistent gene tree results. We present the tree based on the ClipKit trimmed alignment in the main results. The root of the gene tree was inferred by jointly maximizing the likelihood scores for the gene tree based on the alignment and the evolution of the gene family within the species tree, using GeneRax (Morel et al. 2020, PMID: 32502238).

      Figure 1A: the placement of evolved pathogenesis is a little arbitrary. It's just as feasible that a single event increased pathogenesis in the LCA of C. albicans and C. parapsilosis that was subsequently lost in L. elongisporus. These should be justified or I'd suggest removing. The assignment of Candida species here also seems incomplete. The Butler paper notes both D. hansineii and C. lusitaniae as Candida species whereas they are excluded here.

      We removed Figure 1 entirely based on this and another reviewer’s comment. We note that there is broad consensus that opportunistic yeast pathogens have independently arisen multiple times, such as C. auris, C. albicans and C. glabrata. Whether Candida pathogens that are more closely related evolved separately or not are subjects of ongoing research (PMID: 24034898).

      It is tricky to include scaffolds in analysis of chromosomal location of the HIL genes. The break in the scaffold may be due to the assc repeats of these proteins alone or other, nearby repeats. Any statistics would be best done to include only known chromosomes or those that are strongly inferred by Munoz, 2021. This will change the display of Figure 7, but is unlikely to change the take home message.

      We agree with the reviewer’s concern. In the revised manuscript and with more species included, we now only analyze genomes assembled to a chromosomal level, with the exception of C. auris B8441, which is supported by Muñoz et al. 2021 as having chromosome-length sequences. The revised Figure 7 now only includes these results. We also removed the accompanying supplementary figure that showed results based on scaffold-level assemblies.

      MINOR POINTS

      Line 18: "spp." Should be "spps."

      Addressed throughout the revised manuscript.

      Line 41: I might rephrase this as "how pathogenesis arose in yeast..."

      Accepted (line 43 in revised manuscript).

      I might use a yeast-centric example around line 40 for duplication and divergence. This could include genes for metabolism of different carbon sources in S. cerevisiae.

      Accepted (lines 47-48)

      The Butler paper referenced on line 51 compared seven Candida species and 9 Saccharomyces species

      Changed (line 48)

      The autors state no other evolutionary analysis of adhesins has been performed but do not acknowledge this study: https://academic.oup.com/mbe/article/28/11/3127/1047032

      We appreciate the reviewer pointing this important reference to us. We now cite it in the introduction (line 64) and discussion (line 340)

      The first paragraph of the Results could be condensed

      Addressed.

      How was the species tree in Figure 1A obtained?

      The previous figure 1 is now removed. The species tree used throughout the manuscript is based on Shen et al. 2018 with MDR clade species added, based on Muñoz et al. 2018.

      Figure 2: In panel A, "DH" and "SS" are not defined. I'd be careful with use of "non-albicans Candida" in Figure 2B. This usually includes C. tropicalis and C. dubliniensis and may confuse the reader.

      We removed the DH and SS labels. Instead, we highlighted three clades, which were defined in previous studies. These are the Candida/Lodderomyces clade (based on NCBI taxonomy database), the MDR clade (e.g., Muñoz et al. 2018, PMID: 30559369) and the glabrata clade (e.g., Gabaldón et al. 2013, PMID: 24034898).

      How was the binding domain defined to extract those sequences are produce a phylogeny? In building a ML model, how were parameters chosen?

      We now provide the following details in the Materials and Methods section:

      “To infer the evolutionary history of the Hil family, we reconstructed a maximum-likelihood tree based on the alignment of the conserved Hyphal_reg_CWP domain. First, we used hmmscan (HmmerWeb version 2.41.2) to identify the location of the Hyphal_reg_CWP domain in each Hil homolog. We used the “envelope boundaries” to define the domain in each sequence, and then aligned their amino acid sequences using Clustal Omega with the parameter {--iter=5}. We then trimmed the alignment using ClipKit with its default smart-gap trimming mode (Steenwyk et al. 2020). RAxML-NG v1.1.0 was compiled and run on the University of Iowa ARGON server with the following parameters on the alignment: raxml-ng-mpi --all --msa $align --model LG+G --seed 123 --bs-trees autoMRE.”

      The parameters for the ML tree reconstruction is listed on the last line above. The main parameter was the evolutionary model (LG+G), which accounts for rate variations using a gamma distribution. Other protein evolution models, e.g., VT+I+G, were tested and resulted in nearly identical tree topologies.

      Figure 3C/D could be just one panel.

      The structure predictions are now reorganized and presented on their own in the new Figure 3.

      Can you relate more the fungal hit to the Hil proteins conveyed in lines 152-154?

      We appreciate the reviewer’s comment, which referred to CgAwp1 and CgAwp3, whose effector domain structures were reported in a recent study (Reithofer et al. 2021, PMID: 34962966). We now discuss them in relation to the predicted Hyphal_reg_CWP structure, by showing them in Figure 3 and describing them in the Results (lines 181) “crystal structures for the effector domains of two Adhesin-like Wall Proteins (Awp1 and Awp3b) in C. glabrata, which are distantly related to those in the Hil family were recently reported, and the predicted structure of one of C. glabrata’s Hil family members (Awp2) was found to be highly similar to the two solved structures (Reithofer et al. 2021)”

      Line 168: Should read "Hence, ..."

      The original sentence was removed, but this grammatical error was checked for and corrected.

      Label proteins along the top of Figure 4 too.

      Accepted (in new Figure 4).

      Figure 5: for tests of selection, were sites conserved across the group? What does the black number at each node indicate? Dn and Ds are given as decimals. This is based on what attribute? For panel B, it is unclear what each tip denotes i.e., Hil1_tr6. Hil1 is the gene but what is "tr6"?

      In the revised manuscript, we provide the multiple sequence alignment for the Hyphal_reg_CWP domain used for the selection analysis as Fig. S7 to illustrate the level of conservation. The black numbers at the internal nodes are numeric indices used to refer to those nodes. In the revised manuscript, we use some of them to refer to the internal branches, e.g., 12…14 in the legend. In the new Figure 5, we do not list the numeric values of Dn and Ds (aka Ka, Ks). Instead, we use a color gradient to represent the estimated dN/dS ratios. The raw estimates are available in the project github repository. Panel B in the original Figure 5 and other panels related to the evolution of the repeats are now removed.

      It's unclear why comparison of the PF11765 domain includes the MRD proteins when those aren't included in the comparison to the repeats alone. Could that skew the comparison due to unequal sample numbers or changed variation frequencies in MDR relative to the other two groups?

      These results pertaining to the evolution of the repeats are now removed.

      Table 2 doesn't add much. This section could probably be reduced to a few sentences since it's highly speculative (intraspecies variation).

      Table 2 is now Table S5. We also simplified the result section in the revised version. While the functional implications of the intraspecific variable number of tandem repeats (VNTR) is speculative, it is founded on two bases: 1) the identification of the VNTR is credible, as the copy number variation is consistent within clades but differ between clades, which is not expected if they are caused by assembly errors; 2) experimental studies in S. cerevisiae for the Flo family strongly supported a direct impact of adhesin length on the adhesive phenotype of the cells (PMID: 16086015).

      Table 3 is not needed.

      Table 3 is now removed.

      Figure 6 - color coding in 6A needs to be explained. I'm assuming this is a taxonomical coding.

      In the revised Figure 6A, the coloring scheme is consistent with what we used in Figure 1 based on the three clades, and a legend is provided.

      Figure 1B is unnecessary. A Model of the protein indicating domains is sufficient here. Figure 1C needs labels for all termini, not just the pathogenic red branches. The figure doesn't provide clear association between adhesin families and the associated species. This could be omitted, especially since Flo is often associated with Saccharomyces species. Figure 1D is unnecessary.

      We have removed the original Figure 1.

      SIGNIFICANCE

      The work here is sorely needed in expanded gene families and in fungi specifically. No analysis at this level has been performed, to the best of my knowledge, in any fungal associated gene family and certainly not in relationship to pathogenic potential. The authors do a good job in citing the foundational literature upon which their study builds in most cases (one exception is noted above). It would be of general interest to those interested in the evolution of virulence, but the analysis is tricky. This is the biggest drawback I currently have as some of the information to assess the results is missing.

      We really appreciate the reviewer's positive comments. We agree and plan to explore the relationship between the adhesin family evolution and virulence phenotypes.

      Expertise: gene families, evolution dynamics, human fungal pathogens

      Reviewer #2

      SUMMARY

      Gene duplication and divergence of adhesin proteins are hypothesized to be linked with the emergence of pathogenic yeasts during evolution; however, evidence supporting this hypothesis is limited. Smoak et al. study the evolutionary history of Hil genes and show that expansion of this gene family is restricted to C. auris and other pathogenic yeasts. They identified eight paralogous Hil proteins in C. auris. All these proteins share characteristic domains of adhesin, and the structural prediction supports that their tertiary structures are adhesin-like. Evolutionary analysis of protein domains finds weak evidence of positive selection in the ligand-binding domain, and the central domain showed rapid changes in repeat copy number. However, performed tests cannot unambiguously distinguish between positive selection and relaxed selection of paralogs after gene duplication. Some alternative tests are suggested that may be able to provide more unambiguous evidence. Together with these additional tests, the detailed phylogenetic analyses of Hil genes in C. auris might be able to better support the hypothesis that the expansion and diversification of adhesin proteins could contribute to the evolution of pathogenicity in yeasts.

      We appreciate the reviewer’s comments and will address specific points below.

      MAJOR COMMENTS

      The authors present extensive analyses on the evolution of Hil genes in C. auris. There is significant merit in these analyses. However, the analyses conducted so far are incomplete, lacking proper consideration of other confounding factors. Detailed explanations of our major comments are listed below.

      1. First, the authors restricted genes in the Hil family to those only containing the Hyphal_reg_CWP domain. Yet, previous work included genes containing the ligand-binding domain or the repeat domain as Hil genes. More justification is needed whether the author's choice represents the natural evolutionary history of Hil genes appropriately. For instance, are the genes only containing the ligand-binding domain monophyletic or polyphyletic? We recommend including the phylogeny of all the Hil candidate genes, to discern whether evolutionary histories of the repeat domain and ligand-binding domain are congruent. Authors can use this phylogeny as justification to focus only on the ligand-binding domain containing genes.

      Butler et al. 2009 (PMID: 19465905) defined the Als family and the Hyr/Iff family as having either the N-terminal effector domain or the intragenic tandem repeats (ITR). Their rationale for the latter was that the ITS sequences were often conserved across species. Upon close inspection (Fig. S19,20 in that paper), however, we found that the ITS tend to be conserved in closely related species, but diverged among more distantly related species. Moreover, proteins in those figures that only contain the ITS and not the ligand-binding domains are all missing either the signal peptide, the GPI-anchor or both. This raises questions as to whether these proteins sharing the ITS sequence alone act as adhesins.

      More generally, defining the evolutionary history of proteins with multiple domains is complicated by recombination, which causes different parts (e.g., domains) of the protein to have distinct evolutionary histories. In fact, our study and others show that there exist “chimeras” that combine the effector domain from one adhesin family and the repeat sequence found in another (Zhao et al. 2011, PMID: 21208290, Oh et al. 2019, PMID: 31105652). In these cases, one phylogenetic tree is insufficient to describe the evolutionary history of the whole protein. We chose to define the Hil family by the Hyphal_reg_CWP domain and thus focus on the evolutionary history of this region because 1) while tandem repeat regions also contribute to adhesion in yeasts (Rauceo et al. 2006, PMID: 16936142), the effector domain likely plays a more important role in ligand binding and specificity. Therefore, we believe using the effector domain to define a protein family is more likely to group proteins with similar functional properties than if the repeat sequences were used. Also, while putative fungal adhesins lacking a recognizable ligand-binding domain exist, they are rare (Lipke 2018, PMID: 29772751); 2) The repeat region evolved much more rapidly than the effector domain, as we illustrate in Figures 2, 4 and 6 in our revised manuscript. While some repeat units are highly conserved, e.g., the ~44 aa unit found in Hil1-4 in C. auris and close relatives in the MDR clade, many others are short and degenerate, making it difficult to reliably identify homologs sharing the repeat. Besides, since each protein could contain many distinct repeats, it is not clear how one defines two sequences as belonging to the same family if they share one out of six types of repeats. We acknowledge that this definition leaves out the evolutionary history defined by the tandem repeats, which may reveal intriguing evolutionary dynamics, with functional implications. A recent review for the Als family discussed similar definition challenges and partly supported our choice (Hoyer and Cota, 2016, PMID: 27014205).

      In the analysis of positive selection, the authors do not adequately control for the effect of recombination on the evolutionary histories of protein sequences, especially given that Hil genes are rich in repetitive sequences. To account for recombination, GARD, an algorithm detecting recombination, should be performed to detect any recombination breakpoints within a protein domain. If recombination did occur within a protein domain, the authors should treat the unrecombined part as a single unit and use the phylogenetic information of this part to proceed with PAML analysis, instead of using the phylogeny of the entire protein domain. The authors should consider doing GARD analysis for the ligand-binding and repeat domains. For the repeat domain, low BS values in Fig. 5C indicate recombination between repeat units. Thus, the authors should analyze each repeat unit with GARD and re-analyze dN/dS.

      We deeply appreciate the reviewers’ criticism here. In the revised manuscript, we removed the analysis of the repeat units and followed the reviewers’ suggestion to carry out GARD analysis on the effector domain, which we now show reveals evidence of intra-domain recombination. Using the inferred breakpoints (Fig. S8), we identified two putatively non-recombining partitions and performed all downstream phylogenetic analyses on them separately. The results are presented in Fig. 5 and Table S6. Compared with the previous result based on the entire Hyphal_reg_CWP domain alignment, the new results reveal clearer patterns, including significantly elevated dN/dS on a subset of the branches. Newly added branch-site test results support a role of positive selection on the effector domain during the expansion of the Hil family in C. auris, suggesting functional diversification following gene duplications.

      The authors concluded positive selection in the ligand-binding domain based on the branch-wise model of PAML. Yet, w values were not higher than one, and it's unclear whether the difference in selective pressures the authors claimed here is biologically significant. Overall, what the authors present so far seems to be weak evidence of positive selection but is much more consistent with variation in the degree of purifying selection or evolutionary constraint. Using the site-wise model (m7 vs. m8) in PAML would allow the authors to detect which residues of the ligand-binding domain underwent recurrent positive selection. Combining the evolutionary information of protein residues and the predicted 3D structure will provide molecular insights into the biological impact of rapidly evolving residues. This would be a significant addition and raise the significance of the study, besides providing potentially stronger evidence of positive selection.

      We appreciate the reviewers’ criticism and suggestions. In the revised manuscript, we performed site tests comparing models M2a vs M1a, M8 vs M7 and M8a vs M8. For partition 1 (P1-414), all three tests were insignificant. For partition 2 (P697-987), the M2a vs M1a test was insignificant (P > 0.05) but M8 vs M7 and M8a vs M7a were both significant at a 0.01 level, and the omega estimate for the positively selected category was estimated to be ~15. The site tests require all branches to evolve under the same selection regime. To relax this constraint, we performed additional branch-site tests by designating the branches with an estimated dN/dS > 10 as the foreground (based on the free-ratio model estimates). This test was significant for both branches at a 0.01 level and the Bayes Empirical Bayes (BEB) procedure identified a total of 5 residues as having been under positive selection. Although three of the five residues, located in the C-terminus of the Hyphal_reg_CWP domain, are part of the α-crystallin domain, we refrain from drawing any functional conclusions because 1) the BEB procedure is known to be lacking power in identifying positively selected residues and 2) we still lack structure-function relationship for the α-crystallin domain. But we agree and believe that this line of analysis is promising in yielding functional insight into the evolution of the effector domain in the family.

      MINOR COMMENTS

      1. In Fig 1c, the figure legend should include more specific details: which adhesin proteins are shown here? Please specify species names on the species tree

      Figure 1 is removed in the revised manuscript

      In Fig 3E, secondary structures are labeled with the wrong colors. Sheet: purple, helix: yellow

      In the revised manuscript, the structures of SRRP-BR (original 3E) is now colored in a single color.

      What's the ligand-binding activity of the b-solenoid fold? How structurally similar are C. auris PF 11765 domains compared to C. glabrata Awp domains? This information will support the role of adhesin for the ligand-binding domain of Hil genes.

      We discuss the ligand-binding activity of the β-solenoid as follows in Discussion:

      “The elongated shape and rigid structure of the β-helix are consistent with the functional requirements of adhesins, including the need to protrude from the cell surface and the capacity for multiple binding sites along its length that facilitate adhesion. In some bacterial adhesins, such as the serine rich repeat protein (SRRP) from the Gram-positive bacterium, L. reuterii, a protruding, flexible loop in the β-helix was proposed to serve as a binding pocket for its ligand (Sequeira et al. 2018). Such a feature is not apparent in the predicted structure of the Hyphal_reg_CWP domain. Further studies are needed to elucidate the potential substrate for this domain and its mechanism of adhesion.”

      We also compare the structures of the C. auris Hil1/Hil7 Hyphal_reg_CWP domain and the CgAwp1/3 in Figure 3, with this in the legend “(C) Crystal structure of the C. glabrata Awp1 effector domain, which is highly similar to C. auris Hil1 and Hil7, but with the disulfide bond in a different location.”

      We added a section in the Discussion to comment on the structure-function relationship based on known β-helix (aka β-solenoid) structures. The main insight comes from similar structures identified through DALI searches, many of which are bacterial and viral surface proteins mediating adhesion. The ligand binding pocket and specificity would require additional structural studies to elucidate.

      In lines 248-249, the authors should also consider the influence of evolutionary history. For instance, repeats within the same Hil protein appeared later in evolution, compared to Hil gene duplication, and therefore these repeats experience less time for sequence divergence.

      In the revised manuscript, we removed the analyses pertaining to the evolution of the repeat regions due to multiple challenges including alignment, potential of gene conversion and recombination. This is an important and intriguing aspect of adhesin family evolution that we plan to follow up in future work.

      Although the bioinformatic evidence of C. auris Hil genes acting as adhesins is strong, it is still worthwhile to discuss the experiments of confirming the function of adhesins.

      We agree with the reviewer and acknowledge in the revised manuscript the limitation of our work:

      “Future experimental tests of these hypotheses will be important biologically for improving our understanding of the fungal adhesin repertoire, important biotechnologically for inspiring additional nanomaterials, and important biomedically for advancing the development of C. auris-directed therapeutics.”

      SIGNIFICANCE

      Overall, this study is interesting to investigate the evolutionary history of a crucial virulent gene in C. auris. Such evolutionary understanding will help us identify critical molecular changes associated with the pathogenicity of an organism during evolution, providing insights into the emergence of pathogens and novel strategies to cure fungal infections. The research question is important; however, the current analyses on the positive selection are incomplete, so the conclusion is modest so far. We recommend that the authors re-do the PAML analysis with the above considerations. This work will bring more significance to the mycology field if the functional impact of rapid evolution in protein domains can be supported or inferred.

      This manuscript is well-written, and the authors also did a great job specifying all the necessary details in the M&M.

      We appreciate the reviewers’ positive comments.

      Reviewer #3

      Summary:

      The manuscript by Smoak et al. provides considerable information gleaned from analysis of HYR/IFF genes in 19 fungal species. A specific focus is on Candida auris. The main conclusion is that this gene family repeatedly expanded in divergent pathogenic Candida lineages including C. auris. Analyses focus on the sequences encoding the protein's N-terminal domain and tracts of repeated sequences that follow. The authors conclude with the hypothesis that expansion and diversification of adhesin gene families underpin fungal pathogen evolution and that the variation among adhesin-encoding genes affects adhesion and virulence within and between species. The paper is easy to read, includes clear and attractive graphics, as well as a considerable number of supplementary data files that provide thorough documentation of the sources of information and their analysis.

      We appreciate the positive comment.

      MAJOR COMMENTS:

      • Are the key conclusions convincing?

      Overall, the authors' conclusions are supported by the information they present. However, the overall conclusion is stated as a hypothesis and that hypothesis is not particularly novel. The idea that expansion of gene families associated with pathogenesis occurs in the pathogenic species dates back at least to Butler et al. 2009, who first presented the genome sequences for many of the species considered here.

      We appreciate the reviewer’s comment. Our main conclusions are 1) the Hil family is strongly enriched in distinct clades of pathogenic yeasts after accounting for phylogenetic relatedness. This enrichment results from independent duplications, which is ongoing between closely related species; 2) the protein sequence of the Hil family homologs diverged rapidly following gene duplication, driven largely by the evolution of the tandem repeat content, generating large variation in protein length and β-aggregation potentials; 3) there is strong evidence for varying levels of selective constraint and moderate evidence for positive selection acting on the N-terminal effector domain during the expansion of the family in C. auris as our focal species. Based on these observations, we propose that expansion of adhesin gene families is a key preliminary step towards the emergence of fungal pathogenesis.

      Indeed, some version of this hypothesis has been proposed by several groups before us. We fully acknowledged this in our previous as well as the revised manuscript, by citing Butler et al. 2009 (PMID: 19465905), Gabaldón et al. 2013, 2016 (PMID: 24034898, 27493146). Our study built on these earlier efforts and extended them by addressing several limitations. First, we performed phylogenetic regression to test for the association between gene family size and the life history trait (pathogen or not) in order to properly account for the phylogenetic relatedness. This was not done in previous studies. Second, most earlier studies didn’t construct a family-wide gene tree to fully investigate the evolutionary history of the family. Gabaldón et al. 2013 did a phylogenetic analysis for the Epa family and a few others within the Nakaseomycetes, revealing highly dynamic expansions. In the present study, we expanded this effort by comprehensively identifying homologs within the Hil family in all yeasts and beyond. Third and perhaps the most important novelty in our study is our detailed analysis of sequence divergence and role of natural selection during the evolution of the family post duplication. This allowed us to present a complete picture of the family’s evolution, not just in its increase in copy number but also its diversification after the duplications, which is a key part of how gene duplications contribute to the evolution of novel traits. As such, we believe our study provides strong support for the above hypothesis.

      One key issue with a manuscript of this type is whether genome sequence data are accurate. The authors are not the first research group to take draft genome sequence data at face value and attempt to draw major conclusions from it. The accuracy of public genome data continues to improve, especially with the emergence of PacBio sequencing. Because the IFF/HYR genes contain long tracts of repeated sequences, genome assemblies from short-read data are frequently inaccurate. For example, is it reasonable to have confidence that the number of copies of a tandemly repeated sequence in a specific ORF is exactly 21 (an example taken from Table 2) when each repeat is 40+ amino acids long and highly conserved? Table S6 would benefit from inclusion of the type of sequence data used to construct each draft genome sequence. It is also reasonable to question whether the genome of the type strain is used as a template to construct the draft genomes of the other strains. If that was standard practice, conservation of the repeat copy number among strains might be an artefact. Conservation of repeat sequences to the degree shown is not a feature of the ALS family, a point of contrast between gene families that could be explored in the Discussion.

      We appreciate the reviewer’s comment and agree strongly that a key limitation in gene family evolution studies like ours is the quality of the genome assembly. In the original manuscript, we took several steps to ensure the completeness and accuracy of the Hil family homologs, primarily by basing our results on the high quality RefSeq collection of assemblies, and supplementing it with two fungi-specific databases. In the revised manuscript, we performed further quality analyses to assess and correct for inaccuracy in the BLASTP hits. Because RefSeq aims to provide a stable reference, it is often slow in replacing older assemblies with newer ones based on improved technologies. We thus compared the RefSeq hits for species in which a newer, long-read based assembly had become available. The results are documented in Text S1 and in summary, while we did find examples of missing homologs and inconsistent sequences, the problems were isolated to specific species and the inconsistency pertains only to the tandem repeat regions. Regarding the specific example of within-species variable number of tandem repeats (VNTR) in C. auris Hil1-Hil4, we are confident of both the copy number and the sequence variation for two reasons. First, all C. auris strain genomes analyzed in this study were assembled de novo rather than based on a reference genome, and all were long-read based (PacBio) (Table S4). Second, empirically, we found the VNTR identified in Hil1-Hil4 agree among strains within one of the four clades of C. auris while differing between clades (Table S5). Since assembly errors are not expected to produce clade-specific patterns, we believe this is strong evidence for the VNTR identified being real.

      We also appreciate the reviewer’s suggestion on discussing the conservation of the repeats as an interesting trait for a group of Hil proteins in comparison to the Als family. We now added a section in Discussion focusing on the special properties of this group of Hil proteins.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Due to the nature of my comments, this review will not be anonymous. I will include some of the data from my laboratory to further illustrate the point about the quality of draft genome sequences, especially for gene families that contain repeated sequences. My laboratory group has spent the past several years looking at the families of cell wall genes in these species and know that the C. tropicalis genome sequence used in the current analysis is highly flawed. There is even a manuscript from several years ago that documents problems in the assembly (doi: 10.1534/g3.115.017566). There is a new PacBio sequence available that has considerably improved data for this group of genes, but still is not perfect. We designed primers and amplified the various coding regions to verify whether the IFF/HYR were correct in the draft genome sequences. For C. tropicalis, we know that 7 of the genes listed in this paper are broken (i.e. prematurely terminated) giving a false impression of their construction. The current study did not verify any gene sequences, so broken/incomplete genes are a stumbling block for developing conclusions.

      We deeply appreciate the reviewer pointing out the flaws in the C. tropicalis genome. Using the PacBio sequence-based new assembly, we were able to confirm the reviewer’s comment on the sequence and annotation error in the RefSeq assembly for C. tropicalis. We listed the comparisons between the two assemblies in Table S8. Because the differences reside outside of the Hyphal_reg_CWP domain, they don’t impact our phylogenetic analyses, which are based on the effector domain alignment. To determine if this is a widespread issue affecting all genome assemblies based on older technologies, and in response to the reviewer’s criticism, we systematically checked the sequences of BLASTP hits based on the RefSeq assemblies against newer, long-read based ones when available. As detailed in Text S1 in the revised manuscript, the problems seen in C. tropicalis were not observed in four other species. While the sample size is small, we believe the issues with C. tropicalis are likely due to a combination of specific issues with the original assembly and special properties of the genome.

      Similarly, the recent work from Cormack's lab features a PacBio C. glabrata sequence (doi: 10.1111/mmi.14707). The paper details how the authors focused on accurate assembly of the types of genes studied here. Sequences from the current project should be compared to the PacBio assembly to determine if they provide the same results.

      We compared the sequences of the three C. glabrata Hil homologs identified in the RefSeq assembly (GCF_000002545.3) to the best BLAST hits in one of the new Cormack lab assemblies for (BG2 strain, GCA_014217725.1). Two of the three proteins showed identical sequences between the assemblies. One of them is longer in the new assembly than in the RefSeq (1861 vs 1771 aa, XP_447567.2, QHS67215.1). The main difference, however, was the number of hits recovered. Performing BLASTP searches in the new assembly recovered 13 hits vs 3 from the RefSeq assembly, of which 12 were in the subtelomeric region. For this reason, we used the new assembly as the basis for the Hil homologs in our subsequent analyses. To determine if we missed homologs in other genomes due to incomplete subtelomeric regions in the RefSeq assemblies, we repeated the BLASTP search in four other genomes (Text S1). In one of the four species, C. nivariensis, we recovered one more homolog than in the RefSeq. In all other three, we identified the same number (S. cerevisiae: 0, K. lactis: 1, C. albicans: 12), suggesting that the issues seen in C. glabrata is likely specific to this species and its RefSeq assembly.

      Another part of the study that deserves additional attention or perhaps altered presentation is the idea that the Iff/Hyr N-terminal domain binds ligands. The literature on the Iff/Hyr proteins is limited. In my opinion, though, the authors of this paper could more completely present the information that is known. The paper by Uppuluri et al. is cited (doi: 10.1371/journal.ppat.1007056), but I did not see any information about their data regarding interaction of C. albicans Hyr1 with bacterial proteins mentioned in the manuscript under review. It is formally possible that the N-terminal domain of Iff/Hyr proteins does not bind a ligand. The current manuscript includes a great deal of speculation on that point, suiting it better to a Hypothesis and Theory format rather than other types of publications.

      We appreciate the reviewer’s criticism and suggestion. We made two revisions based on the comments. First, we no longer refer to the Hyphal_reg_CWP domain as ligand-binding. Instead, we refer to it as the effector domain, following existing practices in the field (Lipke 2018, PMID: 29772751, de Groot et al. 2013, PMID: 23397570). Second, during the description of the predicted structure for the domain, we mentioned that it lacks an apparent binding pocket as suggested/identified in other β-solenoid proteins with carbohydrate binding abilities. Therefore, we suggest that the potential substrate and mechanism of binding by this domain remain to be determined with further experiments. We do, however, believe that there is strong evidence for the domain being involved in adhesion. A recent study (Reithofer et al. 2021) presented structural and phenotypic characterization of three Adhesin wall-like proteins (Awp1,2,3) in C. glabrata. In particular, experimental studies of CgAwp2, a Hil family protein, showed that its deletion resulted in the reversion of the hyperadhesive phenotype in one of the C. glabrata strains. Plastic was one of the substrates being evaluated, although, as the reviewer’s work pointed out, adhesion to plastics doesn’t indicate ligand binding, as it can be mediated by non-specific hydrophobic interactions (Hoyer and Cota 2016, PMID: 27014205). Nonetheless, the results presented in Reithofer et al. 2021 and other lines of evidence presented in the current work strongly supported adhesin functions of the Hil family.

      Table 1 attempts to offer evidence that the Iff/Hyr N-terminal domain has adhesive function but falls short of convincing the reader. One of the example structural templates is a sugar pyrophosphorylase that seems irrelevant to the current discussion. In the column called "Function", the word adhesin is found several times, but no detail is presented. The only entry that offers an example ligand indicates that the domain binds cellulose which is not likely relevant for mammalian pathogenesis, the main focus of the work. Other functions listed include self-association and cell aggregation--using the N-terminal domain. It is formally possible that Iff/Hyr proteins drive aggregation using the N-terminal domain and beta-aggregation sequences in the repeated region. The authors should develop these ideas further. Discussion of adhesive/aggregative function related to the ALS family can be found in Hoyer and Cota, 2016 (doi: 10.3389/fmicb.2016.00280).

      We appreciate the reviewer’s comments. In the revised manuscript, we removed Table 1, which was based on I-TASSER identified templates. Instead, we identified similar structures in the PDB50 database to the AlphaFold2 prediction for the Hyphal_reg_CWP domain in C. auris Hil1 using DALI (Table S3). We described the functional implications based on this list as follows:

      “We identified a number of bacterial adhesins with a highly similar β-helix fold but no α-crystallin domain (Table S3), e.g., Hmw1 from H. influenzae (PDB: 2ODL), Tāpirins from C. hydrothermalis (PDB: 6N2C), TibA from enterotoxigenic E. coli (PDB: 4Q1Q) and SRRP from L. reuteri (PDB: 5NY0). For comparison, the binding region of the Serine Rich Repeat Protein 100-23 (SRRP100-23) from L. reuteri was shown in Fig. 3F (Sequeira et al. 2018). Together, these results strongly suggest that the Hyphal_reg_CWP domain in the C. auris Hil family genes mediate adhesion.”

      One line of evidence that suggest the Hyphal_reg_CWP domain may have ligand-binding activity is from the L. reuteri SRRP-BR, which is one of the bacterial adhesins identified as having a highly similar β-helical structure (but missing the α-crystallin domain). In Sequeira et al. 2018 (PMID: 29507249), the authors showed via both in-vitro and in-vivo experiments that this domain “bound to host epithelial cells and DNA at neutral pH and recognized polygalacturonic acid (PGA), rhamnogalacturonan I, or chondroitin sulfate A at acidic pH”. However, the predicted structure for the Hyphal_reg_CWP domain in C. auris Hil1 and Hil7 lack a protruding, flexible loop in the β-helix, which was proposed to serve as a binding pocket for the ligand in SRRP-BR. We therefore commented in the text “Such a feature is not apparent in the predicted structure of the Hyphal_reg_CWP domain. Further studies are needed to elucidate the potential substrate for this domain and its mechanism of adhesion.”

      We also appreciate the reviewer’s suggestion to discuss the potential role of the Hil proteins in mediating adhesion vs cell aggregation. We now have a section in Discussion that focuses on the potential role of the β-aggregation sequences especially in the subset of Hil proteins led by C. auris Hil1-Hil4, which have an unusually large number of such sequences. We discuss the recent literature suggesting the potential of such features mediating cell-cell aggregation.

      The incredibly large number of figures that focus on the repeated sequences in the genes does not appear to include mention of the idea that these regions are frequently highly glycosylated. Knowing how much carbohydrate is added to these sequences in the mature protein would also have bearing on whether the beta-aggregation potential is realized. The Iff/Hyr proteins could stick to other things based on ligand binding (adhesion), hydrophobicity, aggregative activity, etc. Not much is really known about protein function so the conclusions are only speculative. The authors are largely accurate in presenting their conclusions as speculative, but the conclusions are not developed fully and always land on the idea that the N-terminal domain has adhesive function when that aspect clearly is not known.

      We appreciate the reviewer’s comment. We have performed N- and O-glycosylataion predictions for the Hil family proteins in C. auris as a focal example and presented the results in Figure 2 of the revised manuscript. Briefly, we found that all eight proteins are predicted to be heavily O-glycosylated (Fig. 2C). N-glycosylation is rare except in Hil5 and Hil6, in regions with a low Ser/Thr content (Fig. 2C). We also deemphasized the ligand-binding ability of the effector domain and its importance in assessing the adhesin function of the Hil family proteins. At the same time, we highlighted other mechanisms as the reviewer pointed out, such as aggregative activities, in our discussion on the potential importance of the large number of β-aggregation motifs.

      Another aspect of the analysis that is not mentioned is that several of the species discussed are diploid. What effect does ploidy have on the conclusions? Most draft genomes for diploid species are presented in a haploid display, so are not completely representative of the species. Additionally, some species such as C. parapsilosis are known to vary between strains in their composition of gene families, with varying numbers of loci in different isolates.

      We appreciate the reviewer raising this issue. The potential impact of having diploid genomes represented as haploids is twofold. First, if the genome sequencing was performed on a diploid cell sample with some highly polymorphic regions, that would present difficulties to the assembly and could result in poorly assembled sections. Second, either because of the first issue, or because the researchers used the haploid phase of the organism for sequencing, the representative haploid genome will not be “completely representative of the species” as the reviewer suggested. The second problem is not specific to diploids – even for haploids, any single or collection of genomes would represent just a slice of the genetic diversity in the species. We did two things to look into this. First, we analyzed multiple strains in C. auris to reveal both Hil family size variation and also sequence polymorphism, particularly in the tandem repeat region. We also, as part of the quality control, compared and searched assemblies for different strains of some species when available. We agree that characterizing multiple genomes in a species is important for fully revealing the gene pool diversity and could have important consequences on our understanding of the emergence of novel yeast pathogens.

      Regarding the first issue, we checked the original publications for two large-scale yeast genome sequencing projects that included 10 of the 32 species in the present study (Dujon et al. 2004, PMID: 15229592 and Butler et al. 2009, PMID: 19465905). In Dujon et al. 2004, the authors stated that haploid cells were used in cases where the species has both haploid and diploid phases. In Butler et al. 2009, the authors said in the Methods that “for highly polymorphic regions of diploid genomes, initial sequence assemblies were iteratively re-assembled in regions of high polymorphism to minimize read disagreement from the two haplotypes while maximizing coverage.”. Therefore, the potential issue of heterozygosity is likely minimal. In addition, many diploid yeasts have large regions in their genomes being homozygous, both as a result of clonal expansion and also due to loss of heterozygosity (LOH), as documented in C. albicans and other Candida species (e.g., PMID: 28080987). Nonetheless, we acknowledge that this issue is yet another challenge to having high-quality, complete genome assemblies. In the discussion, we fully acknowledge the limitation of our study by genome assemblies, and believe that ongoing improvement thanks to the development of long-read technologies will allow more in-depth studies, particularly in the subtelomeric regions and for repeat-rich sequences.

      The manuscript concludes that having more genes is better, that the gene family represents diversification that must be driven by its importance to pathogenesis, without recognizing that some species evolve toward lower pathogenesis. This concept could be explored in the Discussion. …My own experience makes me wonder if the authors found any examples of species that provide an exception to the idea that having more genes is better and positively associated with pathogenesis. The parallel between IFF/HYR and ALS genes is made many times in the manuscript. Spathaspora passalidarum, a species that is not pathogenic in humans, but clearly within the phylogenetic group examined here, has 29 loci with sequence similarity to ALS genes. How many IFF/HYR genes are in S. passalidarum?

      We appreciate the reviewer’s comment. We will address the two comments above together as they are related. First of all, S. passalidarum is now included in our extended BLAST search list and we identified a total of 3 homologs in this species. When compared with the related Candida/Lodderomyces clade, which includes C. albicans, the Hil family in this species is relatively small (3 vs. >10). More generally, we observe a significant correlation between the Hil family size and the species’ pathogenic potential (Figure 1B and the phylogenetic regression result in the text).

      Regarding the first comment, we did identify two species that had a large Hil family (>8 based on C. auris) and yet were not known to infect humans. One of them, M. bicuspidata, has 29 Hil homologs and is interestingly a parasite for freshwater animals, such as Daphnia. The other species, K. africana, has 10 homologs and its ecology is not well described in the literature. With respect to the relationship between adhesin family and pathogenicity, we would like to make two points. First, as mentioned above, we observed a strong correlation between the Hil family size and the pathogen status, after correcting for phylogenetic relatedness, suggesting that expansion of the Hil family is a shared trait among pathogenic species. This doesn’t rule out the possibility that some species may have an expanded adhesin family, such as the example the reviewer mentioned, for reasons other than infecting a human host. Second, a key point in our work is that expansion of the adhesin family is only the first step – the crucial contribution of gene duplications to adaptation is not just in the increase in copy number, but also in providing the raw materials for selection to generate novel phenotypes. On that front, we documented the rapid divergence of the central domains both between and within species, as well as signatures of relaxed selective constraint and positive selection acting on the effector domain following gene duplications in C. auris, both of which support the above theme.

      There are several current taxonomies for the species in this region of the tree. The source of the names used in this paper could be specific more completely.

      We appreciate the reviewer’s comment. We now gave the complete Latin names for all species in Figure 1 and only use abbreviated names, e.g., C. auris, after the first occurrence. For species with multiple names in the literature, we followed the species name and phylogenetic placement in Shen et al. 2018 (PMID: 30415838).

      The Results and Discussion sections are largely redundant. The tone of the paper is conversational, making it easy to read, but there seems little left to say in the Discussion that has not already been mentioned as the background for the various types of analyses. The authors should revise the paper to eliminate discussions of published literature from the Results and expand the Discussion to include some of the themes that have not been mentioned yet.

      We appreciate the reviewer’s comment. In the revised manuscript, we have moved discussion points from the Result to the Discussion section. We also overhauled the Discussion to focus on the implications based on, but not already covered, in the Result part, including the points the reviewer suggested, e.g., the implications of the structure on adhesion mechanism.

      Another point that the authors do not mention is documented recombination between IFF and ALS genes (doi: 10.3389/fmicb.2019.00781) and the effect of that process on evolution among these gene families.

      We appreciate the reviewer’s comment. We now mention this and related observations in Discussion as part of the discussion on the mutational mechanisms for the evolution of the family:

      “Diversification of adhesin repertoire within a strain can arise from a variety of molecular mechanisms. For example, chimeric proteins generated through recombination between Als family members or between an Als protein’s N terminal effector domain and an Hyr/Iff protein’s repeat region have been shown (Butler et al. 2009; Zhao et al. 2011; Oh et al. 2019). Some of the adhesins with highly diverged central domains may have arisen in this manner (Fig. S10).”

      My reading of the work by Xu et al. 2021 (doi: 10.1111/mmi.14707) does not match the direction of its presentation in the current paper. Oh et al., 2021 (doi: 10.3389/fcimb.2021.794529) discussed that point recently, providing another point for the Discussion in the current paper.

      We appreciate the reviewer’s comment and agree that our original reading of Xu et al. 2021 was incorrect. Instead of suggesting a higher mutation rates in the subtelomeric region, the authors instead suggested the evolution of the Epa family in the subtelomere was driven by Break-Induced Replication. We now update our discussion in the following way, also citing Oh et al. 2021

      “Finally, as reported by (Muñoz et al. 2021), we found that the Hil family genes are preferentially located near chromosomal ends in C. auris and also in other species examined (Fig 7), similar to previous findings for the Flo and Epa families (Teunissen and Steensma 1995; De Las Peñas et al. 2003; Xu et al. 2020; Xu et al. 2021) as well as the Als genes in certain species (Oh et al. 2021). This location bias of the Hil and other adhesin families is likely a key mechanism for their dynamic expansion and sequence evolution, either via ectopic recombination (Anderson et al. 2015) or by Break-Induced Replication (Bosco and Haber 1998; Sakofsky and Malkova 2017; Xu et al. 2021). Another potential consequence of the subtelomeric location of Hil family members is that the genes may be subject to epigenetic silencing, which can be derepressed in response to stress (Ai et al. 2002). Such epigenetic regulation of the adhesin genes was found to generate cell surface heterogeneity in S. cerevisiae (Halme et al. 2004) and leads to hyperadherent phenotypes in C. glabrata (Castaño et al. 2005).”

      I might have missed it, but I could not find what constitutes a BLAST-excluded sequence (Table S7). Additional explanation (or making the explanation easier to find) would help the reader.

      We apologize for the inadvertent mistake of leaving out Table S7. In the revised manuscript, we include all hits from species that are part of the 322 species phylogeny in Shen et al. 2018. Thus, we removed the original Table S7.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Ideally, validation of all sequences would provide a stronger foundation for the work. However, that request is not realistic in terms of time or resources.

      We agree with the reviewer and appreciate the understanding. In the revised manuscript, we performed additional analyses to evaluate the accuracy and correct the sequences of the BLASTP hits from RefSeq database by comparing them to long-read based assemblies when possible. Please see previous replies to reviewers’ comments and Text S1 for details.

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes, the data and methods are documented clearly and perhaps too thoroughly in many places. A considerable amount of confidence is placed in sequences that might not be accurate and tracking details down to the amino acid residue may not be reasonable in this context. A disclaimer might help--everyone probably already knows that genome sequences are not perfect but stating that the analysis is only as good as the genome sequence acknowledges that fact.

      We appreciate the reviewer’s comment. In the revised manuscript, we tried to strike a balance between providing enough methodological details for the readers to assess the conclusions and yet also keeping the flow of the paper. We also accepted the reviewer’s suggestion by adding a disclaimer in the Discussion:

      “we acknowledge the possibility of missing homologs in some species and having inaccurate sequences in the tandem-repeat region. We believe the expected improvements in genome assemblies due to advances in long-read sequencing technologies will be crucial for future studies of the adhesin gene family in yeasts.”

      • Are the experiments adequately replicated and statistical analysis adequate?

      The idea of replicates does not really apply to this analysis. I think that the species sampled are reasonable to represent the region of the phylogenetic tree on which the analysis is focused. The authors clearly documented their computational methods in an admirable way.

      We appreciate the reviewer’s comment.

      MINOR COMMENTS:

      Figure 1 has elements that would make a nice graphical summary, but most of it should not be part of the final manuscript. For example, Panel A is repeated in Figure 2. It is not clear what Panel C means until the reader gets to Figure 2. Panel D is unnecessary. The image in Panel B is a good graphic. Endothelial adhesion is not mentioned, though. It is also debatable whether the proteins bind directly to plastic or to the body fluids that coat the plastic.

      Based on this and another reviewer’s comments, we removed Figure 1 from the revised manuscript.

      Compared to Figure 1, the information in Figure 3 is inconsistent. The "central domain" in Panel A is not central to anything as drawn, located at the end of the protein. The figure should be revised to be consistent with the majority of the authors' results.

      We appreciate the reviewer’s suggestion. The terminologies used to describe the different parts of a typical yeast adhesin vary in the literature. In the Als family literature, central domain refers to the region after the N-terminal effector domain and before the C-terminal Ser/Thr-rich stalk domain. In the Hil family proteins, there is not a clear distinction between a “central” and a “stalk” region. In Boisramé et al. 2011 (PMID: 21841123), the authors referred to the region between the Hyphal_reg_CWP domain and the GPI-anchor as the central domain. We adopted that use. We realize that this can lead to confusion especially for Als researchers. In some other literature, e.g., Reithofer et al. 2021, this part of the protein is referred to as the B-region. But we couldn’t find wide use of that term. We decided to stay with “central domain” in this work and hope that by defining the term in Figure 2A, we would avoid any confusion within the scope of this work.

      Are the low-complexity repeats mentioned in the Figure 4 legend present anywhere else in the C. auris genome or elsewhere among the species used in this study? The answer to that question may also provide evolutionary clues.

      We did find one other putative GPI-anchored cell wall protein containing this ~44aa repeat unit, but with a different effector domain (GLEYA, PF10528). This protein (PIS58185.1 in C. auris B8441), appears to be a hybrid between the repeat region of C. auris Hil1 and an N-terminal effector domain of a different family. This result fits the theme of the reviewer’s work in C. albicans and C. tropicalis on the chimeric adhesins formed between the Als and Hyr/Iff families. Due to the scope of the current work, we omitted this finding from the main result.

      Figure S1 legend. How was the distance to C. glabrata measured to call it equal?

      The original Figure S1 was removed in the revised manuscript. A consistent set of criteria was employed in deciding which BLASTP hits to include as Hil family members.

      Figure S4 could be presented better. Both diagonals have the same information. One could be emptied or could alternatively present nucleotide identity.

      The original Figure S4 was removed in the revised manuscript.

      Italicize the species names in Panel C of Figure S8.

      The original Figure S8C is now Figure S9 and we systematically checked to make sure that species Latin names are italicized. Thanks for pointing this out.

      Lines 256-257: The paper selectively samples the Iff/Hyr family and does not examine the "entire" family. Please revise.

      We appreciate the reviewer’s comment. In the revised manuscript, we no longer selectively sample species. Instead, we only exclude three species that are not part of the 322-yeast species phylogeny in Shen et al. 2018 and Muñoz et al. 2018, namely Diutina rugosa, Kazachstania barnettii and Artibeus jamaicensis. Our extensive BLASTP searches also indicated that the family as defined in this work is specific to the budding yeast subphylum. We therefore believe it is accurate to describe the work as examining the entire Hil family.

      • Are prior studies referenced appropriately?

      I was disappointed to see that the paper does not reference my laboratory's work at all. When ALS genes are featured so strongly in a report, it seems reasonable to include something we have done over 30+ years. Our most-recent ALS paper (Oh et al., 2021 doi: 10.3389/fcimb.2021.794529) would be a reasonable source for defending the gene numbers used in Figure 2A. Other examples of our work that directly relate to concepts in this paper were mentioned above.

      We sincerely apologize for our negligence. We are new to the fungal adhesin field through an accidental finding, and despite our effort to digest the relevant literature, we did unfortunately overlook the extensive work done on the Als family, much of which came from the reviewer’s lab. We have carefully read the papers suggested by the reviewer as well as others, and now have better incorporated prior foundational and insightful work into our result and discussion sections (see previous replies to the reviewer’s comments).

      • Are the text and figures clear and accurate?

      Suggestions for improvement are incorporated into the comments above.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Please present Methods and Results in the past tense. I still make the same mistake when I try to get my ideas on the page but proofread one more time and ensure the verb tenses are accurate.

      We appreciate the reviewer’s comments and have edited the Methods and Results sections accordingly.

      SIGNIFICANCE

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The paper reads as if it is presenting preliminary data for a grant proposal. Perhaps Prof. He's lab wants to seek functional evidence for the role of the Iff/Hyr proteins. The current paper provides an exhaustive background for such a pursuit. As presented, there is little functional data for these proteins, genome sequences are not 100% accurate, but the trends noted are defendable.

      We appreciate the reviewer’s comments. We acknowledge that experimental studies will be needed to prove and further establish the functional importance of our findings. However, we believe our gene family evolutionary studies provided important novel insights and serve as an example for adhesin family evolution.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      The ideas presented here are similar to those pioneered in the Butler et al. Nature paper in 2009 (doi: 10.1038/nature08064). We now have the benefit of more genome sequences so the analysis can encompass more species. C. auris adds a newer focus on part of the phylogenetic tree that was not previously emphasized. The idea of "more is better" is very simplistic, though. Parallel work for the ALS family shows complexity in gene expression levels, suggesting that some adhesins are poised to make a large contribution while others are likely to have a scant presence on the cell surface. Those concepts are not really explored in the current paper, either. See Hoyer and Cota 2016 (doi: 10.3389/fmicb.2016.00280); Oh et al. (doi: 10.3389/fmicb.2020.594531).

      We appreciate the reviewer’s comments and have included a discussion about the potential diversity of the duplicated Hil family proteins, in terms of function and their regulation in the Discussion. Also see our response to the first comment of the reviewer regarding the novelty of our hypothesis and the significance of our findings.

      • State what audience might be interested in and influenced by the reported findings.

      Potential readers would come from the fields of fungal adhesion and pathogenesis, as well as evolutionary biology.

      We appreciate the reviewer’s comments.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I discovered and named the ALS gene family in C. albicans and have spent 30+ years characterizing it. Most recently, my lab has focused on providing an accurate gene census and validated gene sequences for the cell wall "adhesinome" in the pathogenic Candida species. Some families are expanded and some are not. Some proteins appear only in a few species and demonstrate key roles in host-fungus interactions. There are many nuances to interpretation of what these fungi are doing from the standpoint of cell-surface adhesins and we look forward to exploring these ideas across many genomes, using validated gene sequences. We have a tremendous dataset that might make good fuel for a collaboration with Prof. He, given his enthusiasm for this area of study, as well as his outstanding expertise and perspectives on evolutionary analyses.

      We sincerely thank the reviewer for the critical analysis of our manuscript and appreciate the many suggestions for improving the manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Smoak et al. provides considerable information gleaned from analysis of HYR/IFF genes in 19 fungal species. A specific focus is on Candida auris. The main conclusion is that this gene family repeatedly expanded in divergent pathogenic Candida lineages including C. auris. Analyses focus on the sequences encoding the protein's N-terminal domain and tracts of repeated sequences that follow. The authors conclude with the hypothesis that expansion and diversification of adhesin gene families underpin fungal pathogen evolution and that the variation among adhesin-encoding genes affects adhesion and virulence within and between species. The paper is easy to read, includes clear and attractive graphics, as well as a considerable number of supplementary data files that provide thorough documentation of the sources of information and their analysis.

      Major comments:

      • Are the key conclusions convincing?

      Overall, the authors' conclusions are supported by the information they present. However, the overall conclusion is stated as a hypothesis and that hypothesis is not particularly novel. The idea that expansion of gene families associated with pathogenesis occurs in the pathogenic species dates back at least to Butler et al. (2009; doi: 10.1038/nature08064) who first presented the genome sequences for many of the species considered here.

      One key issue with a manuscript of this type is whether genome sequence data are accurate. The authors are not the first research group to take draft genome sequence data at face value and attempt to draw major conclusions from it. The accuracy of public genome data continues to improve, especially with the emergence of PacBio sequencing. Because the IFF/HYR genes contain long tracts of repeated sequences, genome assemblies from short-read data are frequently inaccurate. For example, is it reasonable to have confidence that the number of copies of a tandemly repeated sequence in a specific ORF is exactly 21 (an example taken from Table 2) when each repeat is 40+ amino acids long and highly conserved? Table S6 would benefit from inclusion of the type of sequence data used to construct each draft genome sequence. It is also reasonable to question whether the genome of the type strain is used as a template to construct the draft genomes of the other strains. If that was standard practice, conservation of the repeat copy number among strains might be an artefact. Conservation of repeat sequences to the degree shown is not a feature of the ALS family, a point of contrast between gene families that could be explored in the Discussion. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Due to the nature of my comments, this review will not be anonymous. I will include some of the data from my laboratory to further illustrate the point about the quality of draft genome sequences, especially for gene families that contain repeated sequences. My laboratory group has spent the past several years looking at the families of cell wall genes in these species and know that the C. tropicalis genome sequence used in the current analysis is highly flawed. There is even a manuscript from several years ago that documents problems in the assembly (doi: 10.1534/g3.115.017566). There is a new PacBio sequence available that has considerably improved data for this group of genes, but still is not perfect. We designed primers and amplified the various coding regions to verify whether the IFF/HYR were correct in the draft genome sequences. For C. tropicalis, we know that 7 of the genes listed in this paper are broken (i.e. prematurely terminated) giving a false impression of their construction. The current study did not verify any gene sequences, so broken/incomplete genes are a stumbling block for developing conclusions.

      Similarly, the recent work from Cormack's lab features a PacBio C. glabrata sequence (doi: 10.1111/mmi.14707). The paper details how the authors focused on accurate assembly of the types of genes studied here. Sequences from the current project should be compared to the PacBio assembly to determine if they provide the same results.

      Another part of the study that deserves additional attention or perhaps altered presentation is the idea that the Iff/Hyr N-terminal domain binds ligands. The literature on the Iff/Hyr proteins is limited. In my opinion, though, the authors of this paper could more completely present the information that is known. The paper by Uppuluri et al. is cited (doi: 10.1371/journal.ppat.1007056), but I did not see any information about their data regarding interaction of C. albicans Hyr1 with bacterial proteins mentioned in the manuscript under review. It is formally possible that the N-terminal domain of Iff/Hyr proteins does not bind a ligand. The current manuscript includes a great deal of speculation on that point, suiting it better to a Hypothesis and Theory format rather than other types of publications.

      Table 1 attempts to offer evidence that the Iff/Hyr N-terminal domain has adhesive function but falls short of convincing the reader. One of the example structural templates is a sugar pyrophosphorylase that seems irrelevant to the current discussion. In the column called "Function", the word adhesin is found several times, but no detail is presented. The only entry that offers an example ligand indicates that the domain binds cellulose which is not likely relevant for mammalian pathogenesis, the main focus of the work. Other functions listed include self-association and cell aggregation--using the N-terminal domain. It is formally possible that Iff/Hyr proteins drive aggregation using the N-terminal domain and beta-aggregation sequences in the repeated region. The authors should develop these ideas further. Discussion of adhesive/aggregative function related to the ALS family can be found in Hoyer and Cota, 2016 (doi: 10.3389/fmicb.2016.00280).

      The incredibly large number of figures that focus on the repeated sequences in the genes does not appear to include mention of the idea that these regions are frequently highly glycosylated. Knowing how much carbohydrate is added to these sequences in the mature protein would also have bearing on whether the beta-aggregation potential is realized. The Iff/Hyr proteins could stick to other things based on ligand binding (adhesion), hydrophobicity, aggregative activity, etc. Not much is really known about protein function so the conclusions are only speculative. The authors are largely accurate in presenting their conclusions as speculative, but the conclusions are not developed fully and always land on the idea that the N-terminal domain has adhesive function when that aspect clearly is not known.

      Another aspect of the analysis that is not mentioned is that several of the species discussed are diploid. What effect does ploidy have on the conclusions? Most draft genomes for diploid species are presented in a haploid display, so are not completely representative of the species. Additionally, some species such as C. parapsilosis are known to vary between strains in their composition of gene families, with varying numbers of loci in different isolates.

      The manuscript concludes that having more genes is better, that the gene family represents diversification that must be driven by its importance to pathogenesis, without recognizing that some species evolve toward lower pathogenesis. This concept could be explored in the Discussion.

      The Results and Discussion sections are largely redundant. The tone of the paper is conversational, making it easy to read, but there seems little left to say in the Discussion that has not already been mentioned as the background for the various types of analyses. The authors should revise the paper to eliminate discussions of published literature from the Results and expand the Discussion to include some of the themes that have not been mentioned yet.

      My own experience makes me wonder if the authors found any examples of species that provide and exception to the idea that having more genes is better and positively associated with pathogenesis. The parallel between IFF/HYR and ALS genes is made many times in the manuscript. Spathaspora passalidarum, a species that is not pathogenic in humans, but clearly within the phylogenetic group examined here, has 29 loci with sequence similarity to ALS genes. How many IFF/HYR genes are in S. passalidarum?

      There are several current taxonomies for the species in this region of the tree. The source of the names used in this paper could be specific more completely.

      Another point that the authors do not mention is documented recombination between IFF and ALS genes (doi: 10.3389/fmicb.2019.00781) and the effect of that process on evolution among these gene families.

      My reading of the work by Xu et al. 2021 (doi: 10.1111/mmi.14707) does not match the direction of its presentation in the current paper. Oh et al., 2021 (doi: 10.3389/fcimb.2021.794529) discussed that point recently, providing another point for the Discussion in the current paper.

      I might have missed it, but I could not find what constitutes a BLAST-excluded sequence (Table S7). Additional explanation (or making the explanation easier to find) would help the reader. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Ideally, validation of all sequences would provide a stronger foundation for the work. However, that request is not realistic in terms of time or resources. - Are the data and the methods presented in such a way that they can be reproduced?

      Yes, the data and methods are documented clearly and perhaps too thoroughly in many places. A considerable amount of confidence is placed in sequences that might not be accurate and tracking details down to the amino acid residue may not be reasonable in this context. A disclaimer might help--everyone probably already knows that genome sequences are not perfect but stating that the analysis is only as good as the genome sequence acknowledges that fact. - Are the experiments adequately replicated and statistical analysis adequate?

      The idea of replicates does not really apply to this analysis. I think that the species sampled are reasonable to represent the region of the phylogenetic tree on which the analysis is focused. The authors clearly documented their computational methods in an admirable way.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Figure 1 has elements that would make a nice graphical summary, but most of it should not be part of the final manuscript. For example, Panel A is repeated in Figure 2. It is not clear what Panel C means until the reader gets to Figure 2. Panel D is unnecessary. The image in Panel B is a good graphic. Endothelial adhesion is not mentioned, though. It is also debatable whether the proteins bind directly to plastic or to the body fluids that coat the plastic.

      Compared to Figure 1, the information in Figure 3 is inconsistent. The "central domain" in Panel A is not central to anything as drawn, located at the end of the protein. The figure should be revised to be consistent with the majority of the authors' results. Structures in Panels C to E would benefit from the "through the spiral" view that is featured in Figure S9. What experimental technique was used to solve the structure in Panel E? Adding that information to the legend would be helpful to the reader. Also, the secondary structure colors seem to be reversed between the legend and domain structure. Adding the coordinates of the domains shown would help the reader to understand their location in the mature protein.

      Are the low-complexity repeats mentioned in the Figure 4 legend present anywhere else in the C. auris genome or elsewhere among the species used in this study? The answer to that question may also provide evolutionary clues.

      Figure S1 legend. How was the distance to C. glabrata measured to call it equal?

      Figure S4 could be presented better. Both diagonals have the same information. One could be emptied or could alternatively present nucleotide identity.

      Italicize the species names in Panel C of Figure S8.

      Lines 256-257: The paper selectively samples the Iff/Hyr family and does not examine the "entire" family. Please revise. - Are prior studies referenced appropriately?

      I was disappointed to see that the paper does not reference my laboratory's work at all. When ALS genes are featured so strongly in a report, it seems reasonable to include something we have done over 30+ years. Our most-recent ALS paper (Oh et al., 2021 doi: 10.3389/fcimb.2021.794529) would be a reasonable source for defending the gene numbers used in Figure 2A. Other examples of our work that directly relate to concepts in this paper were mentioned above. - Are the text and figures clear and accurate?

      Suggestions for improvement are incorporated into the comments above. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Please present Methods and Results in the past tense. I still make the same mistake when I try to get my ideas on the page but proofread one more time and ensure the verb tenses are accurate.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The paper reads as if it is presenting preliminary data for a grant proposal. Perhaps Prof. He's lab wants to seek functional evidence for the role of the Iff/Hyr proteins. The current paper provides an exhaustive background for such a pursuit. As presented, there is little functional data for these proteins, genome sequences are not 100% accurate, but the trends noted are defendable. - Place the work in the context of the existing literature (provide references, where appropriate).

      The ideas presented here are similar to those pioneered in the Butler et al. Nature paper in 2009 (doi: 10.1038/nature08064). We now have the benefit of more genome sequences so the analysis can encompass more species. C. auris adds a newer focus on part of the phylogenetic tree that was not previously emphasized. The idea of "more is better" is very simplistic, though. Parallel work for the ALS family shows complexity in gene expression levels, suggesting that some adhesins are poised to make a large contribution while others are likely to have a scant presence on the cell surface. Those concepts are not really explored in the current paper, either. See Hoyer and Cota 2016 (doi: 10.3389/fmicb.2016.00280); Oh et al. (doi: 10.3389/fmicb.2020.594531). - State what audience might be interested in and influenced by the reported findings.

      Potential readers would come from the fields of fungal adhesion and pathogenesis, as well as evolutionary biology. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I discovered and named the ALS gene family in C. albicans and have spent 30+ years characterizing it. Most recently, my lab has focused on providing an accurate gene census and validated gene sequences for the cell wall "adhesinome" in the pathogenic Candida species. Some families are expanded and some are not. Some proteins appear only in a few species and demonstrate key roles in host-fungus interactions. There are many nuances to interpretation of what these fungi are doing from the standpoint of cell-surface adhesins and we look forward to exploring these ideas across many genomes, using validated gene sequences. We have a tremendous dataset that might make good fuel for a collaboration with Prof. He, given his enthusiasm for this area of study, as well as his outstanding expertise and perspectives on evolutionary analyses.

    1. This can be said of the common cognitions of blue, yellow etc. also. If self-awareness could be discredited on the ground that it is the product of some beginningless urge, how can any other cognition be credited as valid so that one could depend upon the cognitions of blue, yellow etc.?”

      The other day, I was having an interesting discussion with Buddhist philosopher working on sanskrit. We were discussing perhaps it is the grammatical structure that lead us to think in a certain way. That the languagenand its grammatical structure may already guiding our way of thinking. Sor the argument was on White Cow, is the whiteness posessing the cow, or the cow posessing the whiteness. The scholars were arguing that it could be translated in both ways, that gives us a kind of vague different understanding of what is being described.

    1. Reviewer #1 (Public Review):

      The core question addressed by this study is whether right IFC damage disrupts stop-signal task performance because it plays a key role in response inhibition per se, or because it is crucial for attending to the need to engage response inhibition. A relatively large sample of patients with damage including right IFC, as well as lesioned and healthy control groups, were assessed on the stop-signal task accompanied by scalp EEG. The behavioral data were analyzed using hierarchical Bayesian modeling. Right IFC damage was associated with more trials where 'stopping' was not initiated, while an EEG hallmark of inhibitory control was present in trials where stopping initiation did occur, arguing that rIFG damage disrupts attention to the stop signal, rather than the inhibition that follows.

      This is an interesting study testing a well-defined hypothesis relevant to competing views of the brain basis of inhibitory control. The experimental design is sophisticated and the analysis was preregistered. The acquisition of both behavioral and EEG data in lesion patients provides converging evidence and supports causal inference.

      Interpretation of the results hinges on accepting that a hierarchical Bayesian model is appropriate for discriminating trials where stopping was 'triggered' from trials where there was no trigger. Likewise, we need to accept the EEG frontal beta burst pattern is an indicator of response inhibition. Both of these methodological elements have support from existing literature, although I don't think either of these has been applied in chronic focal lesion patients, so there may be technical issues to consider in their interpretation. Finally, as with most human lesion studies, caution should be applied in interpreting the critical lesion location: in this sample, the effects might relate to insula damage, or to white matter disruption within the ventrolateral/lateral frontal lobe or between those regions and subcortical regions. However, these provisos do not detract from the key finding that damage somewhere in these areas affected initiation/attentional processes rather than response control per se.

      The results are more consistent with an attentional account of right IFG (or more broadly, right ventral frontal lobe) contributions to stop-signal task performance; this is provocative in light of current views of prefrontal contributions to inhibitory control, although in line with a wider literature implicating right frontoparietal circuitry in selective attention. As the authors suggest, a sharp distinction between attention and inhibition may be somewhat artificial: these processes may be closely interrelated in speeded tasks requiring response interruption. However, the present study cleverly tackles the challenge of disentangling them, applying recent modeling and EEG distinctions with interesting results.

      The findings are helpful in further sharpening ideas regarding the neural basis of response control. They also have potential theoretical implications and perhaps direct experimental application in clinical-applied research on disorders of inhibitory control.

    1. Author Response

      Reviewer #1 (Public Review):

      The stated goal of this research was to look for interactions between metabolism, (manipulated by glucose starvation) and the circadian clock. This is a hot topic currently, as bi-directional links between metabolism and rhythmicity are found in several organisms and this connection has important implications for human health. The authors work with the model organism Neurospora crassa, a filamentous fungus that has many advantages for this type of research.

      The authors' first approach was to assay the effects of glucose starvation on the levels of the RNA and protein products of the key clock genes frq, wc-1, and wc-2. The WC-1 and WC-2 proteins form a complex, WCC, that activates frq transcription. The surprising finding was that WC-1 and WC-2 protein levels and WCC transcriptional activity were drastically reduced but frq RNA and protein levels remained the same. Under conditions where rhythmicity is expressed, the rhythms of frq RNA, FRQ protein, and expression of clock-driven "output" genes were also unaffected by starvation. The standard model for the molecular clock is a transcription/translation feedback loop dependent on the levels and activity of these clock gene products, so this disconnect between the starvation-induced changes in the stoichiometry of the loop components and the lack of effects of starvation on rhythmicity calls into question our understanding of the molecular mechanism of the clock. This is yet another example of the inadequacy of the TTFL model to explain rhythmicity. For me, the most significant sentence in the paper was this: "...an unknown mechanism must recalibrate the central clockwork to keep frq transcript levels and oscillation glucose-compensated despite the decline in WCC levels."

      The author's second approach was to try to identify mechanisms for the response to starvation by focussing on frq and its regulators, using mutations in the frq gene and strains with alterations in the activity of kinases and phosphatases known to modify FRQ protein. The finding that all of these manipulations have some effect on the starvation-induced changes in WC protein level is taken by the authors to indicate a role for FRQ itself in the response to starvation. This conclusion is subject to the caveat that manipulations of the activity of multifunctional kinases and phosphatases will certainly have pleiotropic effects on many cellular processes beyond FRQ protein activity.

      Because of the sometimes-speculative nature of our conclusions and based on the suggestion of the editor, we restructured the Discussion and discuss now the mechanism addressed by the Reviewer in the subsection "Ideas and Speculation". We added a sentence to the section about the possible pleiotropic effects of the tested signaling pathways: "Starvation triggers characteristic changes in the activity of signaling routes that affect basic components of the circadian clock. Although the multifunctional pathways might act via pleiotropic mechanisms as well, based on their earlier characterized role in the control of the Neurospora clock, their action can be inserted into a model describing the glucose-dependent reorganization of the oscillator."

      The third section of the paper is a major transcriptomic study of the effects of starvation on global gene expression. Two strains are compared under two conditions: wc wild-type and the wc-1 knockout strain, under fed and starved conditions. The hypothesis is that WCC has a role in the starvation response. The results of starvation on the wild-type are unsurprising and predictable: the expression of many genes involved in metabolic processes is affected. There are no new insights that come from these results and no new testable hypotheses are generated by the data.

      We agree with the reviewer that it is not surprising that glucose depletion strongly affects genes involved in metabolic processes and monosaccharide transport. These data obtained in wt served rather as a control for our experimental conditions. As a new aspect, our analysis focused on the differences between wt and wc-1 in the transcriptomic response to altered glucose availability.

      The authors refer to the wc-1 mutant strain as "clockless" and discuss its effects on the transcriptome only in terms of WC-1's function in the clock mechanism. However, WCC is known to be a major transcriptional regulator, controlling a number of genes beyond the TTFL. As acknowledged earlier in the paper, WC-1 is also the major light receptor in Neurospora. The transcriptomics experiments were carried out in a light/dark cycle, with cultures harvested at the end of the light period, when "an adapted state for light-dependent genes can be expected" according to the authors. However, wc-1 mutants are essentially blind, and so those samples are equivalent to being harvested in the dark. The multifunctional nature of WCC complicates the interpretation of the transcriptomics data. The differences in the transcriptome between wild-type and wc-1 may not be due to loss of clock function, but rather the loss of a major multifunctional transcription factor, or the difference between light and "dark".

      The reviewer is right, when we discussed the difference between wt and wc-1 in the transcriptional response to glucose, we did not emphasize the possible contribution of the photoreceptor function of the WCC. We added the following sentence to the revised version of the discussion: "Further investigations could differentiate between the clock and photoreceptor functions of the WCC in the glucose-dependent control of the transcriptome." Furthermore, we more specifically indicate that in wc-1 the lack of the WCC (and not the lack of a functional clock) results in the altered transcriptomic response to starvation when compared to wt (P15 L14-17).

      In the final set of experiments, the authors tested the hypothesis that the changes in the transcriptome between wild type and wc-1 might make wc-1 less competent to recover growth after starvation. They also test the recovery of frq9, a "clockless" mutant. The very surprising result is that the growth rates of these two mutants are slower than the wild type after transfer from starvation media to high glucose. This is surprising because there will be several generations of nuclear division and doublings of mass within a few hours and the transcriptome should have recovered fully fairly rapidly. A mechanism for this apparent "after-effect" is suggested with evidence concerning differences in expression of a glucose transporter, but it is not clear why this expression should not change rapidly with re-feeding on high glucose. As with previous experiments, the cultures were grown in light/dark cycles, which results in different conditions for the mutants, both of which have very low or absent WC-1 and are therefore blind to light. The potential effects of light have been disregarded.

      The reviewer is right that several generations of nuclear divisions occur within a few hours and lead to a number of doublings of the biomass. However, when the first phase of regeneration is delayed in one or more strains compared to the control, until the stationary phase a substantial difference in the biomass can be expected.

      To the expression change of the glucose transporter: In order to emphasize the different tendency of how glt-1 levels respond to glucose in the different strains, in the previous version of the manuscript we normalized the expression levels to the beginning of recovery (time point of glucose addition). Thus, expression differences between the strains were not shown. To give a more comprehensive picture, in the revised version of the manuscript expression levels without normalization are depicted (Fig 5F). The mutants did not adapt efficiently to changes in the glucose levels, i.e. expression of the transporter was relatively high in both wc-1 and frq10 during starvation and did not further increase upon glucose addition. On the other hand, 24 hours after glucose resupply, glt-1 levels were similar in all strains which might contribute to the similar growth rates observed under steady-state conditions in the standard medium.

      To the photoreceptor-independent function of the WCC during growth recovery: In the revised version of the manuscript we present additional data suggesting the importance of the photoreceptor-independent function of the WCC for efficient recovery from starvation. Fig. 5C and Fig. 5D show now that upon resupply of glucose, wt grows faster than the clock-deficient strains Δwc-1 and frq10 in both LD cycles and constant darkness, indicating that the role of the WCC in growth regeneration is at least partially independent of its photoreceptor function. To the function of the WCC in frq10: frq10 can not be considered blind. Although both Δwc-1 and frq10 lack a functional clock and WC levels are reduced in frq10, these strains show significant differences in WCC activity. While Δwc-1 is considered blind, in frq10 lack of the negative feedback results in high activity of the WCC in both DD and LL and expression levels of all examined, light-sensitive or light-dependent genes were found comparable in wt and in frq-less mutants (Schafmeier et al., 2005; Hunt et al., 2007; own unpublished data).

      The title of the paper refers to a "flexible circadian clock" but this concept of flexibility is not developed in the paper. I would substitute "the White Collar Complex" for this phrase: "Adaptation to starvation requires a functional White Collar Complex in Neurospora crassa" would be more accurate. Some experiments are also conducted using an frq null "clockless" strain, but because WC expression is very low in frq null mutants, any effects of frq null could also be attributed to WC depletion.

      As detailed above, low level of the WCC in the frq-less mutant does not mean low transcriptional activity and accordingly, the two clock mutants, wc-1 and frq10 show important functional differences. We used the word "flexible" to indicate that the molecular clock is able to operate under critical nutrient conditions and with a significantly changed stoichiometry of its key components. Results of our new experiments performed in DD (mentioned above) indicate that growth regeneration is rather independent of the photoreceptor function of the WCC. Nevertheless, we accepted the criticism of the reviewer and changed the title to "Adaptation to glucose starvation is associated with molecular reorganization of the circadian clock in Neurospora crassa".

      The major conclusion I took away from this paper is the multifunctional nature of the WCC as a transcription factor complex. It has been known for a long time that WCC controls the expression of many genes beyond the frq gene at the core of the circadian transcription/translation feedback loop. WC-1 is also the major blue light photoreceptor in Neurospora, controlling the expression of light-regulated genes, and this fact is barely touched on in the paper. These new data now extend the role of WCC in the regulation of metabolic networks as well.

      Reviewer #2 (Public Review):

      The authors have performed an interesting study addressing a topical question in considering how circadian oscillators remain accurate in changing environmental conditions and these circadian oscillators contribute to responses to environmental changes. The authors have performed their studies in Neurospora crassa. The authors have made a very interesting finding that starvation causes a profound decrease in white collar 1 WC-1 abundance, yet the circadian system continues to run despite this decrease in the abundance of a core oscillator component. The study of chronic glucose starvation in a Δwc-1 mutant is interesting and provides the opportunity to investigate the role of the WHITE COLLAR COMPLEX (WCC) and the clock system in adaption to starvation.

      Strengths:

      The authors have used a range of techniques to measure clock behaviour, including qPCR, phosphorylation, protein abundance, and subcellular localisation studies.

      An frq9 mutant was used to test the effects of FRQ on WC1 abundance since WC1 decreased during starvation. This is elegant, though it is not quite clear the logic of this experiment because FRQ did not change abundance during starvation, so why did the author think this experiment was needed?

      We regret that the examination of frq9 was not clearly justified in the previous version of the manuscript. It is true that FRQ levels did not change during starvation, only phosphorylation of the protein was affected, i.e. FRQ became more phosphorylated (displayed by an electrophoretic mobility shift on the Western blot (Garceau N, Liu Y, Loros J J, Dunlap J C. Cell. 1997;89:469–476.)) under low glucose conditions. We tested the starvation response in the FRQ-less strain because WCC level changed significantly in wt upon glucose depletion and expression of WC proteins is known to be controlled by FRQ. In the revised version of the manuscript we tried to introduce and explain the experiments performed with frq9 more thoroughly (P7 L22-P8 L14; P16 L21 – P17 L6).

      An interesting experiment was performed to test whether CK1a-dependent phosphorylation and inactivation of the WCC are involved in the starvation response. An FRQΔFCD1-2 mutant is used in which FRQ cannot interact with CK1a and therefore CK1a cannot phosphorylate and inactivate WC. This experiment suggested that CK1a is not involved in the response to starvation, again leading to the conclusion that FRQ is not involved in the starvation regulation of WC.

      The referee is right, effect of FRQ-bound CK-1a seems to be minor on the adaptation of the molecular clock to starvation, and this is also our conclusion in the manuscript. The major message of this experiment was that FRQ became phosphorylated in response to starvation without stably interacting with CK1a, probably via another mechanism. We agree with the notion that the behavior of WCC levels upon starvation was similar to that in the FRQ-less mutant.

      PKA is shown to be involved in the starvation-induced reduction of WC because the starvation-induced reduction in abundances of WC-1 was absent in the mcb strain in which the regulatory subunit of PKA is defective and hence, PKA is constitutively active.

      The authors have found an interesting potential link between glucose levels and WCC phosphorylation, they demonstrated that starvation reduces PP2A activity and that in a regulatory mutant of PP2A, which has reduced PP2A activity, there is little effect of starvation on WCC levels, suggesting the hypothesis that glucose-dependent PP2A dephosphorylation stabilises WCC.

      Analysis of starvation-regulated transcriptome in Δwc-1 and wild type found strong evidence that the transcriptomic response to starvation is in part dependent on WCC. Much of the misregulated transcriptome appears to be associated with metabolism.

      In a series of growth studies in wild-type frq and wc-1 mutants the authors provide strong evidence that FRQ and WC are involved in growth and survival following starvation, and recovery from starvation.

      Weaknesses:

      The authors describe Neurospora crassa as a model for circadian biology and apparently make the assumption that the findings are indicative of the behaviour of clock systems in other kingdoms. This is not the case. Neurospora crassa is a wonderful model for studying fungal clocks and is a great tool for studying basic circadian dynamics, but the interesting findings here are of a detailed molecular nature and therefore are applicable for fungal clocks, but not other kingdoms.

      We agree that we still do not know whether the described mechanism is specific for only fungal clocks. However, besides the basic feedback loop, overlapping mechanisms (controlled by e.g. casein kinases, glycogen synthase kinase, PKA, PP2A) are involved in the regulation of circadian timekeeping in different eukaryotic systems (reviewed in Reischl and Kramer, 2011, FEBS Lett; Brenna and Albrecht, 2020, Front Physiol). Our results suggest that some of these common factors (PKA, GSK, PP2A) are involved in the reorganization of the Neurospora clock in response to changes in glucose availability. Therefore, it is possible that analogous changes occur in the time keeping mechanisms of other eukaryotic systems when they face serious environmental challenges.

      We included a short section into the Discussion which gives a short overview about known interactions between glucose availability and circadian timekeeping at different levels of the phylogenetic hierarchy (P15 L18 – P16 L7).

      The authors assume that the reader is intimate with the intricacies of Neurospora crassa circadian studies and the significance of differences between LL and DD investigations. More background on the logic of the experiments would be helpful for readers from other fields.

      Thank you for the comment. In the revised version of the manuscript we tried to introduce the molecular clock of Neurospora more thoroughly and completed the description of the experimental conditions with detailed explanations.

      The data in Figure 2 are essential for the interpretation of the findings, demonstrating the presence of free-running rhythms. However, the data are entirely qualitative, making it hard to fully assess the authors' interpretations, a more quantitative assessment of the data would improve clarity.

      We quantified the Western blot signals and show the results in Fig 1E in the new version of the manuscript (according to the reviewer's suggestion Fig 2 of the old version is now part of Fig 1). Our data indicate that oscillation of FRQ levels is similar under both nutrient conditions.

      The conclusion that FRQ contributes to the regulation of WC1 abundance in response to starvation does not seem to be supported by the data because FRQ RNA does not change upon starvation. Furthermore, the authors conclude that the starvation-induced decrease in WC-1 and WC-2 protein levels are due to FRQ because a lack of reduction in an frq9 mutant is open to misinterpretation because this mutant makes WC levels low and therefore starvation might not lower already low levels of WC. Indeed WC-1 is lower in the frq9 mutant under any condition than in the WT under starvation and WC-2 does decrease in abundance in the frq9 mutant in starvation. The data strongly suggest to this reader that FRQ does not participate in the regulation of WC abundance in response to starvation.

      After rereading the criticized section, we admit that the text was not well structured and we carried out several modifications. We intended to emphasize that upon drastic changes of the glucose availability frq RNA levels remained compensated in wt, but this compensation was affected when functional FRQ was not present. We agree with the reviewer's opinion that the low expression of the WCC in frq9 makes it difficult to compare the glucose-dependence of WCC expression in frq9 and wt. We modified the conclusion by adding this information and now mainly focus on the strain-dependent difference in the changes of frq RNA expression. (P7 L22-P8 L14)

      The discussion accurately summarises the results and provides an interpretation but lacking is a comparison to other circadian systems in other kingdoms. How do the data compare with the effects of glucose and other sugars on the mammalian, plant, and insect clocks?

      We included a short section into the Discussion which gives a short overview about known interactions between glucose availability and circadian timekeeping in different organisms (P15 L18-P16 L7).

      How changes in WCC might result in changes in transcription is not explained. This might be very obvious to the authors but to the reader, it is not. Are the transcriptional outputs direct targets of WCC? Has WCC CHIPseq been performed by the authors or others, are the regulated transcripts directly bound by WCC? What are the enriched promoter sequences in the regulated genes, is it possible to identify the network by which these changes in transcription occur?

      We now show the list of genes (Figure 4 – Figure supplement 2) that changed in a strain-specific manner in response to glucose starvation and, based on Chip-Seq results, were earlier described as direct targets of the WCC (Smith et al., 2010; Hurley et al., 2014). Based on the literature data showing that the WCC affects the expression of several other transcription factors and controls basic cellular functions which might affect the expression of further genes, it was not surprising that only 90 out of the 1377 genes were reported to be direct targets of the WCC.

      Whilst the authors claim it is the circadian clock that is involved in the starvation response, in my view a more precise interpretation of the data is that WCC is involved in the response. Since WCC is a photoreceptor with dual function in the clock, is it yet possible to conclude that the effects discovered are due to the clock role of WCC? Or do the data support the role of light signalling in regulating the starvation response through WCC?

      We thank you for the comment. In the revised version of the manuscript we more specifically indicate that in wc-1 the lack of the WCC (and not the lack of a functional clock) results in the altered transcriptomic response to starvation compared to wt. In addition, in the revised version we present a new experiment (Fig. 5D.) which shows that upon resupply of glucose wt grows faster also in constant darkness than the clock-deficient strains wc-1 and frq10 do. This indicates that the role of the WCC in growth regeneration is largely independent of its photoreceptor function.

      The authors do not apparently reconcile that the effect of starvation is to hugely decreases WCC levels, but they find the transcriptional and growth response to starvation requires WCC?

      We agree with the reviewer that the problem of how low levels of WCC could sufficiently support the transcription of frq and different output genes under starvation conditions was not discussed properly. Our results suggest a model in which the maintained level of nuclear WCC and the weakened inhibition by both FRQ (the hyperphosphorylated form is less active in the negative feedback) and PKA (its activity lowered upon glucose depletion) together might ensure that transcriptional activity of the WCC is preserved upon glucose withdrawal in both DD and LL despite the decrease of the overall level of the complex. In the revised version these aspects are discussed more thoroughly (P16-18).

      This study contributes to the increased focus of the circadian community on the regulation of outputs by circadian oscillators. The manuscript will be of interest to many in the field. There needs to be less assumption of knowledge about the N. Crassa circadian system, and better discussion in a broader context of clocks in other kingdoms.

      We added a new section to the Discussion with data concerning interrelationships between glucose availability and the circadian clock in other organisms.

    1. Children’s codeswitching and translanguaging is influenced by the language model provided by parents and significant others in the family, school and community.

      I think this is very important for teachers to know as culturally code-switching and translanguaging may be a huge issue in the home. It is super important to make connections to students first language but we must do research into the families norms and what they want for their child within learning a new language.

    2. Serratrice (2013) notes that the profile of bilinguals constantly changes as their need for and use of each of their languages can vary greatly over time, depending on such factors as context, purpose, the formality of the situation, and who they wish or need to interact with. The term dynamic bilingualism captures this ever-changing nature of language use by emergent bilinguals (O. Garcia, 2009a).

      I think dynamic bilingualism is important for us as future teachers to acknowledge and understand. I also wonder how majority languages within the individual's everyday life impacts the changes between their usage of their languages. In the classroom this could effect teachers because we may become accustom to the use of English with our emergent bilingual students, however, we should still encourage and provide opportunities for students to use of their native/home language during learning. As teachers we should allow opportunities for students to further develop all their languages.

      -Lauren Mitchell

    1. reading too much into this? May

      I know annoying as a director or maybe even as a audience member I should assume thing but I don’t know in my opinion maybe it was aright if passage because he could’ve just asked her for the scissors or stoped what he was doing and got them for his self. But I could also argue that why didn’t he say something then. This made me think of the many questions we have for our parents that we never get answers to.

    1. Anxiety Makes Me Feel Like I am Losing My MindAnxiety, Mental Health, Therapy, Treatment<img width="550" height="321" src="https://elevationbehavioralhealth.com/wp-content/uploads/2019/01/anxiety-makes-me-feel-like-i-am-losing-my-mind-550x321.jpg.webp" class="attachment-entry_with_sidebar size-entry_with_sidebar wp-post-image" alt="i feel like i&#039;m losing my mind" /> Table of Contents Help! Anxiety Makes Me Feel Like I am Losing My MindI Feel Like I’m Losing My MindDifferent Types of Anxiety DisordersHow to Manage AnxietyHolistic Therapies That Help Manage StressElevation Behavioral Health Provides Expert Treatment for Anxiety  Help! Anxiety Makes Me Feel Like I am Losing My Mind Anxiety can be so hard to live with. Constant worry and stress keep you in a state of constant fight-or-flight mode at the slightest little trigger. You may try to reason with yourself, that the stress triggers are no big deal. Your brain, though, is locked and loaded to take you through the spectrum of anxiety symptoms. You just can’t seem to break the stress cycle. Many who approach a doctor with their complaints about their symptoms have truly suffered. They are seeking ways to manage the stress so they can live a normal, happy life. This goal is very possible to reach with the right treatment plan. Anxiety treatment can help reduce when you find yourself expressing am I losing my mind and help reduce the daily struggle and greatly improve your life. <img class="alignright wp-image-28337" src="https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/losingmind.jpg.webp" alt="i'm losing my mind" width="300" height="634" srcset="https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/losingmind.jpg.webp 568w,https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/losingmind-142x300.jpg.webp 142w,https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/losingmind-488x1030.jpg.webp 488w,https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/losingmind-334x705.jpg.webp 334w" sizes="(max-width: 300px) 100vw, 300px" />I Feel Like I’m Losing My Mind Anxiety disorder is a broad grouping of mental health disorders, each with excess worry or fear driving it. Anxiety disorders are very common, with 40 million people struggling with one each year. This disorder is different from the common fear you might feel before having to make a public speech. We all have felt afraid from time to time, like when we are pushed out of our comfort zone. Anxiety disorders, though, are very intrusive. Constant stress can be so difficult to manage that it impacts one’s lifestyle, career, health, and friendships. What It Feels Like On one hand, when someone suffers from this problem, something will trigger a cascade of symptoms. There are many types of anxiety and each has its own unique features. The basic anxiety symptoms include: Feelings of dread and fear. Always being on alert for danger. Racing heart. Shaking. Sweating. Fast breathing. Shortness of breath, holding one’s breath. Stomach upset, diarrhea. Feeling jumpy or restless. Insomnia. Headaches. Different Types of Anxiety Disorders There are varied ways that anxiety is expressed. For this reason, there are six types of mental health disorders. The anxiety spectrum includes: Generalized anxiety disorder: GAD features constant worry for much of the day. This can result in headaches, muscle tension, nausea, and trouble thinking. Panic disorder: Sudden and unexplained feelings of intense terror. This can cause a racing heart, shortness of breath, nausea, chest pain, feeling out of my mind, dizzy. May lead to social isolation to avoid having an attack. Social anxiety: Intense fear of being judged or critiqued. Fear of being embarrassed in public. Causes social isolation. Specific phobias: Irrational fear of a certain thing, place, or situation. To manage this fear, the person will go to great measures to avoid triggers. Trauma disorder: PTSD is about never getting over trauma, even months later, It can lead to avoidance of people, places, or situations that trigger thoughts of the event. Flashbacks, nightmares, or repeated thoughts of the trauma stoke the symptoms. Obsessive-compulsive disorder: OCD involves worries about things like germs, causing harm, or a need for order. This drives compulsive behaviors in an attempt to manage the symptoms of anxiety caused by the fear. How to Manage Anxiety Do the symptoms of anxiety make you feel like you’re losing your mind? If so, it is time to meet with a mental health worker. At the first meeting, a therapist will assess what type of anxiety you are dealing with. We Can Help! Call Now! (888) 561-0868 He or she will then design a treatment plan that will help you manage the symptoms. The treatment uses a combined approach with psychotherapy, drugs, and healthy actions that help to reduce stress. Therapy for anxiety is based on the type you have. CBT is very helpful for people that struggle with excess worry and fear. It also helps you to notice how your thoughts are driving the panic-type response to a trigger. CBT then guides you toward changing those fear-based thoughts into more positive ones. Once the thoughts are reframed, the actions that follow will also be positive. Anti-anxiety drugs from the benzo group can be helpful for some people. These drugs work swiftly to help calm nerves and relax you. In some cases, antidepressants are used to treat anxiety as well. <img class="alignright wp-image-28339" src="https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/maninmirror.jpg.webp" alt="feel like i'm losing my mind" width="300" height="634" srcset="https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/maninmirror.jpg.webp 568w,https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/maninmirror-142x300.jpg.webp 142w,https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/maninmirror-488x1030.jpg.webp 488w,https://elevationbehavioralhealth.com/wp-content/uploads/2019/06/maninmirror-334x705.jpg.webp 334w" sizes="(max-width: 300px) 100vw, 300px" /> Holistic Therapies That Help Manage Stress Holistic therapy self-care for stress actions is now often found in the treatment plan for anxiety. This is because these activities can help improve the treatment outcome. They do this by teaching patients ways to achieve a relaxed state of being. For instance, some of these include: Yoga. Mindfulness. Deep breathing Acupuncture. Massage therapy. Equine therapy. Art therapy Elevation Behavioral Health Provides Expert Treatment for Anxiety  Elevation Behavioral Health is an upscale residential mental health treatment center in Los Angeles. If you feel like anxiety makes you feel like you’re losing your mind, our caring team of experts can help. It is time to seek the treatment you deserve to regain your quality of life. When your outpatient treatment is not giving the results you desire, consider a residential program. Treatment is much more focused, and the home-like setting gives you a chance to heal. Take a break from the stressors or triggers in your daily life. Enjoy our upscale private home and gorgeous setting. Our team will help guide you back to health and wellbeing. For questions about our program, reach out to us today at (888) 561-0868. November 22, 2020/by Elevation Behavioral HealthTags: am i losing my mind, feel like im losing my mind, help im losing my mind, i feel like i am losing my mind, i think im losing my mind, losing my mind, losing your mindShare this entryShare on FacebookShare on TwitterShare on PinterestShare on LinkedInShare on TumblrShare on VkShare on RedditShare by Mail https://elevationbehavioralhealth.com/wp-content/uploads/2019/01/anxiety-makes-me-feel-like-i-am-losing-my-mind.jpg 366 550 Elevation Behavioral Health https://elevationbehavioralhealth.com/wp-content/uploads/2018/12/logo_ebh.png Elevation Behavioral Health2020-11-22 01:00:132022-07-08 16:31:14Anxiety Makes Me Feel Like I am Losing My Mind

      When Anxiety is too Much I Feel Like I am Losing My Mind

    1. Can a Narcissist Stop Lying Even With Evidence?Behavior, Mental Health<img width="845" height="321" src="https://elevationbehavioralhealth.com/wp-content/uploads/2022/04/why-do-narcissists-lie-845x321.jpg" class="attachment-entry_with_sidebar size-entry_with_sidebar wp-post-image" alt="why do narcissists lie" /> Table of Contents Why Do Narcissists LieAbout Narcissistic Personality DisorderWhy Someone With NPD LiesLies Often Turn Into GaslightingYou Are the Narcissistic Supply SourceBreaking Free From an NPD LiarElevation Behavioral Health Provides Residential Luxury Mental Health Treatment Why Do Narcissists Lie Are narcissists compulsive liars? Can a narcissist ever stop lying, even when confronted with evidence of their lies? Learn all about narcissistic personality disorder. If you are involved with a narcissist, then you are quite used to being lied to. Their constant lies simply come with the territory. To a normal person, it may be very perplexing to be lied to all the time by someone who purports to care for you. Learn about what the narcissist seems to gain from telling lies all time. About Narcissistic Personality Disorder Narcissistic personality disorder (NPD) is a mental health disorder that stems from an unhealthy and inflated view of self. At least, that’s how it appears on the outside. Inside, though, the NPD really has a very low opinion of him or herself. All of their heinous behaviors are driven by a need to pump themselves up in their own eyes and others’. Individuals with NPD often seek out partners who have certain traits. For instance, they may be a compassionate and sensitive person, but may also be needy and have low self-esteem. Like a leech that latches to a blood source, the NPD latches onto its victim. Over time, the NPD slowly chips away at the victim’s sense of self-worth. Through lies and gaslighting, they put them down and cause them to doubt themselves. Through this emotional abuse, they can control the victim. But because the NPD has no conscience, they never feel regret or remorse for mistreating their partner. Someone with NPD demands constant admiration and praise while keeping their victim from receiving any. A narcissist does not want any competition. Symptoms of NPD include: Lacks empathy or compassion for others. Feels entitled to special treatment. Expects others to fawn over them. Belittles others; talks down to people. Takes advantage of the others’ weaknesses to build themselves up. Self important; arrogant. May hog the conversation. Emotionally detached. Believes that others envy him. Boastful and pretentious. Becomes angry if challenged. Torments the victim with fear. Has a bad temper; sudden angry outbursts. Easily slighted, sensitive to criticism. Doesn’t notice the needs of others. Emotionally stingy. May isolate their victim from friends. Feels insecure inside; self-loathing. Not willing to go to therapy. The NPD will refuse to get help, believing that they are perfect and beyond reproach. Why Someone With NPD Lies Why do narcissists lie… all the time? If you confront them with proof of the lie, they will still attempt to lie their way out of it. What inspires lying? Simply put, the NPD lies in order to inflate his or her own self-esteem. They lie to the other person, to beat them. By inflating truths, they attempt to make their own skills or abilities seem superior to the other person. In other words, they are a boar, the type of person people avoid at a party. We Can Help! Call Now! (888) 561-0868 When the NPD lies, he or she is trying to make themselves appear dominant. They lie for self-gain believing that telling mistruths makes them look smarter than the other person. Having a victim at their side who they can lie to provides them with a constant narcissistic supply, someone that fuels their sickness. When they impress their partner with their lies, they receive a rush or hit to feel better about themselves. Lies Often Turn Into Gaslighting For the NPD, the lies are often a prelude to gaslighting. Gaslighting is a psychological weapon used by some to keep a person emotionally off-balance. When they lie to the person’s face about what may have occurred, they cause the victim to question their own sanity. When the victim confronts the NPD with solid evidence of a misdeed, they will be met with lies. Not only will the NPD lie and deny it ever happened, but they are also likely to attack. This is where the gaslighting begins. They will attempt to twist the event around to become the fault of the victim. You Are the Narcissistic Supply Source There is a reason why the NPD wants to keep their victim around; the victim fulfills a need for them. They fill up their NPD cup daily by sucking the life out of the unsuspecting partner. Thus, the victim is not even aware of the role they play in the illness at first. The NPD will therefore go to great lengths to keep the victim from leaving them. Some tactics they use include: They may cry false tears to elicit sympathy, thus keeping the victim engaged. They may use force or become violent to assert dominance. They may try to manipulate the victim through guilt. They may threaten the victim by taking the money away or causing some type of harm. They make the victim feel bad about themselves so they won’t think they can do any better. They may threaten suicide, although it is an empty threat. Breaking Free From an NPD Liar If you have woken up to realize you are in a relationship with an NPD, you should run, not walk, to the exits. The sad truth is that these people are rarely able to change their ways, mostly because they don’t want to. In their own minds they feel they never do wrong, so why go to therapy? Partner with a therapist who can offer guidance and support as you detach from the NPD. These people can and do become violent when faced with their N-source leaving them. Prepare for the false promises and tears, as they play on your sense of compassion to keep you entrenched in the abuse cycle. So, can a narcissist stop lying, even with evidence of their lies? The answer is very clear: no, they cannot. Elevation Behavioral Health Provides Residential Luxury Mental Health Treatment Elevation Behavioral Health can help someone who is the victim of a narcissist. Our dedicated team is here to guide you toward wellness and discovering new insights. For questions about our program, please call us today at (888) 561-0868. April 27, 2022/by Elevation Behavioral HealthTags: dealing with a narcissist, lying narcissist, narcissist, when narcissist lieShare this entryShare on FacebookShare on TwitterShare on PinterestShare on LinkedInShare on TumblrShare on VkShare on RedditShare by Mail https://elevationbehavioralhealth.com/wp-content/uploads/2022/04/why-do-narcissists-lie.jpg 687 1030 Elevation Behavioral Health https://elevationbehavioralhealth.com/wp-content/uploads/2018/12/logo_ebh.png Elevation Behavioral Health2022-04-27 18:09:152022-04-27 18:09:15Can a Narcissist Stop Lying Even With Evidence?

      Are narcissists compulsive liars? Can a narcissist ever stop lying, even when confronted with evidence of their narcissistic lies? Learn all about narcissistic personality disorder.

    1. Author Response

      Reviewer #1 (Public Review):

      Kohler and Murray present high-throughput image-based measurements of how low-copy F plasmids move (segregate) inside E. coli cell. This active segregation ensures that each daughter cell inherit equal share of the plasmids. Previous work by different labs has shown that faithful F-plasmid segregation (as well as segregation of many other low-copy plasmids, segregation of chromosomes in many bacterial species and segregation of come supramolecular complexes) require ParA and ParB proteins (or proteins similar to them) and is achieved by an active transport mechanism. ParB is known to bind to the cargo (plasmid) and ParA forms a dimer upon ATP binding that binds to DNA (chromosome) non-specifically and also can bind to ParB (associated with cargo). After ATP hydrolysis (stimulated by the interaction with ParB), ParA dimer dissociates to monomers and from ParB and the chromosome. While different mechanisms of the ParA-dependent active transport had been proposed, recently two mechanisms become most popular - one based on the elastic dynamics of the chromatin (Lim et al. eLife 2014, Surovtsev PNAS 2016, Hu et al Biophys.J 2017, Schumaher Dev.Cell 2017) and the other based on a theoretically-derived "chemophoretic" force (Sugawara & Kaneko Biophysics 2011, Walter et al. Phys.Rev.Lett. 2017).

      It is a minor comment, but we would like to point out that we do not consider these two model types as alternatives but rather as models with different levels of coarse-graining. Our interest is in the molecular-level (stochastic) models (Lim et al. eLife 2014, Surovtsev PNAS 2016, Hu et al PNAS 2015, Hu et al Biophys.J 2017, Schumacher Dev.Cell 2017).

      The authors start by following motion of F plasmid with one or two plasmids per cell and by analyzing plasmid spatial distribution, plasmid displacement (referred to as velocity) as a function of their relative position, and autocorrelations of the position and the displacement. They concluded that these metrics are consistent with 'true positioning' (i.e. average displacement is biased toward the target position - center for one plasmid and 1/4 and 3/4 positions for two plasmids ) but not with 'approximate positioning' (i.e. when plasmid moves around target position, for example, in near-oscillatory fashion). This 'true positioning' can be described as a particle moving on the over-dampened spring. They reproduce this behavior by expanding the previous model for 'DNA-relay' mechanism (Lim et al. eLife 2014, Surovtsev PNAS 2016), in which plasmid is actively moved by the elastic force from the chromosome and ParA serves to transmit this force from the chromosome to the plasmid. Now, the authors explicitly consider in the model that the chromosome-bound ParA can diffuse (which the authors refer as 'hopping') and this allows the model to achieve 'true plasmid positioning' for some combination of model parameters in addition to oscillatory dynamics reported in the original paper (Surovtsev PNAS 2016).

      Based on their computational model, the authors proposed that two parameters, diffusion scale of ParA = 2(2Dh/kd)1/2/L (typical length diffused by ParA before dissociation) and ratio of ParB-dependent and independent hydrolysis rates = kh/kd are key control parameters defining what qualitative behavior is observed - random diffusion, near-oscillatory behavior, or overdamped spring ('true positioning'). They vary this two parameters ~30- fold and ~200-fold range by changing Dh and kh respectively, to illustrate how dynamics of the system changes between these 3 modes of motion. While these parameters clearly play important role, the drawback is that the authors did not put either theoretical reasoning why these parameters are truly governing or showed it by varying other model parameters (kh, number of ParA NParA, spring constant of chromosome k, diffusion coefficient of the plasmid Dp) to show that only these combinations define the type of the system behavior. The authors qualitative analysis on importance of relies on the steady state solution for the diffusion equation for ParA. It is really unfortunate that no ParA distribution was measured simultaneously with the plasmid motion, as this would allow to compare experimental ParA profiles to expected quasi-steady-state solutions.

      We spend almost an entire section and a figure explaining the theoretical reasoning behind the identification of the $\lambda=s/(L/2n)$ as an important system parameter (section “Hopping of ParA-ATP on the nucleoid as an explanation of regular positioning” and Figure 2) and predicted that regular positioning could only occur for $\lambda>1$. This was confirmed by parameter sweeps for the cases of 1 (Figure 3I) and multiple plasmids (Figure 5-figure supplement 1), indicating that $\lambda$ is indeed an important system parameter and that our conceptual understanding of this aspect of the system is correct. This point has now been made clearer.

      However, we agree that the reasoning for $\epsilon$ (varied through the hydrolysis rate $k_h$) was not clear. It was chosen to allow us to modulate the ParA concentration at the plasmid compared to elsewhere, motivated by the differences between different ParABS systems. We originally had also considered a third quantity related to the number of nucleoid-bound ParA but we found that this had little effect on the nature of the dynamics. All three quantities describe how the timescale of a reaction/process (ParA hopping/diffusion across the nucleoid, ParB induced hydrolsysis, ParA association to the nucleoid) compares to the timescale of basal hydrolysis, which we use as a reference timescale.

      We have now made this clearer as well as adding supplementary figures showing the effect of varying other system parameters at several locations in the phase diagram (Figure 3-figure supplement 3 and 4). These sweeps justify our identification of $\epsilon$ and $\lambda$ as a useful/important set of quantities for determining the dynamics of the system.

      Additionally, we now add example kymographs showing the ParA distribution (Figure 3-figure supplement 2C).

      The authors also show by simulations that overdamped spring dynamics can transition into oscillatory behavior when decreases, for example by cell growth. Indeed, they observed more oscillatory behavior when they compared single-plasmid dynamics in the longer cells compared to the shorter cells. This was not the case in double-plasmid cells, in eprfect agreement with their analysis. They also calculated ATP consumption in the model and concluded that the system operates close but below (perhaps, "above" should be used as it refers to bigger ) the threshold to oscillatory regime which minimize ATP consumption. While ATP consumption analysis is very intriguing, this statement (Abstract Ln24-25) seems at odds with the authors own analysis that another ParA-dependent plasmid system, pB171, operates mostly in oscillatory regime, and it is actually for this regime the authors' analysis suggest minimal ATP-consumption (Fig. 8).

      To clarify, we found that pB171 (which in our hands has a copy number of 2-3 in the SR1 reduced-copy-number strain) is only clearly oscillatory in cells with a single plasmid (and only mildly so in cells with two plasmids). Otherwise, it behaves very similarly to F plasmid. We therefore believe that these two distantly related ParABS systems exhibit, overall, similar dynamics and differ only in how close the systems are to the threshold of oscillatory instability. This was not clear as we did not specify the copy number of pB171. We now provide this in Figure 7–figure supplement 1.

      We refer to these systems as lying just below, rather than above, the threshold of the oscillatory instability because, on average, plasmids do not oscillate but only do so in cells with the lowest plasmid concentration.

      I think the real strength of the paper is that it can potentially to show that if one considers that the intracellular cargo can be moved by the fluctuating chromosome via ParA-mediated attachments, then various dynamics can be achieved depending on combinations of several control parameters (plasmid diffusion coefficient, ParA diffusion coefficient, rate of hydrolysis and so on) including previously reported 'oscillations' (Surovtsev PNAS 2016), 'local excursions' (Hu et al Biophys.J 2017) and 'true positioning' (Schumaher Dev.Cell 2017). The main drawback (in this reviewer opinion) that this is obscured by the current presentation and discussion of this work and previous modelling work on ParA-dependent systems. For example, instead of using "unifying" potential of the presented model, yet another name 'relay and hopping' is used in addition to previously used 'DNA-relay', 'Brownian ratchet', 'Flux-based positioning', …

      In the abstract and discussion, we already refer to developing a “unified” model (p1 L21, p15 L22 of the original manuscript) and in the discussion we explain how our model contains other models as limiting cases. But we agree with this recommendation - the unifying nature of our model is its main strength. We now emphasise this more.

      Regarding the model name, we felt obliged to refer to the previous named models (DNA-relay and Brownian ratchet) and simply gave our model a name to avoid confusion when making comparisons. We have now removed almost all mention of ‘hopping and relay’ and just refer to ‘our model’. However, our gitlab repository with the code must have a name and therefore is still called ‘Hopping and relay’ and so the same term is used in Table 3.

      … and it appears that the presented model is an alternative to these previously published work. And only in model description (in Methods section) one can find that the "... model is an extension of the previous DNA-relay model (Surovtsev et al., 2016a) that incorporates hopping and basal hydrolysis of ParA and uses analytic expressions for the fluctuations rather than a second order approximation"(p.17, ln15-17).

      We are sorry that this reviewer felt that the fact that our model is an extension of DNA relay is hidden in the methods. However, we wrote in the main text:

      “Motivated by the previous discussion, we decided to develop our own minimal molecular model (‘hopping and relay’) of ParABS positioning, taking the DNA relay model as a starting point … The original scheme is as follows… We supplemented this scheme with two additional components: diffusion (hopping) of DNA-bound ParA-ATP dimers across the nucleoid (with diffusion coefficient Dh, where the subscript indicates diffusion of the home position) and plasmid-independent ATP hydrolysis and dissociation (with rate kd). See Material and Methods for further details of the model. “

      We now make this clearer.

      However, we would argue that as models of the same system, there are naturally overlaps and the models of Hu et al and Schumacher et al could also be thought of as extensions of the DNA relay model.

      While it is of course the authors right to decide how to name their model, it should be explicitly clear to the reader what is a real conceptual difference between presented and previous models from the abstract, introduction and discussion section of the paper, not from the "fine-print" details in the supplementary materials.

      The main conceptual difference is that we have identified the importance of having a finite diffusive length scale for ParA diffusion/hopping on the nucleoid. This allows both oscillations and regular positioning to occur for biologically relevant parameter values and reproduces the length dependent transition from mid-cell positioning to confined oscillations that we observe for F plasmid. The DNA relay model does not have this behaviour as the ParA diffusive length scale in zero while it is infinite in the models of Ietswaart et al 2014 and Schumacher et al 2017. The model of Hu et al 2017 does have a finite length scale but the authors appear not to have realised its importance and never discovered the regular positioning regime at \lambda >1. While we make these points in the discussion in the context of Figure 8A, where we compare our model to the others, we agree with this reviewer that we should have been more explicit in the abstract and introduction. We have now corrected this.

      This would allow to avoid unnecessary confusion (especially for the readers not directly involved into the modelling of ParA/B system) and clarify that all these models rely on the elastic behavior of fluctuating chromosome to drive active transport of the cargo. This reviewer believes that more explicit discussion on the models (one from the authors and previously published) differences and similarities will help with our understanding of how ParA-dependent system operate. This discussion should also include works on PomXYZ system, in which it was shown that similar dynamic system can lead to specific positioning within the cell (Schumaher Dev.Cell 2017, Kober et al. Biophys.J 2019). This will may it explicit that the models results have direct impact beyond the ParA-dependent plasmid segregation.

      To further clarify the differences between the models (beyond the second and third sections of the main text and the discussion), we have now added a section to the methods and a new table (Table 3). We have also included the mentioned PomXYZ model. However, we would like this was not the first stochastic model to have ‘true’ positioning as this reviewer cites above. Though they did not include the mechanism of force generation, the model of Ietswaart et al 2014 produces regularly positioned plasmids and is referenced repeatedly in Schumacher et al. 2017.

      I think that expanded parameter analysis, and explicit model comparison/discussion will make the contribution of this work to the field more clear and with the potential to advance our general understanding of how the same underlying mechanism can lead to various modes of intracellular dynamics and patterning depending on parameters combination.

      Reviewer #2 (Public Review):

      The work presented in this manuscript details an analysis of the partitioning of low copy plasmids under the control of the ParABS system in bacteria. Using a high throughput imaging set up they were able to track the dynamics of the partition complex of one to a few plasmids over many cell cycles. The work provides an impressive amount of quantitative data for this chemo-mechanical system. Using this data, the paper sought to clarify whether the dynamics of plasmids is due to regular positioning or noisy oscillations around a mean position. They supplement their experimental work with an intuitive model that combines elements of previous modelling efforts. Their model relies on diffusion of the ParA substrate on the nucleoid with the dynamics of the ParB partition complex being driven by the underlying elastic force due to the nucleoid on which the substrate is tethered. Their model dynamics depend on two parameters, the ratio of the length over which the substrate can explore to the characteristic length of the space and the ratio of stimulated to non-stimulated hydrolysis rates of the substrate. If the length ratio is large, ParA can fully explore the space before interacting with the ParB complex leading to balanced fluxes and regular positioning. If it gets reduced, for example by lengthening the cell, oscillations can emerge as fluxes of substrates become imbalanced and a net force can pull the partition complex.

      Strengths:

      Given the large amount of data, the observations unambiguously show that one particular ParABS system under the conditions studied is carrying out regular positioning of plasmids. The model synthesizes prior work into a nice intuitive picture. These model parameters can be fit to the data leading to estimates of molecular kinetic parameters that are reasonable and in line with other observations. Lining up the experimental observations with the phase space of the model suggests that the system is poised on the edge of oscillations, allowing for the system to have regular positioning with low resource consumption.

      Weaknesses:

      However, despite the correspondence of the simulated results with the experimental findings, other explanations are not completely ruled out. The paper emphasizes that ParA diffusion/hopping on the nucleoid is essential for the establishment of regular positioning and that without it, only oscillations were possible. Prior simulation efforts, that the paper cites, which include ParA diffusion and mixing in the cytosol but no diffusion on the nucleoid have shown that regular positioning is possible and that oscillations could get triggered as the system lengthened. Thus ParA hopping is not a necessity for regular positioning (as claimed in the paper), but very well might be needed for the given kinetic parameters of the system studied here.

      We now comment on this result. In short, we believe that the mentioned model/regime is not relevant due to stochastic effects. We are not able to produce, with biological relevant parameters, regular positioning without ParA hopping.

      The paper also presents experimental results for a second ParABS system (pB171) that is more likely to show oscillations. They attribute the greater likelihood of oscillations for pB1717 being due to ParA exploring a smaller space than the F plasmid system that showed regular positioning. This is pure conjecture and the paper does not provide any evidence that this is the reason. Thus it is hard to conclude if oscillations may not be due to other factors.

      We do not explicitly make that claim. We did have a point in the phase diagram of Figure 8A representing pB171 with a lower value of lambda than F plasmid and stated “The location of pB171 is an estimate based on a qualitative comparison of its dynamics”. We agree this was unclear.

      We now indicate the region that has oscillations with roughly the same period as single plasmids of pB171. We also make it clear that we speculate, but have not shown, that the length scale of ParA hopping is smaller than for F plasmid.

      An important point here is that we can explain both oscillations and regular positioning in the same model with the same kinetic parameters, the regimes being determined by the cell length and plasmid number in a manner consistent with experimental observations.

    1. Author Response

      Reviewer #1 (Public Review):

      The authors use the nanobody tools generated in the companion manuscript and have combined them with DNA-Paint oligonucleotide labeling to generate super-resolution images of indirect flight muscles. Using this approach, they could map the precise organization of the different domains from the two giant titin-like fly homologs called Sallimus and Projectin against which the nanobodies had been raised with a precision ranging from 1 nm to 4 nm, depending on the distance between them. They show that in indirect flight muscles the N-ter of Sallimus is located within 50 nm of the Z-disc, and that its C-ter reaches the A-band roughly 100 nm away from the Z-disc. Likewise, they show that the N-ter of Projectin colocalizes with the C-ter of Sallimus at the edge of the A-band, whereas its C-ter is located about 250 nm away in the A-band and 350 nm from the Z-disc. It overall suggests a staggered and linear organization of both proteins with a potential area of overlap spanning 10-12 nm, that Sallimus could bridge the Z-disc to the A-band acting as a ruler, while Projectin should only overlap with 15% of the A-band and possibly a 10 nm of the I-band.

      Thanks for this nice summary of our findings.

      The value of this work comes from its use of advanced technologies (DNA-Paint + superresolution). The biological conclusions confirm and refine earlier and recent papers, especially EM papers and the impressive and very comprehensive JCB paper by Szikora et al in 2020, although the conclusions of the present work differ somewhat from those of Szikora who had predicted that Sallimus does not reach the A-band. That aspect could have been better discussed.

      We have further extended our discussions of the results from Szikora et al. 2020, in particular regarding Sallimus in this revised version.

      Reviewer #2 (Public Review):

      Taking advantage of the high molecular order of the Drosophila flight muscle, Schueder, Mangeol et al. leverage small (<4 nm) original nanobodies, tailored coupling to fluorophores, and DNA-PAINT resolution capabilities, to map the nanoarchitecture of two titin homologs, Sallismus and Projectin.

      Using a toolkit of nanobodies designed to bind to specific domains of the two proteins (described in the companion article "A nanobody toolbox to investigate localisation and dynamics of Drosophila titins" ), Schueder, Mangeol et al position these domains within the sarcomere with <5nm resolution, and demonstrate that the N-ter of Sallismus overlaps with the C-ter of Projectin at the A-band/I-band interface. They propose this architecture may help to anchor Sallismus to the muscle, thus supporting flight muscle function while ensuring muscle integrity.

      This study nicely extends previous work by Szikora et al, and precisely dissect the the sarcomeric geography of Sallismus and Projectin. From these results, the authors formulate specific functional hypotheses regarding the organization of flight muscles and how these are tuned to the mechanical constraints they undergo.

      Although they remain descriptive in essence, the conclusions of the paper are well supported by the experimental results.

      We thank this reviewer for the nice summary of our results.

      Reviewer #3 (Public Review):

      This manuscript by Schueder et al. provides new insight into an important question in muscle biology: how can the smaller titin-like molecules of the much larger sarcomeres of invertebrate muscle perform the same function as the larger titin of vertebrate muscles which have smaller sarcomeres? These functions include the assembly, stability and elasticity of the sarcomere. Using two state of the art methods--nanobodies and DNA-PAINT superresolution microscopy, the authors definitively show that in the highly ordered indirect flight muscle of Drosophila, the elongated proteins Sallimus and Projectin are arranged such that the N-terminus of Sallimus is embedded in the Z-disk, and the C-terminus is embedded in the outer portion of the A-band, and that in this outer portion of the A-band is also embedded the C-terminus of Projectin; thus, if the C-terminus of Sallimus can bind to thick filaments, and/or these overlapping portions of Sallimus and Projectin interact, there would be a linkage of the Z-disk and/or thin filament to the thick filaments to help determine the length and stability of the sarcomere.

      The strengths of this paper include the implementation of nanobody and DNA-PAINT superresolution microscopy for the first time for muscle. The extraordinary 5-10 nm resolution of this method alloiws imaging for definitive localization of the termini of these elongated proteins in the Drosophila flight muscle sarcomere. In addition, the manuscript is well written with sufficient background information and rationale presented, is easy to read, complex new methods are well-described, the figures are of high quality, and the conclusions are well-justified. A minor weakness is that despite the authors demonstrating that the Cterminus of Sallimus is located at the outer edge of the A-band, and that the N-terminus of Projectin is located also in the outer edge of the A-band, the authors provide no data to show whether, for example, these portions of these titin-like molecules interact, or whether Sallimus might interact with thick filaments. Such data would be required to prove their model. However, I can understand that this would require extensive additional study, and the authors have already provided a tremendous amount of data for this first step in supporting the model. Nevertheless, the authors should cite a relevant previous study on the Sallimus homolog in C. elegans called TTN-1, which is also a 2 MDa polypeptide of similar domain organization to at least the large isoforms of Salliums found in fly synchronous muscles. In the study by Forbes et al. (2010), immunostaining, albeit not to the impressive resolution achieved in the present paper, showed that TTN-1 was also localized to the I-band with extension into the outer edge of the A-band. More importantly, that study also showed that "fragment 11/12", Ig38-40, which is located fairly close to the C-terminus of TTN-1 binds to myosin with nanomolar affinity (Kd= 1.5 nM), making plausible the idea that TTN-1 may bind to the thick filament in vivo.

      We thank this reviewer for sharing his enthusiasm about our results and methodology, and also about the way the data are presented. This is one more argument for us to leave a shortened Figure 1 in the PAINT manuscript.

      We are particularly thankful for pointing out the important C. elegans data that we had missed and that, as the reviewer said, perfectly fit with the model we propose for flight muscle (and also the larval muscle data, as the C-term of Sls is the same). Hence, we highlight this paper now in our discussion and compare to our findings.

      Reviewer #4 (Public Review):

      This manuscript reports combining recently developed and described in the accompanying paper nanobodies against Sallimus and Projectin with DNA-Paint technology that allows super-resolution imaging. Presented data prove that such a combination provides a powerful system for imaging at a nano-scale the large and protein-dense structures such as Drosophila flight muscle. The main outcome is the observation that in flight muscle sarcomeres Salimus and Projectin overlap at the I/A band border. This was elegantly achieved using double color DNA-Paint with Sls and Projectin nanobodies.

      We thank the reviewer for appreciating the quality of our work.

      Overall, as it stands, this manuscript even if of high technological value, remains entirely descriptive and short in providing new insights into muscle structure and architecture. The main finding, an overlap between short Sls isoform and Proj in flight muscle sarcomeres, is redundant with the author's observation (described in the companion paper "A nanobody toolbox to investigate localisation and dynamics of Drosophila titins") that in larval muscles expressing a long Sls isoform, Sls and Proj overlap as well.

      Alternatively, combination of Sls and Proj nanobodies with DNA-Paint represents an interesting example of technological development that could strengthen the accompanying nanobodies toolkit manuscript.

      Every structural paper reports the structure and is thus by definition descriptive. This is the aim of our manuscript. We do not think that the other nanobody resource paper reports an overlap of Sls and Projectin in the larvae. To resolve such a possible overlap, super resolution would be needed. The other paper does report that larval Sls isoform is dramatically stretched, more than 2 µm, and that Projectin is decorating the thick filament, likely in an oriented manner. If N-term of Projectin overlaps with C-term of Sallimus in this muscle is an open question that needs DNA-PAINT imaging of larval muscle. This requires a TIRF setting that is technically not trivial to achieve for larval muscle and hence has not been done by anybody.

    1. The end of Twitter

      Ben Werdmüller sees the Musk take-over as one of more signs that Twitter as we know it is sunsetting. Like FB it is losing its role as the all-in-one communal 'space'. I think the decline is real, but also think it will be long drawn out decline. Early adopters and early main stream may well jump ship, if they haven't already some time ago. The rest, including companies, will hang around much longer, if only for the sunk costs (socially and capital). An alternative (hopefully a multitude as Ben suggests) needs to clearly present itself, but hasn't in a way the mainstream recognises I think. It may well hurt to hold on for many, but if there's no other thing to latch onto people will endure the pain. Boiling frog and all that.

    1. Author Resonse

      Reviewer #1 (Public Review):

      The manuscript by Himmel et al is an interesting study representing a topic of substantial interest to the somatosensory neurobiology community. Here, the authors use CIII peripheral neurons to investigate polymodality of sensory neurons. From vertebrates to invertebrates, this is a long-standing question in the field: how is it that the same class of sensory neurons that express receptors for myriad sensory modalities encode different behavioral responses. This system in Drosophila seems to be an intriguing system to study this question, making use of the genetic toolkit in the fly and ease of behavioral assays. In this study, the authors identify a number of channels that are important for cold nociception, and they showed that some of these do not appear to also encode mechanosensation. Despite my initial enthusiasm for this paper, halfway through, it felt as if I were reading two different papers that were loosely tied together. This lack of cohesion significantly reduced my enthusiasm for this work. Below are some of my criticisms:

      We thank Reviewer #1 for their feedback. In addition to the points below, and in accordance with the reviewer’s overall criticisms, we have revised the body text to make it more cohesive. Our main goal with this revision was to better explain to the reader the shift from anoctamins to SLC12 cotransporters.

      1) The first half of the paper is about a role for Anoctamins in cold nociception, but the second half switched somewhat abruptly to ncc69 and kcc. I assumed the authors would connect these genes in a genetic pathway, performing some kind of epistatic genetic interaction studies or even biochemical assays, and that this was the reason to switch the focus of the paper midway through. But this was not the case. Moreover, they performed a different constellation of experiments for the genes in the first half vs the second half of the paper (eg. Showed a role in cold nociception vs mechanosensation or showing phenotype from overexpression). This lack of cohesion made it difficult to follow the work.

      We have edited the text to better explain this shift. Two notable changes are: (1) moving the phylogenetics to Figure 1, to more immediately present and demonstrate that subdued is part of the ANO1/ANO2 family of calcium-activated chloride channels; and (2) a new cartoon schematic in Figure 6 to more strongly communicate to a reader that chloride is a hypothetical mechanism of cold discrimination.

      In short, previous work and our phylogenetic analyses indicate that subdued is a Cl- channel (we have moved the phylogeny earlier in the paper to make this clear from the onset). We were therefore surprised that knockdown/mutation resulted in reduced CT behavior, as neural Cl- currents are often inhibitory. Thus, we looked to known mechanisms of Cl- homeostasis to try to formulate an informed hypothesis about the function of anoctamins in this system; hence the shift in focus to SLC12.

      In response to the second half of the comment: We have in fact performed cold nociception and mechanosensation experiments for both the anoctamins and the SLC12 cotransporters, although the SLC12 mechanosensation results were in a supplemental figure. We have moved the mechanaosensation results to the main Figure 6 to make this clearer. With respect to simple overexpression, the goal of the anoctamin experiments was to test the necessity of anoctamins to cold-evoked behavior, whereas the goal of the SLC12 experiments was to differentially modulate Cl- homeostasis, and this could hypothetically be accomplished by both knockdown and overexpression (hence we performed both knockdown and overexpression).

      2) In Fig1B,C how does one confirm a CIII neuron is being analyzed. It might help the reader if there were at least some zoomed out photos where all the cell types are labeled and potentially compared to a schematic. Moreover, is there a CIII specific marker to use to co-stain for confirmation of neuron type?

      Our CIII fusion is a specific marker for CIII neurons. To better demonstrate this, we have added images of the new CIII fusion expression patterns overlapping with a previously described CIII GAL4 driver (i.e. nompC-GAL4), and provided text describing how the CIII fusion transgene was discovered and generated. Please see the new Figure 1-Figure supplement 1.

      3) As this paper is predicated on detecting differences by behavioral phenotype, the scoring analysis is not as robust as it could be, especially considering the wealth of tools in Drosophila for mapping behaviors. The "CT" phenotype is begging for a richer behavioral quantification. This critique becomes relevant here when considering the optogenetic induced CT behavior in Fig5. If the authors were to use unbiased quantitative metrics to measure behavior, they could show how similar the opto behavior is to the natural cold evoked behavior. Perhaps the two are not the same, although loosely fitting under the umbrella of "CT".

      In accordance with our response above to necessary revisions, we have added one additional metric and reorganized the figures to better demonstrate the complexity of the behavior. We have no further data or new tools at this time.

      To improve our optogenetic analyses, we have added data for Channelrhodopsin-dependent CIII activation, which has been previously shown to induce cold-like behaviors at high levels of activation and innocuous touch-like behaviors at low levels of activation (Turner, Armengol et al 2016). Further, we have added videos (Figure 5—videos 1-3) showing behavior in response to both Channelrhodopsin and Aurora activation.

      With respect to differences in behavior, we have pointed out some differences in the Aurora-evoked behavior from the cold-evoked behavior: chloride optogenetics induces innocuous touch-like behaviors following CT. Please see lines 296-299.

      4) Following on from the last comment, the touch assays in Fig3 have a different measurement system from the other figures. Perhaps touch deficits would be identified with richer behavioral quantification. Moreover, do these RNAi larvae show any responses to noxious mechanical stimulation?

      The touch assays necessarily have different metrics from cold assays, as the touch-evoked behaviors are quite different from cold-evoked change in length (which are relatively simple, prima facie).

      With respect to noxious mechanical stimulation, while Class III neurons have been shown to facilitate this modality and be connected to relevant circuitry (please see Hu et al 2017 https://doi.org/10.1038/nn.4580 and Takagi et al 2017 https://doi.org/10.1016/j.neuron.2017.10.030), Class IV neurons are the primary sensory neuron which initiate the noxious mechanical-induced rolling response. Although this is an interesting question, we believe it is outside the scope of this study.

      Reviewer #2 (Public Review):

      Himmel and colleagues study how individual sensory neurons can be tuned to detect noxious vs. gentle touch stimuli. Functional studies of Drosophila class III dendritic arborization neurons characterized roles in gentle touch and identified a receptor, NompC, and other factors that mediate these responses. Subsequent work primarily from the authors of the current study focused on roles for the same sensory neurons in cold nociception. The two proposed sensory inputs lead to quite distinct sets of behaviors, with touch leading to halting, head turning and reverse peristalsis, and noxious cold leading to whole body contraction. How activity of one type of sensory neuron could lead to such different responses remains an outstanding question, both at the levels of reception and circuitry.

      The cIII responses to noxious cold and innocuous touch raises questions that the authors address here, proposing that studies of this system could advance the understanding of chronic neuropathic pain. A candidate approach inspired by studies in vertebrate nociceptors led the authors to study anoctamin/TMEM16 channels subdued, and CG15270, termed wwk by the authors. The authors focus on a pathway for gentle touch vs. cold nociception discrimination through anoctamins. Several of the experiments in this manuscript are well done, in particular, the electrophysiological recordings provide a substantial advance. However, the genetic and expression analysis has several gaps and should be strengthened. The data also do not provide strong support for some key aspects of the proposed model, namely the importance of relative levels of Cl co-transporters.

      Major comments:

      1) Knockout studies are accomplished using two MiMIC insertions whose effects on subdued or CG15270/wwk are not characterized by the authors. This needs to be established. The MiMIC system is also not well explained in the text for readers.

      We have modified the text to better explain MiMICs (Lines 137-140) and we have verified the mutagenic effects of these MiMIC insertions via RT-PCR (Figure 2 – supplement 1). We believe these data, in conjunction with other converging lines of evidence (e.g. rescue) demonstrate necessity of these genes in cold nociception.

      2) Subdued expression is inferred by a Gal4 enhancer trap. This can be a hazardous way of determining expression patterns given the uncertain relevance of the local enhancers driving the expression. According to microarray analysis subdued is strongly expressed in cIII neurons, but c240-Gal4 is barely present compared to nearby neurons, raising questions about whether this line reflects the expression pattern, including levels, even though the authors suggest that the line is previously validated (line 95; it is unclear what previously validated means). Figure 1B should not be labeled "subdued > GFP" since it is not clear that this is the case. Another more direct method of assessing expression in cIII is necessary. Confidence is higher for wwk using a T2A-Gal4 line, however, Figure 1C might be misleading to readers and indicate that wwk-T2A-Gal4 is cIII specific whereas in supplemental data the authors show how it is much more broadly expressed. The expression pattern in the supplemental figures should be moved to the main figures.

      We have removed the phrase “previously validated” and we have modified Figure 1 to change how we refer to the GFP expression (removed “subdued > GFP”).

      In accordance with the response to necessary revisions above, we make use of several converging lines of evidence to infer expression, including GAL4 expression patterns, microarray, and qPCR (the two latter experiments from isolated CIII samples). That subdued and wwk are expressed in CIII is clearly the most parsimonious hypothesis.

      We have also carefully reviewed our body text to be certain we do not make claims of differential expression between different neural subtypes based on differences in fluorescence in the GAL4-driven GFP imaging. We do not believe that this would be a reasonable way to infer differences in expression levels in any instance.

      With respect to the design of Figure 1, the intent is not to mislead the reader, and we state in the text that wwk is not solely expressed in CIII (lines 120-125). As eLife makes supplemental figures available directly alongside the main figures, we have left the relevant supplemental figures as supplements – we simply think this makes more sense from a standpoint of readability and style.

      3) In figure 8 the authors propose a model in which the relative levels of K-Cl cotransporters Kcc (outward) and Ncc69 (inward) in cIII neurons determine high intracellular Cl- levels and a Cl- dependent depolarizing current in cIII neurons. They test this model using overexpression and loss of function data, but the results do not support their model since for most of the overexpression and LOF of kcc and ncc69 do not significantly affect cold nociception, the exception being ncc69 RNAi. The authors suggest that this could be due to Cl homeostasis regulated by other cotransporters. Nonetheless, it leaves a significant unexplained gap in the model that needs to be addressed.

      We respectfully disagree that our results are not consistent with the stated hypothesis. In fact, it is the lack of change under certain conditions which lend evidence against the alternative hypothesis that CIII neurons maintain relatively low intracellular Cl-. The hypothesis we are testing is that ncc69 expression is driving relatively high intracellular Cl- concentrations, thus resulting in depolarizing Cl- currents.

      Under this hypothesis, we would predict that knockdown of ncc69 and overexpression of kcc would reduce cold sensitivity at 5˚C. That knockdown of ncc69 and overexpression of kcc reduces cold sensitivity is consistent with this hypothesis (and we point out in text that the evidence for kcc is less convincing) – at the least, these results do not disprove it.

      Under this hypothesis, we would also predict that knockdown of kcc and overexpression of ncc69 would not result in reduced cold sensitivity at 5˚C. As there was no phenotype at 5C, our results are likewise consistent with the hypothesis (at the least, they do not disprove it).

      We did find it curious that ncc69 RNAi did not affect neural activity at 10˚C, but speculate that our inability to detect physiological effects for ncc69 knockdown are limitations of our electrophysiology methodology (and we discuss this in the manuscript).

      The only piece of data inconsistent with the hypothesis may be that kcc overexpression may not have affected cold nociception at 5˚C – the data aren’t overwhelmingly convincing. However, this is only one experiment among many, and we believe the preponderance of evidence is consistent with the hypothesis. That is not to say we believe this hypothesis has complete explanatory power, however, as noted by our discussion of both the ncc69 electrophysiological and kcc behavioral data, and by our suggestion that there may be other regulatory mechanisms at work. This latter suggestion is wholly speculative, and we believe appropriate for the discussion section. We agree (and state in the discussion) that this would require further experimentation.

      4) Related to the #3, the authors should verify the microarray data that form the basis for their differential expression model.

      We have performed qPCR for ncc69 and kcc. Although qPCR is semiquantitative when comparing between genes, the Ct value for ncc69 was lower than for kcc, indicating more transcripts were present at the onset (assuming identical efficacy). These data (although semi-quantitative), the microarray, and our behavioral and electrophysiological data are consistent with the stated hypothesis.

      Reviewer #3 (Public Review):

      There are also several modest weaknesses in the paper:

      1) A notable gap remains in the evidence for the hypothesized mechanisms that enhance electrical activity during cold stimulation and the proposed role of anoctamins (Fig. 8) - the lack of evidence for Ca2+-dependent activation of Cl- current. The recording methods used in the fillet preparation should enable direct tests of this important part of the model.

      We have performed an additional experiment at the reviewer’s suggestion. Please see above (in essential revisions) and below (in recommendations for authors).

      2) The behavioral and electrophysiological consequences of knocking down either of the two anoctamins are incomplete (Fig.2), raising the significant question of whether combined knock-down of both anoctamins in the CIII neurons would largely eliminate the cold-specific responses.

      While the results of this experiment would certainly be interesting, we are unsure of how it would be acutely informative in this context and are not convinced that any possible outcomes would disprove any particular hypothesis. In part, this is because we know that blocking synaptic transmission in CIII neurons (via tetanus toxin) does not completely ablate cold-evoked behavior (Turner & Armengol et al 2016 https://doi.org/10.1016/j.cub.2016.09.038). This is also the case for combinatorial mutation of other genes associated with cold nociception (please see Turner & Armengol et al 2016; and more recently, Patel et al 2022 https://doi.org/10.3389/fnmol.2022.942548). Further, the husbandry required to generate the double knockdowns would be quite challenging and might result in GAL4 titration (hypothetically less strongly knocking down each gene). For these reasons, we have not performed this suggested experiment.

      3) Blind procedures were not used to minimize unconscious bias in the analyses of video-recorded behavior, although some of the analyses were partially automated.

      This is correct and a relative weakness of the study. We note it in our methods section. The use of semi-automated data analyses of the behavioral videos is designed to minimize experimenter-specific variability.

      4) The term "hypersensitization" is confusing. Pain physiologists typically use "sensitization" when behavioral or neural responses are increased from normal. In the case of increased neuronal sensitivity, if the mechanism involves an increase in responsiveness to depolarizing inputs or an increased probability of spontaneous discharge, the term "hyperexcitability" is appropriate. Hypersensitization connotes an extreme sensitization state compared to a known normal sensitization state (which already signifies increased sensitivity). In contrast, the effects of ncc69 overexpression in this manuscript are best described simply as sensitization (increased reflexive and neuronal sensitivity to cooling) and hyperexcitability (expressed as increased spontaneous activity at room temperature).

      We have modified the text in accordance with the reviewer’s suggestions (see recommendations for authors section). We have also changed the title of the paper to “Chloride-dependent mechanisms of multimodal sensory discrimination and nociceptive sensitization in Drosophila”

    1. It bears mention that Vannevar’s influential essay “As We May Think” in the July 1945 issue of The Atlantic is entirely underpinned by the commonplace book and zettelkasten traditions pervading Western thought and culture. Rather than acknowledge this tradition tacitly, he creates the neologism “Memex” which stands in for a networked and connected zettelkasten

      This is an interesting observation. Also because Memex went on to inspire e.g. Doug Engelbart. Was Engelbart aware of the history when he demo'd outlining and notes? Was Nelson when he thought up stretchtext in 67?

    1. Reviewer #3 (Public Review):

      This paper aimed to understand how toxin-antidote (TA) elements are spread and maintained in species, especially in species where outcrossing is infrequent and the selfish gene drive of TA elements is limited. The paper focuses on the possible fitness costs and benefits of the peel-1/zeel-1 element in the nematode C. elegans. A combination of mathematical modeling and experimental tests of fitness are presented. The authors make a surprising finding: the toxin gene peel-1 provides a fitness advantage to the host. This is a very interesting finding that challenges how we think about selfish genetic elements, demonstrating that they may not be wholly "selfish" in order to spread in a population.

      Strengths<br /> 1. The authors support results found with a zeel-1 peel-1 introgressed strain by using CRISPR/Cas9 genetic engineering to precise knock-out the genes of interest. They were careful to ensure the loss-of-function of these generated alleles by using genetic crosses.

      2. Similarly, the authors are careful with controls, ensuring that genetic markers used in the fitness assays did not affect the fitness of the strain. This ensures that the genes of interest are causative for any source of fitness differences between strains, therefore making the data reliable and easily interpretable.

      3. A powerful assay for directly measuring the relative fitness of two strains is used.

      4. The authors support relative fitness data with direct measurements of fitness proximal traits such as body size (a proxy for growth rate) and fecundity, providing further support for the conclusion that peel-1 increases fitness.

      Weaknesses<br /> 1. One major conclusion is that peel-1 increases fitness independent of zeel-1, but this claim is not well supported by the data. The data presented show that the presence of zeel-1 does not provide a fitness benefit to a peel-1(null) worm. But the experiment does not test whether zeel-1 is required for the increased fitness conferred by the presence of peel-1. Ideally, one would test whether a zeel-1(null);peel-1(+) strain is as fit as a zeel-1(+);peel-1(+) strain, but this experiment may be infeasible since a zeel-1(null);peel-1(+) strain is inviable.

      2. The CRISPR-generated peel-1 allele in the N2 background only accounts for 32% of the fitness difference of the introgressed strain. Thus, the effect of peel-1 alone on fitness appears to be rather small. Additionally, this effect of peel-1 shows only weak statistical significance (and see point 5 below). Given that this is the key experiment in the paper, the major conclusion of the paper that the presence of peel-1 provides a fitness benefit is supported only weakly. For example, it is possible that other mutations caused by off-target effects of CRISPR in this strain may contribute to its decreased fitness. It would be valuable to point out the caveats to this conclusion, or back it up more strongly with additional experiments such as rescuing the peel-1(null) fitness defect with a wild-type peel-1 allele or determining if the introduction of wild-type peel-1 into the introgressed strain is sufficient to confer a fitness benefit.

      3. The strain that introgresses the zeel-1 peel-1 region from CB4856 into the N2 background was made by a different lab. Given that N2 strains from different labs can vary considerably, it is unclear whether this introgressed strain is indeed isogenic to the N2 strain it is competing against, or whether other background mutations outside the introgressed region may contribute to the observed fitness differences.

      4. Though the CRISPR-generated null allele of peel-1 only accounts for 32% of the fitness difference of the zeel-1 peel-1 introgressed strain, these two strains have very similar fecundity and growth rates. Thus, it is unclear why this mutant does not more fully account for the fitness differences.

      5. Improper statistical tests are used. All comparisons use a t-test, but this test is inappropriate when multiple comparisons are made. Importantly, correction for multiple comparisons may decrease the already weak statistical significance of the fitness costs of the peel-1 CRISPR allele (Fig 3E), which is the key result in the paper.

      6. N2 fecundity and growth rate measurements from Fig 2B&C are reused in Fig 3C&D. This should be explicitly stated. It should also be stated whether all three strains (N2, the zeel-1 peel-1 introgressed strain, and the peel-1 CRISPR mutant) were assayed in parallel as they should be. If so, a statistical test that corrects for multiple comparisons should also be used.

      7. It appears that the same data for the controls for the fitness experiments (i.e. N2 vs. marker & N2 vs. introgressed npr-1; glb-5) may be reused in Fig 2A and 3E. If so, this should be stated. It should also be stated whether all the experiments in these panels were performed in parallel. If so, this may affect the statistical significance when correcting for multiple comparisons.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)): The authors develop a previously identified lead compound for the blocking of malaria transmission from humans to mosquitoes further and identified a protein target of the chemical. The protein target, Pfs16 is long known to be upregulated in gametocytes and has been speculated to be a target for small molecules. The work is well (if at time maybe too well/too detailed) described and potential shortfalls are highlighted.

      My major comment is that without a deletion mutation of Pfs16, the paper will remain somewhat preliminary. I would strongly encourage the authors to generate such a mutant and compare it to the parasites treated with their drug candidate. I feel the text can be much shortened and a lot of information moved to the materials and methods. The conclusions should be toned down on several occasions (abstract, introduction, discussion). Avoid adjectives, e.g. what is a 'powerful starting point' (abstract) or 'compelling interdisciplinary evidence' but hot air?

      We thank the reviewer for this comment. However, we would like to reiterate (as stated in the manuscript) that knockout of Pfs16 in P. falciparum is transmission lethal, i.e. you do not get progression of male gametogenesis. Thus, whilst re-generation of a Pfs16 KO would be interesting in terms of comparing phenotypically with the drug treated parasites, we are not convinced it would add any further evidence of support for or against our conclusion in terms of the ability of the N-4HCS scaffold to target this protein. E.g. we could drug treat a Pfs16 KO but this would not be expected to show gametogenesis irrespective of treatment. Therefore, whilst of academic interest, we believe it is satisfactory to judge our phenotypic work based on published accounts of the Pfs16 KO without having to engage in the costly experiments to regenerate the parasite and work on it side-by-side, especially given the limited resolution it would give towards the overall goal of the work in terms of defining the effect and likely target of this drug class on parasites.

      Addressing the second comment, we are happy to alter areas of the paper that may have over-stated the conclusions of the work including the abstract/introduction and discussion.

      CROSS-CONSULTATION COMMENTS I think these three reviews are pretty much in line with their overall assessment. I am happy if send as is to authors as it will help them shape a much better paper

      Reviewer #1 (Significance (Required)):

      The paper shows that very likely a new chemical with some potential for transmission inhibition of malaria parasites for mosquitoes binds to a Plasmodium protein that is specifically expressed in the sexual stages of the parasite.

      The paper compares to good papers published in journals like ACS Infectious Diseases or Antimicrobial Agents and Chemotherapy, but I am not sure which of the Review Commons sister journals it would fit to. I am a molecular parasitologist.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): Transmission blocking drugs are of high interest as a strategy to combat malaria but they are difficult to study. For instance it is problematic to raise resistant parasites to find mode of action of transmission blocking drugs and to identify their targets in the cell. In this manuscript Yahiya et al. build on previous work which identified the N-4HCS scaffold, of which DDD01035881 is the lead compound, as an inhibitor P. falciparum male gametocytes. Using PAL to enrich for target proteins Pfs16 was identified and validated as a possible target of DDD01035881. Binding was validated through CESTA. Determination of the phenotype following DDD01035881 treatment was found to partially match the previously published Pfs16 KO phenotype. However curiously no impact was seen in gametocytogenesis despite published evidence of Pfs16 being involved in sexual conversion. The authors speculate as to reasons but a direct experimental comparison with Pfs16 mutant parasites (which likely would have been revealing) is not provided. On the positive side, this analysis of the stage-specific effect of the drug pinpoint the stage inhibited during microgamete development which is a very interesting part of the manuscript.

      We thank the reviewer for this positive assessment of our work. Mirroring comments above, our challenge with Pfs16 knockout or mutation is that if we ablate Pfs16 function we cannot assess the effect of drug action. Definition of a mutant that would demonstrate precisely the drug mode of action would require structural resolution of drug bound to target (i.e. to identify which residues to target) – this is a major goal for our research group moving forwards, but likely many years’ work. In general, our core approach here has been one of chemo-proteomic based methods and phenotypic investigation of the novel antimalarial. Further evidence might be forthcoming from molecular genetics/structural biology, but we believe these are beyond the scope of the current work (and our available resources at present). We state future directions in the discussion and can add more to this in any revised manuscript.

      This work deepens the understanding of a novel class of transmission blocking drugs with reasonable potency (foremost (-)-DDD01028076, which has low nanomolar activity, the modified versions considerably less). Question on how to achieve serum concentrations for sufficient potency aside, these compounds will in the very least provide experimental tools to study their mode of action and might reveal interesting biology. This work is therefore of interest to the malaria field.

      The experimental methodology seems excellent but some of the results raise questions that make definite conclusions difficult and this should be addressed. Overall, this is very solid work but leaves some doubts whether Pfs16 is indeed the (only) target of this class of compounds.

      Major comments: 1. The reasons for excluding Etramp10.3 are not convincing. In fact it could be argued it is nearly as good a candidate as Pfs16. Contrary to the author's statements in the results section, etramp10.3 transcription is highly upregulated in gametocytes (see e.g. PMID: 22129310) with a generally very low transcription in asexual stages. It is argued that Etramp10.3 is essential in blood stages because MacKellar et al failed to disrupt the gene and because the PiggyBAC screen predicted it to be essential. However, if this is an argument for exclusion then this would also apply to Pfs16 which is also predicted by the PiggyBAC screen to be essential (likely both are non-essential in blood stages as they are barely expressed but Pfs16 and Etramp10.3 might by chance have not received an insertion in the PiggyBAC screen due to their very small size which may also explain failure of disrupting integration in MacKellar). Given the finding that the drug binds Pfs16 only in late gams it might also be argued that an essential function in asexuals might not be affected if they behave similarly to young gams and hence this criterion is not valid anyway.

      Further following this line of thought that ETRAMP10.3 could be a hit equivalent to Pfs16, Figure 2D shows a band below the band considered to be Pfs16. It would not be all surprising if this were ETRAMP10.3 (the size would fit).

      We don’t disagree with reviewer 2’s comments that ETRAMP10.3 could be an additional target. Although not traditionally related there is some similarity between these proteins and it may be that at the macroscopic level there is a structural homology between them. As stated elsewhere we are happy to tone down the assertion that Pfs16 is the only drug target candidate, leaving open the possibility of future follow up work that may yet reveal additional targets. This cannot be explored much further without extensive experimentation, which is beyond our current capacity. Given the strong phenotypic effect on gametocytes, whilst ETRAMP may be upregulated, this paper naturally focused its core attention on Pfs16 as a candidate target. We certainly subscribe to the view that absence of evidence is not evidence of absence.

      Both, Pfs16 and ETRAMP10.3 can be expected to be very abundant proteins in the parasite periphery in gams. Can the authors exclude that these simply are the first to encounter the N-4HCS photoaffinity probe and that this may have led to their enrichment in the target identification experiments. The biochemical data argues for a specific interaction with Pfs16, but by itself is not that strong. Given the discrepancies of the phenotype with the Pfs16 disruption and the peculiar finding that the drug binds Pfs16 only in later stage gametocytes, it might be a good idea to further caution the conclusion of Pfs16 as the inhibited target.

      We don’t necessarily agree that the evidence is not strong (three methods pointing to the same target is by many accounts solid evidence). Additionally, whilst it is true that the N-4HCS photoaffinity probes likely interact with PVM proteins in first instance, it is also worth noting that this doesn’t necessarily deduct from their likelihood to be true targets, but instead fits with the N-4HCS phenotype. We observe the compounds to inhibit microgametogenesis without any prior incubation and to retain this activity even beyond activation of microgametogenesis, specifically during the window in which the PVM remains associated with the parasite. Our phenotypic observations therefore fit with the notion that the molecules target proteins that lie within the PVM and interact with the molecules at first instance. Whilst we understand the concern that PVM proteins may be likely to be enriched given their abundance and localisation, we believe this to support our phenotypic findings.

      The phenocopy evidence of the NH compounds with the Pfs16 disruption is based on comparison with published evidence. It would have been much preferred to have a side-by-side comparison with the (or an) actual Pfs16 disruption parasite line. Although the authors stress that the phenotype with DD01035881 fits the phenotype of the targeted gene disruption in the results, this only partially matches the cited publication (PMID: 14698439) which concludes there is an effect on the number of gametocytes produced. The exflagellation phenotype in that publication was classified as preliminary. Although this is discussed, the main results text should be adapted to reflect this and the conclusion that Pfs16 may be the target should be further cautioned.

      As stated, we are happy to tone down conclusions in this direction. We also note comments above about Pfs16 disruption.

      Minor comments: 4. From the modifications of the compounds it seems the chemical space for further modification to achieve higher potency is limited with this scaffold. Maybe the authors can comment whether they envisage this to be a potential obstacle.

      The modification space of the compounds is explored extensively in previous work from our group, which we feel more than adequately addresses this question. See Rueda-Zubiaurre et al (2020) J Med Chem.

      Line 67: references are superscript.

      We can change this

      Line 77: I would recommend replacing 'quiescence' here, a cell that matures is not quiescent.

      We can change this

      Line 116: consider removing 'interdisciplinary'.

      We can change this

      Line 120: I would caution here (see major comments) and recommend a less definite proclamation of Pfs16 as a promising new drug target

      We can change this along with the general “tone” of the manuscript.

      Page 7: compounds 9 is still considered active ("retained micromolar activity"), but in Table 1 this is given as >1000nM. Please add the actual IC50.

      We can add this to the final version. The actual IC50 for this compound was 1.7uM. For the SAR study we grouped compounds with IC50 >1uM into discrete groups based on rough IC50 (>1uM, >10uM etc.) hence this fell in the intermediate group.

      Line 138- 173: The order in which this is discussed makes it unclear that the work described was done prior to, and guided, the synthesis of compound 1 and probe 2

      This can be addressed in a revised manuscript.

      Line 194: was the data deposited in a database?

      The proteomics data has not been deposited in a database but is accessible in the extended SI.

      Line 202: introduction as to the benefits of using a competition + probe condition here could aid reader understanding. The interpretation of this data is complicated by the covalent and reversible binding of the two compounds and the weight of this control is therefore difficult to gage.

      We can embellish the description here.

      Table 2 and Extended Data Table 1 show different p values and enrichments for the same hits. This is confusing. It would also be useful to label the hits in the scatter plots in Figure 2 for easy identification and comparison to the tables.

      We can amend this and label each hit within the scatter plot.

      Line 215-218, please correct the data on Etramp10.3 (see major points) and put in perspective to Pfs16 (Etramp10.3 is similarly upregulated in gams where it is highly expressed; PiggyBAC predicts essentiality for Pfs16 and Etramp10.3 in blood stages).

      We can discuss this to a limited extent for future exploration of Etramp10.3.

      Line 221: the results from the PiggyBAC screen are stated as fact, but what the screen provides is a prediction of the probability of importance for parasite growth. I would replace 'is' with 'is predicted' (even though in the case of Rab1b it seems likely the prediction is correct).

      We can change this

      Line 233 and elsewhere: define 'reversibility' (binding? activity?).

      We can change this

      Line 240: clarify what is in the cited paper (see major points).

      We can clarify this

      Line 297: We utilised in-lysate...... clunky sentence, please rephrase.

      We can change this

      Line 325: reference is missing the year.

      We can change this

      Line 343: It is utterly puzzling that binding is specific to Pfs16 in mature gametocytes and I do not find the explanation in the discussion convincing (see point 28 below). Do the authors have another explanation? Could Pfs16 be modified in later gams (or vice versa)?

      We believe that Pfs16 is functionally different at different stages of gametocyte development, this is either in terms of its presentation (e.g. perhaps due to complex formation, though this remains elusive) or the functionality of different domains, as per the effect of different truncation mutants. We can address some of these concerns in a revised manuscript.

      Line 388: Justification seems odd as a PV protein would be unlikely to directly impact DNA replication. Please rephrase the sentence.

      We can change this

      Line 405: remove the 'to'

      We can change this

      Line 411: it would be useful to the reader to state at what IC-value the drug was used in these experiments.

      We can state this

      Line 431: While the alpha-tubulin staining indicates exflagellation and is similar to the DMSO only control, the staining for the RBC membrane (Glycophorin A) and DNA (DAPI) appear different, yet this is ignored. One interpretation of this could be that while late treatment doesn't block exflagellation, it still impacts other aspects of microgamete development.

      We can make mention of this

      Line 436: IFA work was done with drug treatment post activation while EM was done post activation but drug treatment prior to activation. Is there a reason for this?

      The reviewer is astute to point this out. Limitations with access to the EM facility meant that whilst IFAs were completed for pre-activation treated samples, the post-activation EM became impossible as the EM facility closed during the COVID lockdown. Thus, we do not have a complete set here. However, we do not feel this takes away from the EM observations presented. We can clarify this incompleteness in the revised manuscript.

      Line 450: is this really CytB, or was it CytD?

      We did indeed used Cytochalasin B here, which whilst less potent than D does still target microfilament formation.

      Line 465: Pfs16 localised to vesicles: there is no data showing the dots in the micrograph are vesicles, please rephrase.

      We can change this

      Page 19 and 20, discussion on stage-specific differences of Pfs16 during gametocytogenesis to explain the difference in binding: without experimental data using H-4HCS in the parasites of the publication cited to explain this (PMID: 21498641), this is very speculative. The cited work used episomal expression of Pfs16 tagged with fluorescent proteins. This would be the first integral PVM protein that is actually inserted into the PV membrane when tagged in that way (usually this results in a PV location), casting some doubt on the findings in that paper. All in all the provided explanation is not very convincing.

      We can attempt to clarify this in a revised discussion.

      Line 519: if with the conserved part the N-terminus is meant, then this has for other PVM proteins already been shown to be PVM internal, not facing the erythrocyte (show in very early work; PMID: 1852170 but also multiple times after that).

      We can clarify this

      Line 534: consider replacing 'highly plausible' with something more cautious.

      We can change this

      Line 550: Given this discussion how stable are N- 4HCS compounds?

      We can clarify this.

      Table 1: Having all chemical structures in same orientation would be nicer visually. I assume blue indicates modification but this is not stated.

      We can change this

      Figure 1: Please use different colours or symbols. The dark green crosses and the blue Pfs16 cross are hard to distinguish.

      We can change this

      Figure 3d: Unclear as to why a difference temperature range is displayed here.

      We can clarify this

      Figure 3e: Unclear % Inhibition compared to what.

      We can clarify this

      Figure 5G: What is the white arrow pointing to?

      We can clarify this

      Figure 5j: Given how the explanation is written this would make more sense between current image 5G and 5J.

      We are not sure what the comment relates to here but we can endeavour to clarify this

      Figure 6: Erythrocyte membrane colour not stated in legend.

      We can change this

      Figure 6A: were the exposure times similar? How can so little be left after ~4-5.5 minutes but at later time points there seems to be much more Pfs16 signal left? Maybe amount of signal should be taken into consideration to establish the fate of Pfs16 in the process.

      We can endeavour to clarify this

      Figure 6B: is the second phenotype (successful but aberrant egress) shown? The only image where WGA is not circular around the parasite is an exact match of Pfs16 which is in dots (image at 7.5-8.5 minutes). The imaging data for this phenotype should be presented more clearly.

      We can attempt to clarify this

      Reviewer #2 (Significance (Required)):

      Nature and significance: a lot of weight has been placed on transmission blocking drugs although there are also a number of problems associated with them (ethics for testing and use etc; drugs acting on asexuals and transmission stages alike might be even more useful). Transmission blocking drugs are difficult to study and this work is therefore important. The experiments are well done, but the conclusions are not fully convincing, leaving some doubts in regard to Pfs16 being the actual target of the class of drugs studied.

      Compare to existing published evidence: it is a logic continuation of previous work and this is appropriately highlighted in the manuscript.

      Audience: medium interest for malaria researchers; high interest for researchers working on transmission blocking drugs and those studying microgametes.

      Your expertise: malaria, P. falciparum, biology of apicomplexans

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): The manuscript by Yahiya et al describes an extensive investigation of the mode of action of DDD01028076, which specifically inhibits microgametogenesis in Plasmodium falciparum. The phenotypic characterisation of the MOA uses some very nice imaging to demonstrate the point at which this compound inhibits microgametogenesis. The authors have also attempted to identify the molecular target using chemoproteomics and label-free CETSA techniques. The photoaffinity labelling and pull-down approach suggested the Pfs16 may be preferentially enriched by a PAL probe that is representative of this series. However, the data supporting the validation of this target is not very conclusive, and in some cases argues against Pfs16 being a specific target of DDD01028076. Whilst the presented data makes a significant contribution to the literature regarding a novel drug candidate that targets microgametogenesis, it does not support the author's claims that Pfs16 is the target.

      Major Concerns: The strongest evidence for Pfs16 being the target comes from the chemoproteomics pull-down study that found Pfs16 to be the most significantly enriched protein by compound 2 vs DMSO. However, this should be interpreted with caution as it is based on only 3 replicates and omics studies are prone to false-positives. That only 125 proteins were detected also raises questions about the coverage of the proteomics, it is quite possible that the actual target is not detectable using this method, and the Pfs16 appears because it is one of the more abundant proteins during this stage of the lifecycle.

      As discussed, we are happy to tone down the conclusions about Pfs16 being an exclusive target for the N-4HCS drug class, however, we feel the reviewer is being unnecessarily negative. There are myriad papers in the literature based on singular proteomics experiments (given their cost, complexity and time -consuming nature) that then facilitate downstream experiments that support findings. We have endeavoured to be as thorough as we could in the work and believe, like others, three replicates of a massive experimental pipeline should be sufficient to make a defined conclusion – whether the additional downstream evidence we have then leaned on is supportive of this (as we judge it to be) is another matter. We agree, proteomics often suffers with low protein abundance. The complexity of growing large quantities of gametocytes is familiar to anyone who has struggled to grow these finicky parasites at a larger scale than 10-25mL dishes. Given the scales we have reached, we believe these might in fact be some of the most comprehensive proteomics studies to date!

      Somewhat concerningly, the control with 1 as the competitor did not show significant enrichment of Pfs16, although a trend was observed. More concerning, was the lack of enrichment when using DDD01028076 as the competitor. This result essentially proves that Pfs16 is not the specific target (and the argument about reversibility is unlikely since most drugs are reversible binders, but many have worked with this type of approach). It is surprising that DDD01028076 (ideally the (-) form) wasn't used as the competitor for the proteomics study. This compound has ~100-fold better potency than the probe 2, which should provide much better competition that 1. It would also be more specific than 1, which is an important control considering that (-)-DDD01028076 has activity in the low nanomolar range, whereas 2 acts in the micromolar range. Non-specific interactions are an important consideration to exclude, and whilst 1 is structurally similar, it is not very potent and therefore not the best control to find the target associated with activity.

      Whilst we understand the concerns with insignificant enrichment in the competition labelling, we believe the enrichment in the presence of photoaffinity probe 2 over background (i.e. DMSO vs. probe experiments) to be of more value given the design of the experiment. The competition experiments were performed by co-treating gametocytes with photoaffinity probe 2 and parent molecule 1 prior to UV irradiation, to enable irreversible conjugation to protein target(s). However, given that both compounds, probe and parent, theoretically bind to Pfs16 at the PVM in a reversible manner (i.e. losing interaction with even gentle washing), UV irradiation is likely to favour probe-binding irrespective of competition with a marginally more potent parent molecule (in this case, parent molecule 1). This is especially true as treated parasites were very thoroughly washed after irradiation, so should the parent molecule have bound the target protein(s), these drug-target interactions were likely lost during stringent washing. The drug-target interactions with parent molecule 1 wouldn’t have been aided by UV irradiation, as the molecule lacks the functional group required for bioconjugation. So, even if parent molecule-target interactions were more abundant than probe-target interactions, interactions between parent molecule 1 were most likely lost and proteins bound by probe were enriched.

      This would have been true with more potent N-4HCS derivatives such and DDD01028076 and (-)-DDD01028076 (where potency is tested in the DGFA, independent of bioconjugation), and here we opted for a structurally similar compound of similar potency to not skew competition solely based on potency.

      We can embellish on this in the revised manuscript to make our conclusions from this part clear.

      A closer look at the gels in the supplementary data raises many questions that undermine the authors conclusions: - Fig S1a - The lane without probe (2) still identifies Pfs16 (or a protein at that MW) as the most abundant protein. Also, as the Pfs16 band increases, you can see that most other proteins also increase in abundance, so either the loading is inconsistent, or the probe actually causes non-specific enrichment of many proteins. This figure also indicates that the washing protocol is not sufficient to remove non-specific binders. Given the covalent nature of the PAL approach I would think a very thorough washing protocol could be employed.

      It is certainly the case that Pfs16 is abundant in gametocytes, a reason behind its early discovery. Thus it is challenging to remove it from background. We still believe the enrichment to be specific, highlighting the comparative work with Pfg377 in Figure 2. Further repetitions with more stringent washing might resolve the background, however, this is beyond our current resources to repeat.

      -- Running another negative control in the proteomics using one of the inactive controls from table 1 might help to disambiguate specificity.

      We don’t disagree with this though this would involve an entire re-running of the experimental workflow which is not possible.

      • Fig S2a - The anti-Pfs16 Western blots show that this protein is actually enriched more in the flow-through than the eluates. This shows that this protein is not specifically enriched by the PAL-CuAAC pull-down, it is just more abundant in the treated samples.

      Again, the presence of Pfs16 in the flow-through is unsurprising, given its abundance in stage V gametocytes. The relative abundance in the eluate is not an indication that the binding and subsequent enrichment is not specific, rather this shows the compound does not necessarily bind each and every protein – which is not unexpected. The crucial conclusion to be drawn here is the concentration-dependent enrichment of Pfs16 in the eluate in the presence of probe.

      • Fig S2b - The darkest Pfs16 spot is actually the sample with no UV treatment. This is a negative control, so should not enrich the target protein. This sample also has significant signal in replicates A and C.

      As we have noted above, it is not unsurprising that modification of the N-4HCS scaffold to yield this probe may introduce a level of irradiation-independent binding, which explains the presence of signal in the UV-independent sample.

      • Fig S2c - This blot is very messy and difficult to read, but in general the Pfs16 spots in the IGF don't correlate with the intensities in the anti-Pfs16 western.

      These experiments are extremely challenging (something that is perhaps beyond the expertise of the reviewer) and what is presented is the result of substantial optimisation. Loss of AzTB fluorescence in the gel which is subsequently analysed by western blot explains this.

      • Fig S2 - This data, and the main figures based on this data, generally don't support the hypothesis that Pfs16 is the specific target. The controls are not as would be expected, and there are no loading controls. Looking at the flow-throughs suggests that there was just more Pfs16 (and possibly total protein) in the treated samples before the enrichment step. The Pfg377 also appears quite variable in the different samples, with replicates B and C not consistent with A.

      We do not concur with the reviewer here and their dismissal of what was extremely thorough and well-executed experimtns. These are not like traditional western blots and require substantial optimisation. We refer them to our previous point in reference to the UV controls. With regards to the Pfg377 variability, the experiment itself is inherently variable with such large volumes of parasites. In many cases, for example, the male:female ratio within a mature gametocyte culture can vary and this can contribute to the variability in 377 abundance between replicates.

      The other major concern is with the CETSA analysis, which appears to show very minor stabilisation of Pfs16, but the specificity of this target is questionable, and the data has the following inconsistencies. - The supplementary data only shows n=1, yet there are error bars in the main figures. Where did these come from?

      The individual western blot replicates can be provided in a revised manuscript if judged important.

      • The samples with apparent destabilisation are all near the edge of large western blots, which often doesn't run straight and has no loading controls. We need to see the loading controls.

      Given all proteins within a lysate will aggregate with thermal treatment, antibody loading controls are not feasible with these experiments. Each sample is normalised prior to thermal stabilisation (ensuring the same protein quantity is treated in both DMSO and drug, at each temperature) and any protein that is not aggregated is loaded – the nature of CETSA itself is to compare the stabilisation between DMSO and drug.

      • The melting temperature of Pfs16 is extremely high at around 85 degrees C. Most plasmodium proteins melt at around 50-60 degrees (Dzekian et al, 2019). Even the cited work on membrane proteins didn't go to those temperatures (Kawatkar et al, 2019) Can this high temperature be explained, and has the CETSA approach been validated at such high temperatures where additional physical and chemical processes may be occurring in the sample?

      We agree that this temperature of stabilisation is unusually high and may require further biochemical validation. Without further investigation we cannot say definitively why the melting temperature of Pfs16 is so high, but suspect its size and membrane localisation may play a role.

      • The lack of difference between + and - isomers suggests that the very small stabilisation observed here is not specific to drug activity, but is more likely a non-specific binding effect. Additional negative control compounds might help here, but the + isomer is probably the best negative control (albeit the concentrations were not ideal in the presented data).

      Please we have already addressed this in the text – refer to line 312 and beyond.

      • The very high concentration (100uM) increases the chances of non-specific effects being observed here (especially since the authors claim to see stabilisation at about 10nM). The study should be repeated at lower concentrations (with negative controls) in order to confirm a specific binding effect.

      Whilst further replicates with different conditions might be preferable, as discussed extensively here, this would be beyond the scope of what we are able to achieve for a revision.

      • The concentration-ranging study was performed at 78.4 degrees, at which temperature very little denaturation of Pfs16 occurs fig S4a (and Fig 3b-c). Therefore, you would not expect to see any drug-induced stabilisation, and it is not plausible that significant stabilisation could occur at this temperature. Therefore, the apparent destabilisation at sub-10nM drug concentrations is highly questionable.

      We would have to agree to disagree on this point.

      • Stabilisation of Pfs16 did not occur in lysates from younger gametocytes (fig s4g-h), but this is a biophysical assay, so regardless of the function of this protein at different stages, the biophysical interaction between the drug and the protein should be the same regardless of the source of the protein. This data argues against Pfs16 being a specific binding target of Pfs16.

      We don’t agree with this statement, since the drug is binding the protein in native lysate – this may be a multi-meric complex (homo or hetero) which only exists at certain stages. As such we disagree with the reviewer that this argues against Pfs16 being the target.

      In addition to the above concerns, the fact that this compound doesn't inhibit the earlier functions of Pfs16 in gametocytogenesis, and that it doesn't inhibit P. berghei, also argue against this being the specific target of this drug. Whilst the authors have a valid argument that these findings don't exclude the possibility of stage-specific targeting of Pfs16, we could also argue that all the phenotypic data in figures 4-6 is merely correlative of a drug that acts at the same point in the lifecycle as Pfs16.

      We have discussed this in the manuscript and strongly feel the reviewer is being unnecessarily dismissive of a body of work that is coherent. We are happy to tone down the narrative of the paper with Pfs16 being the exclusive target. Structural homology of P. berghei Pfs16 orthologues has never been done but it would not be unprecedented if another target was functionally homologous (an idea we are currently pursuing). Stage specificity is also possible given the nature of Pfs16 (e.g. if it is in a complex). The reviewer appears fixated on a singular entity and unable to imagine a complex scenario where structure or protein-protein interactions might affect drug binding (as it does with other proteins present in complexes, e.g. proteasomal targeting drugs).

      Overall, I believe that significant additional studies would be required to identify the target of this compound. Either by repeating the included studies with additional controls and conditions, or by follow-up studies such as genetic manipulation (knock-down or overexpression) or heterologous expression and biophysical binding studies.

      Alternatively, the manuscript could be restructured as primarily a report on the phenotypic effect of this compound on microgametogenesis, with the target identification work reported as a hypothesis-generating chemoproteomics study that provides some ideas about possible targets, but requires substantial follow-up to confirm the target (which may be beyond the scope of this report?).

      We strongly disagree with this reviewer’s entire dismissal of an extensive body of work. In line with other reviewers comments we accept a need to tone down our conclusions, but do not consent to dropping the majority of the paper in favour of a phenotypic descriptive work.

      MINOR COMMENTS The manuscript is very well-written and presented.

      Several of the conclusions are overstated (as detailed above) and several statements should be tempered based on this data (e.g. statements linking DDD01028076 effects to Pfs16 function).

      We can address the overstatement of conclusions in a revised manuscript.

      I find the term 'crosslinking' confusing for the photo-affinity labelling, as crosslinking in proteomics often refers to crosslinking between proteins (not between protein and drug).

      This is simple to address – to minimise confusion for readers, we can simply state where photoaffinity labelling and bioconjugation were performed (and not refer to the latter as crosslinking).

      The data and terminology around activity (IC50) for compounds in table 1 is a little confusing. Some IC50 values are reported as >1000, while others have precise mean values reported over 1000, and others are >10,000 or >25,000. This is especially confusing where 9 is claimed to have retained activity, but is >1000. If consistent thresholds are not appropriate then perhaps including dose response curves in the supp data might be necessary to explain these?

      We can simply provide the provide IC50s for compounds of greater potency. We are also happy to provide the curves but with such a large body of work already, this might be unnecessary.

      Reviewer #3 (Significance (Required)):

      The work is potentially interesting to Plasmodium biology and drug discovery researchers. The concept of a transmission-blocking drug is quite attractive to this community, so the topic is highly relevant. Keeping in mind that this compound was reported previously, the main novelty is in defining it's window of activity during the microgametogenesis process, and differentiating this from other drugs/compounds that inhibit this process. There is clearly an advance in knowledge presented here.

      If Pfs16 were to be confirmed as the target of this series then I think that this study would have much greater impact and attract interest from a broad audience. However, at this stage I don't see strong evidence for this hypothesis, and some of this data casts significant doubt on the likelihood that Pfs16 is the direct target.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Yahiya et al describes an extensive investigation of the mode of action of DDD01028076, which specifically inhibits microgametogenesis in Plasmodium falciparum. The phenotypic characterisation of the MOA uses some very nice imaging to demonstrate the point at which this compound inhibits microgametogenesis. The authors have also attempted to identify the molecular target using chemoproteomics and label-free CETSA techniques. The photoaffinity labelling and pull-down approach suggested the Pfs16 may be preferentially enriched by a PAL probe that is representative of this series. However, the data supporting the validation of this target is not very conclusive, and in some cases argues against Pfs16 being a specific target of DDD01028076. Whilst the presented data makes a significant contribution to the literature regarding a novel drug candidate that targets microgametogenesis, it does not support the author's claims that Pfs16 is the target.

      Major Concerns:

      The strongest evidence for Pfs16 being the target comes from the chemoproteomics pull-down study that found Pfs16 to be the most significantly enriched protein by compound 2 vs DMSO. However, this should be interpreted with caution as it is based on only 3 replicates and omics studies are prone to false-positives. That only 125 proteins were detected also raises questions about the coverage of the proteomics, it is quite possible that the actual target is not detectable using this method, and the Pfs16 appears because it is one of the more abundant proteins during this stage of the lifecycle.

      Somewhat concerningly, the control with 1 as the competitor did not show significant enrichment of Pfs16, although a trend was observed. More concerning, was the lack of enrichment when using DDD01028076 as the competitor. This result essentially proves that Pfs16 is not the specific target (and the argument about reversibility is unlikely since most drugs are reversible binders, but many have worked with this type of approach). It is surprising that DDD01028076 (ideally the (-) form) wasn't used as the competitor for the proteomics study. This compound has ~100-fold better potency than the probe 2, which should provide much better competition that 1. It would also be more specific than 1, which is an important control considering that (-)-DDD01028076 has activity in the low nanomolar range, whereas 2 acts in the micromolar range. Non-specific interactions are an important consideration to exclude, and whilst 1 is structurally similar, it is not very potent and therefore not the best control to find the target associated with activity.

      A closer look at the gels in the supplementary data raises many questions that undermine the authors conclusions:

      • Fig S1a - The lane without probe (2) still identifies Pfs16 (or a protein at that MW) as the most abundant protein. Also, as the Pfs16 band increases, you can see that most other proteins also increase in abundance, so either the loading is inconsistent, or the probe actually causes non-specific enrichment of many proteins. This figure also indicates that the washing protocol is not sufficient to remove non-specific binders. Given the covalent nature of the PAL approach I would think a very thorough washing protocol could be employed. -- Running another negative control in the proteomics using one of the inactive controls from table 1 might help to disambiguate specificity.
      • Fig S2a - The anti-Pfs16 Western blots show that this protein is actually enriched more in the flow-through than the eluates. This shows that this protein is not specifically enriched by the PAL-CuAAC pull-down, it is just more abundant in the treated samples.
      • Fig S2b - The darkest Pfs16 spot is actually the sample with no UV treatment. This is a negative control, so should not enrich the target protein. This sample also has significant signal in replicates A and C.
      • Fig S2c - This blot is very messy and difficult to read, but in general the Pfs16 spots in the IGF don't correlate with the intensities in the anti-Pfs16 western.
      • Fig S2 - This data, and the main figures based on this data, generally don't support the hypothesis that Pfs16 is the specific target. The controls are not as would be expected, and there are no loading controls. Looking at the flow-throughs suggests that there was just more Pfs16 (and possibly total protein) in the treated samples before the enrichment step. The Pfg377 also appears quite variable in the different samples, with replicates B and C not consistent with A.

      The other major concern is with the CETSA analysis, which appears to show very minor stabilisation of Pfs16, but the specificity of this target is questionable, and the data has the following inconsistencies.

      • The supplementary data only shows n=1, yet there are error bars in the main figures. Where did these come from?
      • The samples with apparent destabilisation are all near the edge of large western blots, which often doesn't run straight and has no loading controls. We need to see the loading controls.
      • The melting temperature of Pfs16 is extremely high at around 85 degrees C. Most plasmodium proteins melt at around 50-60 degrees (Dzekian et al, 2019). Even the cited work on membrane proteins didn't go to those temperatures (Kawatkar et al, 2019) Can this high temperature be explained, and has the CETSA approach been validated at such high temperatures where additional physical and chemical processes may be occurring in the sample?
      • The lack of difference between + and - isomers suggests that the very small stabilisation observed here is not specific to drug activity, but is more likely a non-specific binding effect. Additional negative control compounds might help here, but the + isomer is probably the best negative control (albeit the concentrations were not ideal in the presented data).
      • The very high concentration (100uM) increases the chances of non-specific effects being observed here (especially since the authors claim to see stabilisation at about 10nM). The study should be repeated at lower concentrations (with negative controls) in order to confirm a specific binding effect.
      • The concentration-ranging study was performed at 78.4 degrees, at which temperature very little denaturation of Pfs16 occurs fig S4a (and Fig 3b-c). Therefore, you would not expect to see any drug-induced stabilisation, and it is not plausible that significant stabilisation could occur at this temperature. Therefore, the apparent destabilisation at sub-10nM drug concentrations is highly questionable.
      • Stabilisation of Pfs16 did not occur in lysates from younger gametocytes (fig s4g-h), but this is a biophysical assay, so regardless of the function of this protein at different stages, the biophysical interaction between the drug and the protein should be the same regardless of the source of the protein. This data argues against Pfs16 being a specific binding target of Pfs16.

      In addition to the above concerns, the fact that this compound doesn't inhibit the earlier functions of Pfs16 in gametocytogenesis, and that it doesn't inhibit P. berghei, also argue against this being the specific target of this drug. Whilst the authors have a valid argument that these findings don't exclude the possibility of stage-specific targeting of Pfs16, we could also argue that all the phenotypic data in figures 4-6 is merely correlative of a drug that acts at the same point in the lifecycle as Pfs16.

      Overall, I believe that significant additional studies would be required to identify the target of this compound. Either by repeating the included studies with additional controls and conditions, or by follow-up studies such as genetic manipulation (knock-down or overexpression) or heterologous expression and biophysical binding studies. Alternatively, the manuscript could be restructured as primarily a report on the phenotypic effect of this compound on microgametogenesis, with the target identification work reported as a hypothesis-generating chemoproteomics study that provides some ideas about possible targets, but requires substantial follow-up to confirm the target (which may be beyond the scope of this report?).

      Minor comments

      The manuscript is very well-written and presented.

      Several of the conclusions are overstated (as detailed above) and several statements should be tempered based on this data (e.g. statements linking DDD01028076 effects to Pfs16 function).

      I find the term 'crosslinking' confusing for the photo-affinity labelling, as crosslinking in proteomics often refers to crosslinking between proteins (not between protein and drug).

      The data and terminology around activity (IC50) for compounds in table 1 is a little confusing. Some IC50 values are reported as >1000, while others have precise mean values reported over 1000, and others are >10,000 or >25,000. This is especially confusing where 9 is claimed to have retained activity, but is >1000. If consistent thresholds are not appropriate then perhaps including dose response curves in the supp data might be necessary to explain these?

      Significance

      The work is potentially interesting to Plasmodium biology and drug discovery researchers. The concept of a transmission-blocking drug is quite attractive to this community, so the topic is highly relevant. Keeping in mind that this compound was reported previously, the main novelty is in defining it's window of activity during the microgametogenesis process, and differentiating this from other drugs/compounds that inhibit this process. There is clearly an advance in knowledge presented here.

      If Pfs16 were to be confirmed as the target of this series then I think that this study would have much greater impact and attract interest from a broad audience. However, at this stage I don't see strong evidence for this hypothesis, and some of this data casts significant doubt on the likelihood that Pfs16 is the direct target.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: * Saha et al. characterize Drosophila egg chambers that are mutant for cup and identify an increase in the number of a specialized type of follicle cells, the border cells. They demonstrate that this increase correlates with an expanded domain of STAT activity and reduced Notch signaling in anterior follicle cells. Determining that cup is required in the germline cells, the authors postulate and provide some evidence that cup mutants prevent germline Delta from properly signaling to follicle cells. In line with this, they also show that blocking endocytosis phenocopies some aspects of cup mutants, particularly border cell numbers and Delta levels, which they monitor cytoplasmically and at the cell surface. Lastly, they demonstrate that activation of Rab11 can rescue Delta levels and border cell number in cup mutants. They conclude that a key function of Cup in the germline is to traffic Delta to signal to follicle cells, and that the endocytic processing of Delta is required for its function.*

      Major comments:

      • The findings of this study are interesting and novel. The authors have completed a lot of experiments and analyzed the results carefully and in great detail. Experimental design is described adequately and statistical analysis is sufficient. While the main results are largely convincing and support the conclusions, there are some weaknesses that need to be addressed.*

      Response: We thank the reviewer for appreciating our work and we have tried to address concerns of the reviewers to the maximal possible extent with the hope to strengthen our claims further.

      One major concern is that the vast majority of the experiments were conducted with a single homozygous allele for cup. The authors claim this was necessary because other alleles arrest oogenesis, which is understandable, but it leaves the potential problem that the allele, a P-element insertion, may affect other genes, or there may be other unidentified mutations on the mutant chromosome. The authors are able to partially rescue the border cell phenotype with overexpression of Cup and can also mimic the outcome with RNAi in the germline, which helps alleviate some of this concern, but this was only done for one set of experiments (those in figure 1). Similar experiments need to be included to demonstrate the same outcomes when cut is disrupted by other alleles/methods for at least some of the Notch/Delta analyses since this is key to the paper's conclusions.

      Response____: We acknowledge the concern raised by reviewer and to address it, we evaluated different allelic combination of Cup to rule out issues with background mutation. We evaluated the Delta count, NICD and border cell numbers in a different allelic background of cup8/ cup01355. Satisfyingly we observed similar results like that observed for cup01355/ cup01355 homozygotes. This result is included as (Fig S1E-G)

      In addition, we have specifically downregulated Cup function in the germline employing the RNAi approach and validated the non-cell autonomous effect of Cup function in border cell fate specification. This result is included in (Fig 1M-O)

      A second concern is that some evidence is circumstantial or indirect. Specifically, the authors argue that the effect of Cut is due to trafficking of Delta, but do not consider the possibility that Delta could be more directly regulated or that other factors may be relevant. Border cell specification is rescued by increasing recycling in cup mutants, but this could be due to recycling of more factors besides Delta. To address this more directly, the authors should overexpress Delta in the germline of cut mutants. It is possible the disruption of Delta in cut mutants is due to changes in Delta protein stability/levels, so the experiment may also clarify this issue. If this is the case, it may be that hypomorphic Delta mutants would have a defect on border cell number, which could be examined separately. If Delta levels are low, endocytosis and recycling increases may also rescue cut mutants indirectly, but the conclusion about what Cut regulates may differ.

      Response: As per the suggestion of the reviewer, we did attempt to over express Delta in the germline of cup mutants egg chambers. Unfortunately, we couldn’t record any Delta overexpression as the available vector (UASt- Delta) can drive stable expression only in the somatic cells but not in the germline cells. However, to check out the possibility if Delta was being directly regulated by Cup, we compared the levels of proteins between wild type and Cup mutant egg chambers (Figure 4E-G). Unlike our expectation we didn’t observe any significant differences in the levels of Delta in Cup compared to the control. This kind of supports our belief that Cup may not be directly regulating the levels of Delta in the germline.

      Another concern is that Cup's main role is a confusing since it regulates many things, including cytoskeleton and cytoskeleton is necessary for general health and vesicle trafficking in the egg chamber - how do the authors think Rab11 upregulation is overcoming these defects?

      Response: We appreciate the reviewer for raising this concern as it kind of intrigued us to examine if the overexpression Rab11CA was rescuing the cytoskeleton too. Interestingly, we observed that Rab11CA overexpression restored the actin filament in Cup mutant germline(figure S6H-K). This result is in line with report that Rab11 effector Nuf can modulate actin polymerization (Jian Cao et al.,2008).

      Rab11CA rescues Delta levels almost completely in cut mutants but only partially rescues Notch activation, suggesting there are other problems in these egg chambers that could contribute to the defects. While exploring possible other factors is beyond the scope of this work, the authors may want to acknowledge this issue.

      Response: We do agree with the reviewer that we only observe partial rescue of the NRE GFP with Rab11CA, it suggests that Cup can affect different aspect of egg chamber development independent of Rab11 function.

      Minor comments:

      It would help the presentation of the paper to introduce Notch/Delta signaling during oogenesis in the introduction. More introduction and clarity about the number of polar cells at early stages and their role in the border cell cluster may also be useful to the reader.

      Response: We have modified the introduction to highlight the role of Notch/ Delta signaling in early oogenesis.

      It is notable that the primary phenotype of a change in border cell numbers is quite subtle, often only affecting 1-2 cells, and the variation in different genotypes and experiments is sometimes also that large. The authors do a good job of being careful to count the cells at a specific developmental time and do appropriate statistical tests within an experiments. Still, it difficult to be sure that the effects are due to the gene being manipulated specifically or the genetic background. Related to this, a few issues should be addressed. Notably, at earlier stages, Notch signaling impacts cell division, so some of the phenotypes might be explained by there being more total cells in the domain instead of more signaling. The authors show Cut is in the same domain and pH3 is similar, but they didn’t seem assess overall numbers.

      Response: As per the suggestion of the reviewer, we assessed the total number of follicle cell nuclei in stage 8 egg chambers. This analysis was done each confocal z slide of the egg chamber taking care that each nuclei (DAPI) was counted only once. Satisfyingly we didn’t observe any significant difference in the number of follicle cell nuclei between wild type and cup mutant egg chambers supporting our earlier claims with pH3 and Cut antibody that cell proliferation is not responsible for the excessive border cell fate in Cup mutants. This result in included in (Fig S2O-Q)

      Secondly, for the stat suppression of cut (figure 2L), the authors need to show the stat-/+ control for comparison to make a conclusion about suppression versus additive effects.

      Response: As per the suggestion of the reviewer, we have included the data for statp1681/+ control in figure 2L.

      In addition, prior work (Wang et al 2007) expressed DN Kuz in border cells and did not see a change in specification, unlike what is claimed here. In the experiment in question, the control has lower than normal numbers of border cells and the DN Kuz has a number more typical of the controls in other experiments- so this is a concern that there is something else in the genetic background influencing the numbers. Other controls could help make this case, but ultimately this result is probably not necessary for the main argument. Thus the authors might consider leaving it out the Kuz analysis or perhaps can comment on the discrepancy with prior published results.

      Response: We have removed the data on Kuzbanian and have added data that suggests that Notch activation in the follicle cells downstream of Cup facilitates specification of appropriate number of migratory border cells (Fig 3K-N).

      Can the authors comment on why the volume of the border cell cluster increases more dramatically (>2x) than the number of cells (30% more)? * Does the increase in border cell number change the migratory capacity? That is, do the clusters in cut mutant egg chambers migrate normally while the egg chamber looks okay?*

      Response: We believe that dramatic increase in the volume of the border cell cluster I (>2x) than the number of cells (30% more) is due the loose arrangement of the cells in the border cell cluster. Interestingly, the cup mutant border cell clusters do exhibit migration defect that we are examine as part of separate study.

      Several of the figure legend titles state conclusions that are over interpretations of the data shown:

      - Figure 3 legend is overstated- these experiments do not assay STAT activity, only border cell number, so the title can be simplified to say that.

      Response: We have modified the Figure legend in line with the data presented.

      - For figure 4, both cytoskeleton and Delta are shown to be disrupted in cup mutants, but they are not directly linked, eg, the experiments do not show a change in Delta in cytoskeletal mutants alone. While it is interesting that cup mutants have disrupted cytoskeleton, ultimately this result is not well connected to the main issue of Notch/Delta signaling; in fact, it becomes confusing how anything can be trafficked to the cell surface if there is poor cytoskeletal organization. Since the authors favor the hypothesis that the cytoskeleton is not the key to the border cell specification difference, they may want to move this result out of figure 4.

      __Response: __We have included the data that suggests that cytoskeleton organization is critical for Delta trafficking. Specifically we demonstrate that treatment of egg chambers with Cytochalasin D exhibits accumulation of Delta in the nurse cell cytoplasm (Fig S5D-F).

      - The Figure 5 legend is also overstated- these experiments show that Delta is higher in cup mutants and endocytosis mutants AND that endocytosis (of something) is required in the germline for border cell number- but these results are not linked in this figure. More evidence for this connection does come later in figure 6. * Some figure legends are quite brief and could benefit from a little more detail on what is being shown*.

      __Response: __We have modified the title of the Figure legends with respect to data presented.

      Figure layout could be improved by keeping images consistent sizes and making sure graph text is large enough to read easily. Figures in general could be streamlined by having negative results and less pertinent results in supplemental data.

      Response: We have reorganized the figures and worked on the graph text for easy read.

      Not all papers cited in the text are in the reference list.

      Responses: We have modified the title of the figure legends and cross checked our reference list with the papers mentioned in the main text.

      CROSS-CONSULTATION COMMENTS

      I generally agree with the other reviewers that there are concerns with the precise function of cup in this context, and that some revision is needed, including editing of the writing. In response to reviewer 2, prior published studies only detected Cup in germline, but it is possible that it is expressed in follicle cells at a low level. The mutant clonal experiment in follicle cells that the authors did had no effect on border cells, so that provides some evidence the role is non-autonomous. I agree with reviewer 2's concern that the authors overstate the connection between cup and Delta and border cells based on their data and need a few more experiments to tie things together. I understand reviewer 3's concerns that the experimental effects on border cell numbers are very small and variable- I listed this as a minor concern, though, since this number is mainly being used as a read-out for STAT signaling levels and the data were extensively quantified and statistically tested.

      Reviewer #1 (Significance (Required)):

      My expertise is in cell migration, developmental biology, and Drosophila genetics. This paper will be of broad interest in these fields as it incorporates aspects of each in its characterization of a new regulatory mechanism to induce a motile cell population non-cell-autonomously, which is an exciting finding. Specifically, the work increases our understanding of the intersection between Notch and Jak/STAT signaling, which many researchers study - these were both known to be involved in border cell specification. The study provides more detailed characterization of the signaling and specification process in general, and makes significant advances in understanding how Delta signals are produced and presented from germline cells to receiving cells in the soma. Cut has not been previously implicated in these signaling pathways, so that is also novel, although its precise mechanistic role here is still somewhat unclear.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Saha et al. made a detailed description of the role of the mRNA binding protein Cup in specifying the number of Border Cells (BC) during Drosophila melanogaster oogenesis. First of all, they show that females homozygote for a hypomorph allele of cup have higher number of BCs compared to Wild Type (WT) females. They present a series of experiments that points towards the phenotype being due to a specific role of cup in the nurse cells that non-cell autonomously regulates BC specification. Also, they show that this phenotype is the result of an increase in the levels of JAK/STAT signalling in the BC, a major determinant of BC

      fate. In addition, they show that cup mutant egg chambers exhibit a downregulation of

      the Notch (N) pathway function in the BCs and that over-activating Notch results in the rescue of the number of BCs. Moreover, the authors present data on the effect of cup in Delta (Dl) trafficking in the nurse cells: They found that cup mutant egg chambers show increased number of Dl puncta within the cytoplasm of the nurse cells, but reduced numbers in the nurse cell-Anterior Follicle Cell (AFC) boundary as a result of defective Dl endocytosis. Finally, they were able to rescue the Dl trafficking phenotype, as well as the number of BC by overexpressing an active form of Rab11.

      Mayor points:

      In this study, the authors employed an hypomorph allele of Cup to generate egg chambers where both germline and somatic cells are mutant for Cup. They did a series of experiments to try to demonstrate that the Border Cell (BC) specification phenotype they observe is non-cell autonomous and that is due to the Loss of Function (LOF) of Cup exclusively in the nurse cells. Although I appreciate the difficulties of eliminating or reducing the levels of Cup specifically in the nurse cells only during mid-oogenesis, I feel like this is key to be able to claim that this effect of Cup in BC specification is really non-cell autonomous. The reasons why I still have some doubts that there might be some cell autonomous effects in the FCs are the following:

      o The authors show that cup01355 mutant egg chambers have a phenotype in Dl trafficking. Although they analysed in detail the effects on Dl in the nurse cells, their images show that there might be a defect in Dl levels/trafficking in the Follicle Cells (FCs) as well (Fig5A-B). It has been shown that Dl mut FCs have reduced levels of Notch activity due to reduced lateral inhibition (Poulton et al., 2011), so there is a possibility that the reduced levels of Notch activity in the cup01355 egg chambers might be due, partially, to defects in Dl trafficking/levels in the FCs, rather than in the nurse cells. o The authors tested the role of the Notch pathway in the cup mutant phenotypes by measuring the number of NICD puncta in the signal receiving cells as proxy for Notch activity (Fig4). Although I understand the rationale, I am not convinced that they can completely rule out that the changes in NICD puncta number in FCs is not due to some effect of cup LOF on Notch trafficking in these cells.

      o In figure 6, the authors show that expression of a constitutively active form of Rab11 specifically in the nurse cells restores the BC number to that of the WT. However, the levels of Dl particles and, especially the levels of NRE-GFP expression, remains slightly lower than in the WT conditions.

      Response: We do agree with the reviewer that we only observe partial rescue of the NRE GFP with Rab11CA, it suggests that Cup can affect different aspect of egg chamber development independent of Rab11 function. This has been acknowledged in the main text and it now reads as “We did note that irrespective of partial rescue in the levels of NRE-GFP and Delta puncta count, a complete reversion to wild type border cell numbers was observed when Rab11CA was overexpressed in the cup mutant germline. This may suggest either that border cell fate specification is quite robust beyond a certain base level of signaling or Cup may affect other aspects of egg chamber development independent of Rab11 function.”

      One of the main conclusions of this study is that cup regulates BC specification through a non-cell autonomous mechanism that involves communication between nurse cells and AFCs. For that reason, I think in order to conclusively say that, the authors need to try to remove the function of cup specifically in the nurse cells. They mentioned they have tried different ways of doing this unsuccessfully, but do not specify how they have tried. I suggest using the cup-RNAi line combined with a nurse cell specific Gal4 and a ubiquitous gal80ts line (tub-Gal80ts), if they have not try this. I do not expect the authors to repeat all the experiments with this condition, but at least they should test the main findings i.e. number of BCs, JAK/STAT overactivation and Notch attenuation.

      Response: To further support the non-autonomous role of Cup in border cell fate specification, we down regulated Cup function in germline nurse cells employing Mat-alpha GAL4 and Cup RNAi. Since Mat-alpha GAL4 driver has weak expression in the nurse cells of early stage chambers, it enabled us to evaluate Cup function during mid oogenesis. Consistent with our expectation, we observed higher number number of border cells in the migratory cluster compared to the control supporting our conclusion that germline Cup modulates the number of adjacent anterior follicle cells that acquire migratory border cell fate. The above results are included in (Fig 1M-O). In addition over expression if Cup cDNA in the anterior follicle cells failed to the rescue the excessive border cells observed in the Cup mutant egg chambers supporting the germline role of Cup further. This result in included in (Fig S1L-O).

      • The authors have shown in Figure 3 that there is a decrease in Notch signalling in the AFCs in cup01355 egg chambers. In order to test that the BC number phenotype observe in this condition is due to that effect on Notch signalling they have done a rescue experiment using the antimorphic Notch allele Nax-16. Since in this condition all cells (nurse cells and FCs) have increased levels of Notch, they cannot conclusively say that the increase in Notch function in the FCs rescues the cup

      phenotype. If they want to show that the function of Notch is specifically needed in the FCs, they should over-activate Notch exclusively in the AFCs. For instance, they could express a constitutively active form of Notch, such as UAS-NICD (Go et al., 1998) or UAS-NDECD (Fortini et al., 1993), specifically in the AFCs. Otherwise, they should re-write the text since they cannot conclusively say that the increase in Notch function in the FCs rescues the cup phenotype.

      Response: Following the suggestion of the reviewer, we attempted over expression of NICD in the follicle using driver slbo-GAL4 in the cup mutant background. Gratifyingly, we observed rescue in the border cell fate of Cup mutant egg chambers. However, we didn’t observe any rescue in the morphology of nurse cell nuclei of Cup mutants. This supports our conclusion that increase in Notch function in the FCs rescues the cup phenotype with respect to the border cell fate only. (Fig 3K-N).

      • The authors had made a great effort to prove that proper Delta endocytosis in the nurse cells is essential for adequate Notch signalling in the AFCs and right number of BCs recruitment. Specifically:

      o They checked the consequences on Dl trafficking of down-regulation of rab5 or auxilin, but they did not test the effect in BC numbers * o They show that downregulating the function of shi affects the number of BCs, but did not show the effect of this condition in Dl trafficking. * Consequently, they cannot conclusively say that effects on trafficking of Dl affect number of BCs, since they haven't really tested both effects on the same background. I think that for simplification, they should test both, effects on Dl trafficking and number of BCs in one of those genetic backgrounds and leave the other two for supplementary material. Alternatively, they should re-write their conclusion for this section.

      Response: As Rab11GTPase over expression rescued the excessive border cell fate in the cup mutants, to test the specificity we downregulated Rab11 function in the germline itself to check Delta trafficking and border cell fate specification. We employed a late expressing GAL4 driver in the germline and observed that down regulation of Rab11 function resulted in more number of follicle cell acquiring border cell fate and decrease in the number of Delta puncta at the interface of Anterior follicle cells and nurse cells. This phenotype is reminiscent of the Cup mutants suggesting that perturbing the recycling component of endocytosis perse affects border cell fate and Delta trafficking. This result in included in (Fig 6D-I)

      • Their results clearly show that Dl accumulates in puncta, suggesting that there might be a defect in Dl trafficking, and although their rescue experiments point towards an scenario where Rab11-dependent Dl recycling is being affected, I think there are some weak points on their arguments. The fact that Rab11-KD does not generally affect Notch signalling in the FCs, as shown in (Windler & Bilder, 2010) argues against their conclusion that the effect of cup in nurse cells on Rab11 function is responsible for the defects in Dl trafficking and, subsequently, on Notch activity in AFCs. An alternative explanation is that Rab11 overactivation in the Cup mutant background compensates for a different defect on Dl trafficking, for example, Rab4-dependent recycling pathway. Another possibility is that AFCs could be specially sensitive to changes in Rab11-dependent Dl trafficking defects in the nurse cells. To distinguish between these two possibilities, they should perform some of the following experiments:
      • o First of all, there are a number of endosome markers that can be used to check in which step of the endocytic route Dl is being accumulated, including (but not limited to) anti-Rab11 antibody, anti-Rab5, anti-Rab7, tub-Rab4-mcherry. They should do co-localization experiments with Dl and endosomal markers.*
      • o Also, they could check what happens to the number of BCs and Dl trafficking when Rab11 function is blocked in the nurse cells, in a similar way to what they did with Auxillin, Rab5 and Shi. They could use some of the tools described in (Satoh et al., 2005)*

      Response: We have perturbed Rab11 function during mid oogenesis which is quite distant from early stage egg chambers examined by Windler & Bilder. We observed that down regulation of Rab11 activity in germline affects both border cell fate in the AFCs and Delta trafficking in the germline itself. Protein Trap analysis of Rab11 in wild type and Cup mutant background suggests Rab11 is enriched in the trans-golgi network where the activity of Rab11 is modulated through nucleotide exchange. Over all our results suggest that Rab11 activity is diminished in the cup01355 egg chambers and thus stimulating the recycling endocytosis restores Notch signalling in the AFCs, limiting JAK-STAT activation and restricting BC cell fate specification.

      • The authors final model is one in which cup in the nurse cells regulates Rab11 function to ultimately control JAK/STAT signalling in the AFCs. However, they have not looked at the status of JAK/STAT signalling in their Rab11-CA rescue experiments. I think this experiment will really round-up their work.* Response: The border cell fate is linked to activation of JAK-STAT signaling in the anterior follicle cells. As we have already exhausted the STAT antibody, it will difficult to access the levels of STAT perse.

      Minor points:

      • The authors tested if the extra BC phenotype observed in the cup mutant egg chambers is due to defects in FCs endoreplication. I have two questions related to this section.*

      • o First of all, I do not understand the rationale behind this idea that defects in FCs endoreplication would result in extra BCs. Please explain and add any relevant references.*

      • o Secondly, they say that they used Cut and Phospho-Histone3 as endoreplication markers. I believe that what they mean is that the absent of these two markers indicates that FCs have exit the cell cycle and enter the endocycle (Sun & Deng, 2005), however they are not markers of endoreplication. Please, re-write to make this clear.*

      Response: The follicle cell exhibits a switch from mitotic to endocycle phase at a particular stage of oogenesis (Sun & Deng’ 2005). Our premise is that incase this switch is delayed, will the extra proliferation can account for the excessive border cell fate? In this context we have modified the text to render clarity to this section.

      • The authors tested whether the levels of Notch activity were altered in the cup mutant egg chambers. For that, they used an NRE-GFP construct that shows a clear reduction in the levels of Notch activity in the AFCs. They also used the number of NICD and NECD puncta in signal receiving and sending cells respectively, as proxy of Notch activity. Although I understand the rationale, there are other explanations for this phenotype as discussed above. Thus, if they want to have an alternative way of showing the dampening of Notch signalling, they could use the levels of expression of well characterised targets of Notch in the FCs, such us hnt and E(spl)mb-CD2 or E(spl)m7. Response: We believe that our new set of data with NICD over expression (in the AFCs) rescuing border cell fate in Cup mutants coupled with NRE-GFP, NICD, NECD data now lends stronger support to our claim that Notch signaling in the follicle cells is indeed downstream of Cup function in developing egg chambers.

      • In M&M the authors explain that NRE-GFP levels were expressed in Fold change. However, in figure 3C the units of the graph are Fluorescence Intensity in a.u. Please,*

      check this small inconsistency

      Response: We have modified this as per reviewer’s suggestion.

      • In figure 4, they show the quantification of tubulin fibres within the nurse cells, however they are missing a similar analysis of Phalloidin (Pha) fibres/levels. I think this experiment and figure will be more complete if the authors added such a quantification of the effects of cup LOF in Pha distribution. Also, the authors do not show the single Pha channel in Fig4C, which would greatly helped to appreciate the differences between the WT and Cup LOF nurse cells. I suggest modifying the figure to better show the changes in Pha distribution. Response: We have modified the figure and included quantitation of actin fibre length in Supplementary figure 6H- K.

      • In figure 4F-G the authors are showing the general effect of cup LOF in Delta distribution. They indicate with yellow arrowheads the cytoplasmic Dl puncta accumulation in the nurse cells, however it is almost impossible to see such puncta with that level of magnification/resolution. I suggest removing the arrowheads, since the figure 4H-I shows the same puncta more clearly. Response: We have modified the figure to render clarity

      • In the Dl trafficking experiments (Fig4 H-I,K,L and Fig5A-C), the authors measured the number of puncta in the anterior nurse cell-follicle cell junction. In order to do those types of quantifications they need to be able to tell the cell boundaries that separate FCs from the nurse cells. Please, clarify the criteria for determining if the puncta are within the FCs or the underlying nurse cells. Response: Delta, NICD, NECD proteins marks the apical surface of the follicle cells. We used this as a reference to segregate nurse cell puncta with respect to follicle cells. This has been elaborated in the Material & Method section.

      • In figure 6C-D the authors show example images of egg chambers expressing Rab11-CA-YFP using the germline specific nos-Gal4. However, in the images it looks like the YFP signal is coming from the surrounding stretched FCs. Please check that these are the right images or explain the inconsistency.

      Response: We have crosschecked the images and the YFP signaling is from nurse cell periphery which gives the wrong impression that it is from stretched follicle cells.

      • In figures 1R, 2L, 3Q, 6I, 6M, the authors should show the results of the statistical analysis between all the conditions tested. I think that this is crucial to be able to tell whether some of the rescues are complete or only partial. *Responses: To avoid cramming the Figures, we have including some of the p values in the Figure legends. *

      • Line 174: should say "mutant egg chambers".*
      • Line 281: There is a reference that is missing from reference list: Liu et al., 2010;*
      • Line 292: The reference for the NRE-GFP construct is not the correct one, since that references to a review article. Please, add the correct reference.*
      • In line 462 of the manuscript you have a reference that is missing from your reference list.*
      • In line 394 the authors say: "protein, it's enrichment in the cytoplasmic fraction of the cup mutant egg chambers", but I think that they meant mutant nurse cells.*

      Response: We have modified the text as per the all the suggestions above Reviewer #2 (Significance (Required)):

      The BC migration is an excellent model to study collective cell migration and how epithelial cells can acquire migratory behaviours. After years of study, there is good understanding of the signals and genetic circuits that regulate BCs specification and migration (Montell et al., 2012), but there are not many studies, to my knowledge, that describe a role of nurse cells in specifying or guiding the migration of these cells. Thus, this study by Saha and colleagues is one of the first studies that show a role for nurse cells in specifying the number of BCs.

      My field of expertise is in cell-cell communication through different pathways, including Notch and Integrin signalling. I have studied the role of endocytosis in regulating Notch signalling in various contexts, including follicular epithelium in Drosophila ovaries.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript describes an investigation into the signaling that induces the differentiation of follicle cells into border cells in the Drosophila ovary. Previous studies have established the border cells as an informative model for studying how epithelial cells delaminate and undergo collective cell migration, and have identified the JAK-STAT and Notch pathways as important regulators of the process. Here, the authors performed a forward genetic screen and identified cup as another gene that is involved in the regulation of border cell differentiation. Their findings are consistent with a model in which cup is required in germ cells for the endocytosis of the Notch ligand, Delta. In cup mutants, impaired trafficking of Delta leads to decreased Notch signaling in follicle cells, which allows for increased JAK-STAT expression in follicle cells and an increase in the number of follicle cells that differentiate into border cells. Overall, the approach is thorough and the phenotypes are clear and well-described. The quantification of phenotype penetrance and of aspects of the images, such as pixel intensities and the number of particles in a region is a strength of the paper. The use of multiple independent methods to test key points is another strength. However, there are several concerns that should be addressed before the paper is considered for publication:

        1. The central phenotype that this paper is based on is a difference in the number of border cells per cluster in wildtype and mutant genotypes. However, this phenotype is fairly subtle in some cases (e.g. in Fig. 2L, it varies by only about 10% between control and mutant) and it is somewhat variable. For example, the number of cells in border cell clusters of the controls range from 4.49 in Fig. 3M to 6.41 in Fig. 1F. Considering that the mutant values fall within this range in some cases (e.g. 5.98 in Fig 3M) and the difference between the means from control and mutant genotypes is often less than two, the significance of this phenotype is unclear. How does this compare to other mutants that have been described to affect border cell specification? Are there any consequences for the differentiation of the follicle or the function of the egg caused by this defect?*

      Response: We are using the border cell number as readout for the output of JAK-STAT signaling. Though the difference in numbers may appear to be subtle, we believe our data clearly demonstrates that Cup non cell autonomously regulates border cell fate by modulating Notch signaling in the follicle cells*. *

      • Wang, et al. (PMID 17010965) have described previously that Notch signaling, and*

      Kuzbanian specifically, is required for border cell migration. The authors should cite this paper and discuss their findings in light of this study. For example, if Notch signaling is impaired in cup mutants, is border cell migration also impaired? Likewise, the citation of the Assa-Kunik, 2007 study as evidence that Notch and JAK-STAT signaling act antagonistically (Line 286) is a bit of an oversimplification. While that study does show that Notch and JAK-STAT act antagonistically at earlier stages of follicle development, Fig. 6 of that paper shows that a Notch reporter and a JAK-STAT reporter are both expressed concomitantly in border cells of a Stage 10 follicle and in the anterior follicle cells of what looks like a Stage ~8 follicle. The authors should discuss the apparent contradiction between their findings and this study.

      Response: We provide genetic evidence to support our claims that Cup in the germline modulates Notch activation in the anterior follicle cells thus limiting border cell fate specification to a few. The overlap in the expression of Notch reporter m7-lacz and STAT in the follicle cells and border cells is interesting and will need further investigation in real time to decipher any comparison between the two studies.

      • Lastly, the manuscript contains many grammatical errors, incomplete sentences, improper punctuation and spacing, and informal writing, such as the use of contractions. It should be thoroughly edited for content and clarity.*

      Response: We have tried to edit the manuscript with the aim to improve on the language, grammar and punctuations.

      Reviewer #3 (Significance (Required)):

      Although the identification of cup as a contributor to the regulation of border cell differentiation is novel, the other main regulators investigated in this study, including Notch and JAK-STAT signaling, have been identified previously. The role of cup in this context seems to be to fine tune Notch signaling and it seems to play a relatively minor role in the process of border cell specification. In addition, the conclusions of this paper are not well-integrated into the existing literature on Notch and JAK-STAT signaling in border cells, and the discussion about the broader implications of this study for the understanding of Notch signaling was not well-developed. However, the careful documentation and quantification of the phenotypes reported in this study adds rigor and allows for firm conclusions. For these reasons, this study may have a lasting but perhaps somewhat incremental impact on the study of border cell migration in the Drosophila ovary.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, authors establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. They used B. subvibroides and C. crescentus specific mutants in pseI and legI genes involved in the Pse and Leg biosynthesis, respectively, and cross-complementation assays with orthologous genes from different bacterial species, analysing motility and flagellin glycosylation. These assays show that Pse and Leg biosynthetic pathways are genetically different and recognize the LegX enzyme as a critical element in the Leg-specific enzymatic biosynthesis. Since that legX orthologous were only identified in the genome of bacteria with Leg biosynthetic pathways, it becomes a good marker to distinguish Leg from Pse biosynthesis pathways and a novel bioinformatic criterion for the assignment and discrimination of these two pathways. Reconstitution of Leg biosynthetic pathway of B. subvibroides in the C. crescentus mutant that lack flagellins, PseI and FlmG, complemented with both flagellin and FlmG of B. subvibroides, identified a new class of FlmG protein glycosyltransferases that modify flagellin with legionaminic acid. Furthermore, the construction of a chimeric FlmG through domain substitutions, allowed to reprogram a Pse-dependent FlmG into a Leg-dependent enzyme and reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

      Major comments:

      The conclusions obtained are convincing and well-supported. However, I think some points should be specify or clarify.

      1.- In the mutants (pseI, legI, flmG,...) the non-glycosylated flagellin are exported and assembled in a flagellum filament shorter than the WT strain. However, motility in plates is absent or very reduced. This might be produced by instability of the flagellum filament when rotating in a semi-solid surface. MET was performed from plates or liquid cultures? Do the author analyses motility in liquid media? If they did, changes in motility were observed?

      Response: The Caulobacter ΔpseI mutant accumulates low levels of flagellin in the supernatant. TEM analysis reveals that the flagellar filament is not assembled and only the hook structure is visible (PMID: 33108275). Brevundimonas subvibrioides ΔlegI or ΔflmG cells feature a shorter filament compared to WT by TEM. In all these analyses, TEM was performed on cells grown in broth to exponential growth phase as detailed in the Experimental procedures section. These mutant cells do not swim when analyzed by phase contrast microscopy. While is not known if swimming on semi-solid medium would further destabilize the flagellar structures seen in liquid cultures by TEM, there is more residual motility in B. subvibrioides mutants that make a short filament compared to C. crescentus mutants that lack the flagellar filament. Thus, our analyses point to a positive correlation between the residual motility and residual filament length when comparing the B. subvibrioides and C. crescentus mutants.

      2.- In page 5, lines 158-163, the analysis, by HPLC, of derivatized nonulosonic acid from B. subvibroides flagella, shows a major peak at 9.8 minutes retention and a minor peak at 15.3 minutes. Since that Pse-standard have retentions peaks at 9.7 and 13 minutes, and Leg-standard at 12.3 minutes, the authors cannot infer, only with these data, the flagella sugar is a legionaminic acid derivative. In my opinion, should be included that inference comes from the data obtained by HPLC analysis and genetic approaches. Thanks. Corrected. 3.- In page 5, line 173-175. Authors indicate, "While no difference in the abundance of flagellin was observed in extracts from mutant versus WT cells, flagellin was barely detectable in the supernatants of mutant cultures, suggesting flagellar filament formation is defective in these mutants". MET images show that the flagellum filament length is shorter in the mutants than in the WT strain. Therefore, if the same number of mutants and WT cells has been used in the immunodetection assays, there should be more flagellin monomers in the WT samples than in the mutants ones and flagellin bands should be less intense in mutant samples corresponding to the anchored flagellum. Why bands corresponding to flagellin in mutants and WT show similar intensity in the immunodetection assays (Figure 3C and D)? Furthermore, in lane 177-178, authors suggest that LegI and FlmG govern flagellin glycosylation and export (or stability after export). However, if filament stability is affected, the amount of flagellin monomers in the supernatant of mutants should be higher than in the WT. However, immunodetection assays show less abundance of flagelin monomers in the supernatant of mutants. Please, can you clarify this? In relation to this point, I suggest that authors include, in the experimental procedures, how they obtained the supernatants to flagellin immunodetection, as well as why they used anti- FljKCc anti-serum to detect the B. subvibroides flagellin.

      We thank the reviewer for raising this point. We have now clarified this question in the updated Experimental procedures section. Our immunoblots harbor the same number of cells harvested in exponential phase (OD=0.4). One mL of cells was harvested from cultures by centrifugation at full speed. The supernatant that was used for the immunodetection corresponds to the supernatant after the centrifugation. The supernatant fraction contains flagella that have been shed during the cell cycle at the swarmer cell to stalked cell (G1-S) transition of C. crescentus and B. subvibrioides.

      Thus, it is clear that the majority of flagellins detected by immunoblotting are in fact cell associated and specifically the intracellular flagellins. The evidence for this is that the levels are comparable between WT and ΔflmG mutant cells, even though the latter has shorter or no flagellar filaments. Moreover, while C. crescentus cells are not constantly flagellated during the cell cycle, flagellins are detectable on cell-associated samples by immunoblotting even when cells do not yet or no longer have a flagellar filament. Based on these two points, we conclude that the total flagellin levels associated with cells do not reflect the levels of flagellin assembled into a flagellar filament, but rather the flagellin bulk present in the cytoplasm.

      Consistent with this view, we previously reported that C. crescentus ΔpseI cells have the same amount of flagellins in cell lysates compared to the WT strain (PMID: 33108275), even though the mutant cells lack a flagellar filament. Thus, the results obtained here are consistent with previous observations and indicate that B. subvibrioides flagellin glycosylation mutants also still produce comparable amounts of flagellins intracellularly like the WT strain, despite the absence of flagellin glycosylation and inefficient assembly into a flagellar filament.

      Concerning the potential role of LegI and FlmG in flagellin stability after export, we were referring to protein stability (half-life), not filament stability. Glycosylation may impact the half-life of extracellular flagellins since glycosylation can protect from proteolytic degradation of proteins, possibly in this case by different proteases that may accumulate in the supernatant. Thus, non-glycosylated flagellins could be more easily degraded by extracellular proteases once they are exported, ultimately resulting in a lower amount in the supernatant.

      Addressing the final question about the specificity of the anti-FljKCc antiserum: we used this anti-serum because it detects the B. subvibrioides flagellins owing to the high sequence similarity between B. subvibrioides flagellins and C. crescentus flagellins. We previously showed that the anti-FljKCc anti-serum detects all six flagellins from C. crescentus, as determined by individually expressing each flagellin in a strain deleted for all six flagellin genes (Δfljx6) (PMID: 33108275). FljKCc (against which the antibody was raised) is 65% similar to the most distant C. crescentus flagellin, FljJ. As the similarity of FljKCc to the three B. subvibrioides flagellins ranges from 74% -67% sequence similarity, they should be even better recognized by the anti- FljKCc antibody than C. crescentus FljJ. However, on immunoblots we cannot attribute the signal to any individual B. subvibrioides flagellin as they could all co-migrate on SDS-PAGE and therefore all flagellins might reside in the same immunoblot band. However, we can clearly demonstrate that the immunoblot band corresponds to flagellins: a B. subvibrioides ΔflaF mutant (see below) that we constructed revealed that the flagellin signal is lost, as is the case for a C. crescentus ΔflaF mutant (PMID: 33113346). In the case of C. crescentus, the FlaF secretion chaperone is required for flagellin translation (synthesis) and we suspect that this also the case for B. subvibrioides FlaF. This experiment provides additional evidence that the B. subvibrioides flagellins are recognized by the anti-FljK (C. crescentus) anti-serum.

      4.- Authors demonstrate the specificity of the GT-B domain of FlmG, using a chimeric FlmGCc-Bs in a mutant of C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides. However, since that TPR comes from C. crescentus, this chimeric protein, could be transfer the legionaminic acid to the flagellin of B. subvibroides? Furthermore, the complementation of this mutant with the FlmGBs did not support efficient flagellin modification and this might be related to the TPRCc domain. Therefore, in my opinion, the chimeric protein should be introduced in the B. subvibroides∆flmG background. The answer to the first question is “No” or “very inefficiently” as determined from immunoblot analyses of B. subvibrioides ΔflmG cells expressing the chimeric FlmG_Cc-Bs protein that we now show in Fig S2B.

      Expression of the different FlmG (FlmG_Cc, FlmG_Bs, FlmG_Cc-Bs) in C. crescentus cells producing Pse or Leg revealed that FlmG_Bs does not support efficient flagellin modification with Pse in C. crescentus, likely because FlmG_Bs interacts poorly with the C. crescentus flagellins. By using the FlmG_Cc-Bs chimera we hoped to overcome this interaction problem with the C. crescentus flagellins (because the FlmG chimera harbors the C. crescentus TPR to bind the C. crescentus flagellins), however glycosyltransfer still does not occur efficiently because the GT domain from FlmG_Bs does not function with Pse. However, FlmG_Cc-Bs can modify the C. crescentus flagellins once C. crescentus is genetically modified to produce CMP-Leg (instead of CMP-Pse). This confirms that the FlmG TPR from C. crescentus is important for flagellin modification through the FlmG/flagellin interaction and that GT_B type glycosyltransferase only transfers Leg. In addition, we have now added as Fig S2B an immunoblot and as Fig S2C a motility test of B. subvibrioides ΔflmG cells expressing the FlmG_Cc-Bs chimeric protein in which we only observed little modification of B. subvibrioides flagellins and a poor motility, respectively. We extended our discussion of these results.

      5.- Page 8, line 299-301. Authors point out that C. crescentus that lacks FlmG and harbour the Leg biosynthetic pathway of B. subvibroides and the chimeric FlmGCc-Bs, although it has a glycosylated flagellin, whose mobility in SDS-PAGE is like the WT strain, is non-motile. They suggest that additional factors exist in the flagellation pathway that exhibit specificity towards the glycosyl group that is joined to flagellins. However, would be interesting to see if the flagellum filament has similar length to the WT strain or at least, it has increased in relation to the flagella length of the mutant. If flagella length has not increased, it could suggest that changes in the glycan type might affects the flagellin assembly or the stability of the flagellum filament. Therefore, would be also important to analyse its motility in liquid media.

      To investigate why the C. crescentus cells that produce Leg and express the chimeric FlmGCc-Bs glycosyltransferase are non-motile (Figure S5B) despite flagellin modification (by immunoblotting, Figure 7C), we employed two strategies. First, we performed immunoblot analyses on the supernatant fraction from these cells to determine if flagellins accumulate extracellularly. As now showed in Figure S5A, only low amounts of C. crescentus flagellins modified by Leg are present in the SN fraction. Second, we conducted TEM analyses of cells grown to exponential growth phase in broth. As shown in Figure S5C, the C. crescentus cells producing Leg and expressing FlmG_Cc-Bs glycosyltransferase harbor a shorter flagellum compared to those expressing the FlmG_Cc in which C. crescentus flagellins are modified by Pse. Altogether these results explain why these cells are non-motile both on soft agar plate and in liquid.

      Minor comments: 1.- Pag 3 line102. Please change ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes only PseI...." to ".....two predicted synthases, a PseI and LegI homolog, and C. crescentus only encodes a PseI...." 2.- Figure 2 A. Plasmid nomenclature (Plac-neuB) is confusing because C.c. ΔpseI cells express predicted LegI or PseI synthases. Please change to Plac, as in Figure 2B and 4. Figure 2A and 2B do not contain any complementation with Bacillus subtilis (Basu), however two complementation are labelled as Bs in Figure 2A and 2B. Furthermore, no Bs are present in the Figure 2 legend. 3.- Legend of figure 3 should include B. subvibrioides abreviation Bs. Line 774: Please change ".......glycosylation and secretion in B. subvibrioides." to ".......glycosylation and secretion in B. subvibrioides (Bs)." 4.- Figure 3. In order to keep a similar nomenclature in all plasmids, plasmid Plac-legI syn and Plac-flmG should be labelled as Plac-legIBs syn and Plac-flmGBs.

      5.- Legend of figure 4 should include B. subvibrioides abreviation Bs. Line 791: Please change "....... complementation of the B.subvibrioides ΔlegI mutant with ...." to "....... complementation of the B.subvibrioides (Bs)ΔlegI mutant with ...." Furthermore, Legend of figure 4 indicate in line 795, that immunoblots reveal the intracellular levels of flagellin, however figure 2 and 3 show immunoblot of cell extracts. Please, correct this sentence. 6.- Legend of figure 5, 6 and 7 should include B. subvibrioides abreviation Bs. Line 808: Please change "Predicted Leg biosynthetic pathway in B. subvibrioides " to"Predicted Leg biosynthetic pathway in B. subvibrioides (Bs)" Line 834: Please change "....affects motility, flagellin glycosylation and secretion in B. subvibrioides."to "....affects motility, flagellin glycosylation and secretion in B. subvibrioides (Bs).Line 852: Please change "...acetyltransferase in flagellar motility of B. subvibrioides cells." to ""...acetyltransferase in flagellar motility of B. subvibrioides (Bs) cells." Furthermore, figure 5 should include C. crescentus abbreviation. Line 815: Please change "....whole cell lysates from C. crescentus mutant cultures......." to "....whole cell lysates from C. crescentus (Cc) mutant cultures......." 7.- In my opinion it would be useful to include a scheme of the gene organization involved in Leg biosynthesis in B. subvibrioides.

      8.- Legend of figure S1 should include B. subvibrioides (Bs) and C. crescentus (Cc) abbreviations. Line 888-867: Please change "...C. crescentus ΔpseI cells and B. subvibrioides ΔlegI cells with plasmids expressing..." to "...C. crescentus (Cc) ΔpseI cells and B. subvibrioides (Bs) ΔlegI cells with plasmids expressing..." Furthermore, the name and abbreviations (Mm, So, Ku, Pi, Dv) of the species used should be included in the legend. Why the authors used a plasmid with a Pvan promoter in these assays? Why the authors changed the code color of pseI and legI orthologous genes? It would be more useful and understandable follow the code color used in figure 2 and 4.

      Page 6 line 200, Please change ".....complementing synthases exhibit greater overall sequence similarity to LegI than Pse of C. jejuni. 22268,....." to ".....complementing synthases exhibit greater overall sequence similarity to LegI than PseI of C. jejuni. 22268,....." 10.- Page 7 line 231, Please change ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and P. sp. Irchel 3E13..." to ".....negative bacteria A. baumannii LAC-4 (GCA_000786735.1)[38] and Pseudomonas sp. Irchel 3E13..." 11.- Introduce a line break between line 503 and 504. 12.- Page 14 line 543, please change "XbaI" to "XbaI" Thanks for the careful editing. We changed the text as suggested by the reviewer. We also added a scheme showing the genetic organization of the genes involved in Leg production and present as Figure 1B. When this study was initiated, the pMT335 plasmid with a Pvan promoter was used before we switched to using the pSRK plasmid with Plac promoter for better induction. Note that the results with Pvan or Plac are comparable regarding the PseI synthases interchangeability. Color code is now homogenous through the manuscript.

      Reviewer #1 (Significance (Required)):

      This is an interesting manuscript that contributes to the knowledge of the legionaminic biosynthetic pathway and establish a glyco-profiling platform for the functional analysis of genes involved in pseudaminic (Pse) and legionaminic (Leg) acid biosynthetic pathways. The analysis of Leg patway allowed to identify a gene (legX) that can be used to distinguish Leg from Pse biosynthesis pathways, becoming a bioinformatic tool for the assignment and discrimination of these two pathways. Furthermore, a new class of FlmG protein glycosyltransferases, able to transfer Leg to the flagellin, has been identified and its analysis reveal two modular determinants that govern flagellin glycosyltransferase specificity: a glycosyltransferase domain that accepts either Leg or Pse, and a specialized flagellin-binding domain to identify the substrate.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: Viollier and co-workers present a study in which they preform an elegant and rigorous genetic profiling of the the legionaminic and pseudaminic acid biosynthesis and flagellar glycosylation pathways in C. crescentus (native Pse) and B. subvibrioides (native Leg). They use motility as a representative readout for functional flagellar glycosylation with these microbial sialic acids. They discover orthologous Pse synthase genes can replace the function of the native synthase in C. crescentus and orthologous legionaminic acid synthase genes can achieve the same in B. subvibrioides. However, not vice versa indicating a strong preference for each microbial sialic acid stereoisomer in these species. For the Leg biosynthesis pathway, which requires GDP-GlcNAc, the authors also identify LegX as an essential component to synthesize this sugar nucleotide and thus a marker for Leg biosynthesis pathways. Upstream in theses pathways, they also identify a new class of FlmG flagellar protein glycosyltransferases. Importantly, through heterologous reconstitution experiments to uncovered that these glycosyltransferases possess two distinct domains, a transferase domain the determines specificity for either CMP-Leg or CMP-Pse, and a flagellin-binding domain to achieve selectivity for the substrate. Interestingly, by creating chimeric FlmG for these two domains between C. crescentus and B. subvibrioides they show that these two modular parts can be interchanged to adapt flagellin glycosyltransferase specificity in these species. Major comments: The key conclusions of the manuscript by Viollier and co-workers are convincing and well supported by their experiments and used methods, with respect to the insulation of the Leg and Pse biosynthetic pathways, they key role of LegX in launching the Leg pathway and the successful reconstitution of Leg glycosylation in a previously Pse-producing C. crescentus strain. Finally, they convincingly show that a chimeric version of the involved glycosyltransferases is functional, which besides intriguing future glycoengineering possibilities also emphasizes the two discrete domains in these transferases that dictate their sugar nucleotide and acceptor specificity. There is one additional experiment I would suggest with relation to the detection and confirmation of Pse and Leg on flagella of respectively, C. crescentus and B. subvibrioides. In the case of C. crescentus the detected DMB derivatized monosaccharide co-elutes with a validated standard of tri-acetylated Pse, which is convincing evidence of its identity. However, for B. subvibrioides. Their DMB derivatized monosaccharides from its flagella, results in a peak the does not co-elute with the only Leg standard (Leg5Ac7Ac) they have, it does elute at the same time as their Pse standard. Although it cannot of course be Pse as B. subvibrioides. Does not possess a Pse biosynthesis pathway, it also does not provide enough evidence to conclude that it is a Leg derivative. An MS(-MS) measurement of the eluted signal would not be a big investment in time and resources and would provide additional evidence to at least assign this peak to microbial sialic acid related to the present Leg biosynthesis pathway. It the identified mass would lead to identification of the derivative, it would also add to the proper characterization of the flagella glycosylation in the bacterium.

      We have now added the glycopeptide analyses as requested. They are described in the last experimental section and confirm our results.

      The data and the methods presented in this study are presented with sufficient detail so that they can be reproduced? However, I would suggest as is common nowadays in most journals that the authors include images of the raw unprocessed blot in de supporting info.

      *The motility pictures are representative of three independent experiments and the immunoblots are representative of at least two independent experiments. This has now been mentioned in the Experimental procedures. The raw unprocessed blots have now been added as supporting info. *

      Minor comments: There are a few textual errors that the authors should fix: -page 2, line 70: change "used" to "use" -page 11, line 407: add the word "are" after Pse On page 2, line 36, the authors state that "most eubacteria and the archaea typically decorate their cell surface structures with (5-, 7-)diacetamido derivatives, either pseudaminic acid (Pse) and/or its stereoisomer legionaminic acid (Leg,". This should be nuanced as to my knowledge it is not most eubacteria, but more a subset as identified by Varki in his seminal PNAS paper. The authors clearly present their data and conclusions in the figures of this manuscript. However, I would recommend the take a critical look at the drawing of their monosaccharide chair conformations and the positioning of the axial and equatorial groups on these chairs in Figure 1 and 5, as these are in most cases drawn a bit crooked, which can easily be corrected. We corrected the text as the reviewer suggested. We changed the sentence in the introduction to be more nuanced. The drawing of the monosaccharide has been improved.

      Reviewer #2 (Significance (Required)):

      The family of carbohydrates called sialic acids was long thought to exclusively occur in glycoproteins and glycolipids of vertebrates, but has since also been found in specific microbes. Especially symbiotic and pathogenic microbes associated with the humans express a wide array of unique microbial sialic acids for which their functional roles are not well understood and the associated glycosylhydrolase and glycosyltransferase have in most cases not been identified yet. The authors present an impressive insight into flagellar glycosylation with Pseudaminic and Legionaminic acid in two bacterial species, using genomic analysis, rewiring, immunoblots and motility assays as their main tools. They provide compelling evidence on the insulation of the Pse of Leg pathway in these species, the flexibility in exchanging between biosynthetic enzymes from the same pathway between various species. Crucially, most glycosyltransferases that add the Pse or Leg glycoform onto various acceptor sites in bacteria, have up to this point remained elusive in most cases. It is therefore very valuable information that the authors here provide on the involved glycosyltransferases. Especially, on the two domains that govern their sugar nucleotide and acceptor specificity, and that these can be reengineered as chimeric glycosyltransferases. To me as a chemical glycobiologist this provides compelling possibilities for glycoengineering possibilities in future studies in the field to elucidate the functional roles of Pse and Leg glycosylation.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary of the findings and key conclusions (including methodology and model system(s) where appropriate): Kint et al describe a neat study of bacterial flagellin glycosylation by a recently identified class of protein glycosyltransferases called FlmG. The experiments are well designed, the data presented is convincing and the conclusions drawn are mostly in line with the experimental evidence presented. These are the key findings. Kint et al show that genetic tools and motility can be used as a readout to probe the sugar biosynthesis pathway in bacteria. Using the recently characterized system of Caulobacter crescentus, they have performed a survey of different PseI/LegI/NeuB genes from various bacteria, checking whether they could rescue the motility defect in C. crescentus ΔpseI cells. They found that those genes that did confer motility also had higher sequence similarity to C. jejuni PseI than to C. jejuni LegI or C. jejuni NeuB. They also found that these genes also restored flagellin glycosylation as checked by mobility shift on gel electrophoresis with immunoblotting to anti-FljK antibody. This survey brought up an interesting finding that the PseI/LegI/NeuB orthologs of the closely related Brevundimonas species were unable to confer motility to C. crescentus ΔpseI cells, and were more similar to C. jejuni LegI than to C. jejuni PseI or C. jejuni NeuB. They also performed similar glycoprofiling experiments using B. subvibrioides ΔlegIBs cells and various PseI/LegI/NeuB orthologs from different bacteria, which indicated the restoration of motility by putative LegI synthases. Kint et al demonstrate flagellin glycosylation in B. subvibrioides by performing in-frame deletions of FlmG, and LegI genes in B. subvibrioides and checking for motility, presence of flagella, and flagellin glycosylation by motility shift on gel electrophoresis. Further, they confirm the critical nature of GDP-GlcNAc for Leg biosynthesis by assessing flagellin glycosylation and motility in B. subvibrioides with an in-frame deletion in PtmE/LegX and by performing heterologous complementation with an M. humiferra PtmE ortholog. They also reconstitute the legionaminic acid biosynthesis pathway in C. crescentus cells that lack flagellins, PseI and FlmG, and show that the heterologously expressed B. subvibrioides flagellin is glycosylated by heterologously expressed B. subvibrioides FlmG. Finally, they also show that whereas the CcFlmG cannot substitute for BsFlmG and vice versa, a chimeric FlmG bearing the TPR domain from C. crescentus FlmG (that recognizes C. crescentus FljK) and the GT domain from B. subvibrioides FlmG (that transfers CMP-Leg) modifies CcFljK in C. crescentus cells that lack CcFlmG but express both Pse (endogenously) and Leg (from the reconstituted pathway). This demonstrates the modularity of the FlmG glycosyltransferases. Kint et al provide the chemical nature of C. crescentus flagellin glycosylation. Kint et al have analyzed the glycans released from the flagellin by acid hydrolysis and clearly shown the nature of the glycan in C. crescentus flagellin to be Pse4Ac5Ac7Ac by use of Pse standards. The glycan from B. subvibrioides was distinct from the Leg standard used, and could be a Leg derivative distinct from Leg5Ac7Ac.

      Major comments: 1. Table 1 and Text in Results, lines 116-119, "In support of the notion that derivatization occurs after the PEP-dependent condensation reaction to form Pse or Leg, our glyco profiling analysis revealed that putative PseI proteins (identified by sequence comparisons to C. jejuni 11168, Table S1) conferred motility to C. crescentus ΔpseI cells, whereas putative LegI synthases did not." Not clear how putative PseI and LegI synthases were identified. Table 1 only lists overall percent sequence identity and similarity to Cj PseI, LegI and NeuB, and percent identities and similarities of the various nonulosonic synthases to these proteins are in the similar range, as expected. In the absence of sequence alignments indicating the presence of conserved residues, particularly related to the substrate binding region, that are distinct in these paralogs, calling out the type of synthase based on the highest percent identity (to Cj PseI, LegI or NeuB) is speculative. Also, Shewanella oneidensis does not follow the pattern of highest similarity to NeuB3. Second, in the absence of data showing that the Leg and Pse found in these different organisms actually are different derivatives, this does not support that "derivatization occurs after the PEP-dependent condensation reaction to form Pse or Leg". Putative PseI and LegI were proposed based on BlastP analyses in which the protein sequences of interest were aligned to the three experimentally validated synthases from C. jejuni 11168: PseI, LegI, NeuB as well as PseI from C. crescentus, as indicated in Table S1. While, the assignment of the donor sugar is based only on the sequence identity and similarity to LegI or PseI, this assignment corresponds well according to the restoration of the motility of the C. crescentus ΔpseI mutant upon expression of PseI ortholog and B. subvibrioides ΔlegI mutant with heterologous LegI expression.

      It is true that for Shewanella oneidensis the assignment as PseI or LegI is ambiguous, exhibiting nearly identical similarity, but it is quite distinct from NeuB. This actually makes the S. oneidensis synthase a very interesting case to explore the enzymology of its Pse/LegI ortholog, knowing that it has been previously shown that this bacterium glycosylates its flagellins with Pse derivatives (PMID: 24039942). The results from our genetic complementation analysis are however very clear (PseI ortholog) and very consistent with the functional analysis in S. oneidensis.

      Concerning the different derivatives of Pse or Leg: McDonald and Boyd (PMID 32950378) recently published a review giving some examples of Bacteria/Archaea experimentally shown to contain Pse/Leg-derivatives: C. jejuni 11168 modifies its flagellin with 5,7-N-acetyl Pse, Sinorhizobium fredii NGR234 (not used in this study but in our previous work PMID 33113346 and showed to restore the motility of C. crescentus ΔpseI cells) modifies its capsule with 5-acetamido-7-3-hydroxybutyramido-Pse), Treponema denticola modifies its flagellin with 7-(2-metoxy-4,5,6-trihydroxy-hexanoyl-Pse, A. baumannii LAC-4 produces 5,7-N-acetyl-8-epi-Leg to decorate the capsule, Halorubrum sp. PV6 modifies the LPS with N-formylated Leg and L. pneumophila produces 5-acetamidino-Leg.

      The reviewer is right in that we do not know the exact version of Pse or Leg produced in C. crescentus and B. subvibrioides, HOWEVER, the fact that complementation works with the majority of the orthologs of PseI and LegI including many from bacteria that are known to produce modified Pse derivatives for example in Shewanella oneidensis and Treponema denticola, the most likely explanation is that derivatization occurs after the PseI or LegI step, but we concede that the results are also compatible with a promiscuous enzyme that can accept different Pse derivatives or different Leg derivatives.

      1. Related to (1), Text in Results, lines 130-131, "We conclude from our survey that (heterologous) PseI synthase activity generally confers motility to C. crescentus ΔpseI cells, whereas LegI-type (or NeuB-type) synthases are unable to do so." There is no a priori evidence provided indicating that these were PseI or LegI type synthases. So the conclusion really is that assuming only PseI type synthases would be able to rescue the motility defect in C. crescentus ΔpseI cells, this glyco-profiling motility assay now provides the first biochemical evidence telling us which synthases are Pse-type, and which are Neu/Leg-type. And in my view, this is the conclusion of greater significance in the field - to be able to now identify which is a PseI and which a LegI based on these complementation assays. However, if the authors still wish to retain their original conclusion, they could cite or provide evidence (either biochemical evidence in this work or reported literature regarding the sugar synthesized or bioninformatics analysis regarding the presence of distinct genes such as the Ptm genes for legionaminic acid biosynthesis pathway or genes that differ in their enzyme activities and overall fold such as PseB/LegB or PseG/LegG in the gene neighborhood) indicating or suggesting the PseI/LegI/NeuB nature of the different synthases. Also, methods for the bioinformatics analysis (eg. BLASTp settings used, dates of searches, whether regular BLAST or PSI-BLAST was used, etc.) are missing in the manuscript, and need to be included. We agree that for many PseI or LegI tested, there is no provided biochemical evidence. HOWEVER, this is not the case for some of them including the PseI, LegI and NeuB from Campylobacter jejuni (PMID 19282391), some A. baumannii strains (α-epi-legionaminic acid for A. baumannii LAC-4 PMID 24690675), Shewanella oneidensis (Pseudaminic acid with methylation PMID 23543712), Legionella pneumophila (Legionaminic acid PMID 18275154) or Halorubrum sp. PV6 (N-formylated legionaminic acid PMID 30245679). Thus, we maintain the two conclusions: the PseI and LegI synthases are generally interchangeable and the complementation assays can enable to identify and assign PseI and LegI function. BLAST2P was used to compare the protein sequences of the tested NeuB-like synthases with NeuB1, LegI (NeuB2) and PseI (NeuB3) from Campylobacter jejuni but also with PseI from C. crescentus. BLOSUM62 matrix was used as well as a word of size 3 for the comparison. We have now added this procedure in the legend of the Table S1.
      2. It is interesting that there is still a signification amount of flagellin secretion/assembly in the B. subvibrioides LegI and FlmG mutants. It will be good to see a discussion about whether this is likely from due to low level of function despite the in-frame deletion of genes; how many flagellin subunits are likely to have managed secretion and assembly in these short flagella; whether there is any redundancy of LegI / FlmG (perhaps with lower levels of expression); considering Parker and Shaw's findings of glycosylation being required for flagellin binding to the chaperone and subsequence secretion in A. caviae whether there is a FlaJ homolog in B. subvibrioides. Also, can the authors rule out the possibility that absence of glycosylation does not affect flagellin assembly but makes the flagellum prone to shear/breaks in B. subvibriodes, resulting in smaller flagella? How many flagellins are there in B. subvibrioides? Is it possible that one is glycosylated but another/others are not, and that is the reason for the small flagellum in these mutants? The number of flagellin subunits that are assembled into a full-length flagellar filament is unknown in C. crescentus and in B. subvibrioides. There are 3 different flagellin genes that are now presented schematically in Figure 1C. No redundancy has been found for LegI or FlmG. It is possible that the B. subvibrioides is better in exporting non-glycosylated flagellin or that the capping proteins can function better with sugar modification or that the filament of B. subvibrioides mutants is less fragile when it is non-glycosylated or that its flagellins “stick” better. It is also possible that short filaments are not actually containing flagellins mounted on the hook but another protein that polymerizes aberrantly in the absence of Leg or FlmG. This remains to be investigated and compared to the situation of Pse and FlmG mutants of C. crescentus.

      B. subvibrioides possesses an ortholog of the C. crescentus flagellin secretion chaperon FlaF (PMID 33113346). As observed in C. crescentus, FlaF likely has a role in flagellin translation as its inactivation totally prevents flagellins production (see answer to reviewer #1). For C. crescentus, bacterial two hybrid experiments revealed that FlaF can interact with non-glycosylated flagellins in E. coli. Thus, it is strongly possible that FlaF/flagellins interaction is not dependent on the flagellins glycosylation state. In addition, the short flagellum filament observed in B. subvibrioides ΔlegI or ΔflmG mutants argues that at least some flagellins are secreted while not glycosylated.TEM pictures have been performed in liquid medium from exponential growth phase. In this condition, no fragment of flagella was observed in the culture medium by TEM but only small flagella with a hook structure attached. Also, flagella breaks might result in more random length of flagellum.

      Three flagellins are in B. subvibrioides (Bresu_2403 is 59% identical with FljLCc, Bresu_2638 is 57% identical with FljKCc and Bresu_2636 is 62% identical with FljJCc). We now show this genetic organization of the flagellins in Fig. 1C. The three flagellins are all detected by the anti-FljKCc anti serum (see answer and figure to reviewer #1). We cannot attribute the immunoblot signal to any individual B. subvibrioides flagellin as they could all co-migrate on SDS-PAGE. However, the signal often looks like a doublet (as shown in Figure 4B for example) suggesting that at least two flagellins are detected and this doublet is always found to migrate faster in absence of glycosylation that could indicate that all B. subvibrioides flagellins (or at least 2) are modified.

      Text in Results, lines 170-171, "We then probed the resulting ΔlegIBs and ΔflmGBs single mutants for motility defects in soft agar and analyzed flagellin glycosylation by immunoblotting using antibodies to FljKCc". Was the antibody to FljKCc determined to also specifically bind to FljKBs? Also, how many flagellins are there in B. subvibrioides? Are all detected with this antibody? Antibodies raised to FljKCc were raised against His6-FljK produced in E. coli (previously published in Ardissone et al, 2020). This serum recognizes the 6 flagellins from C. crescentus (PMID: 33108275). It recognized the three flagellins from B.s. (see answer to reviewer #1).

      It is interesting that C. cresentus cells expressing Pse (endogenously) and Leg (reconstituted pathway), and BsFlmG and BsFljK (corresponding to Figure 5C) are not motile. Was the motility assay done for the experiment of figure 5B as well? Are the C. crescentus cells lacking Pse and FlmG but with heterologous expression of Leg and BsFljK and BsFlmG also non-motile? Also, it will be good to see the TEM images for these cells.

      C. crescentus cells that produce Pse (endogenously) or Leg (reconstituted pathway) and BsFlmG and BsFljK (formerly Figure 5C and now as Figure 7C) are indeed not motile as shown by the motility tests presented in Figure S5B. Motility assays with cells used in the former Figure 5B (now Figure 7B) have also been done and are now presented Figure S4B. These cells are non-motile because BsFljK is not efficiently secreted (or unstable after secretion) as shown on the immunoblot of the supernatant fraction in Figure S4A lower panel. As a result, flagellar filament is not properly assembled as only a short flagellum was observed by TEM in such cells compared to the WT C. crescentus (Figure S4C and S4D).

      Immunoblotting of the supernatants should be shown (in addition to the cell extracts) for Figures 5B and 5C so that the reader can appreciate whether glycosylation has taken place but secretion/assembly has not. Further, HPLC of the acid extracts from flagellin could be done to unambiguously show whether the CcFlmG has transferred Pse and the BsFlmG and Cc-BsFlmG have transferred Leg on to the CcFljK in Figure 5c, and the identity of the sugar, if any, transferred by CcFlmG in the absence of Pse, and BsLeg genes or BsLegX gene in figure 5B.

      *__ Immunoblots of the supernatants for Figure 5B (now Figure 7B) have been done and been added (Figure S4A lower panel). BsFljK is barely detected in the supernatant whatever its glycosylation state (with or without Leg). Note that in the supporting info where the raw unprocessed blot used for this panel is shown, a positive control of blotting (C. crescentus Δfljx6 mutant expressing CcFljK from pMT463) has been used. Immunoblots of the supernatant from Figure 5C (now 7C) have been done and been added in figure S5A. The CcFljK modified with Leg is poorly secreted (or unstable after secretion). As a result, these cells only harbor a short flagellum compared to those that are able to modify CcFljK with Pse (Figure S5C).

      HPLC of the acid extracts from flagellins have been performed on purified flagella obtained by ultracentrifugation. As C. crescentus cells expressing BsFlmG and Cc-BsFlmG harbor no or short flagellar filament, the purification by ultracentrifugation is limited. Thus, to further confirm that CcFlmG has transferred Pse and Cc-BsFlmG (and BsFlmG) has transferred Leg on CcFljK (former Figure 5C and now Figure 7C), we performed immunoblots on the cell extracts of C. crescentus ΔflmG ΔpseI cells that cannot produce Pse but able to produce Leg (reconstituted pathway). These experiments, now presented in Figure 7C (lower panel) confirmed that no modification of CcFljK was observed in C. crescentus cells expressing CcFlmG whereas CcFljK is modified in C. crescentus expressing Cc-BsFlmG, confirming that Cc-BsFlmG has transferred Leg (the only NulO produced in this condition).__*

      Text in discussion, lines 334-338, "By extension, having recognized the LegX/PtmE enzyme as a critical element in the Leg-specific enzymatic biosynthesis step (Figure 6) likewise offers another functional, but also a novel bioinformatic, criterion for the correct assignment and discrimination of predicted stereoisomer biosynthesis routes residing in ever-expanding genome databases" It will be nice to see a discussion on the prevalence of PtmE versus GlmU (or equivalent gene), PtmF, PtmA, PgmL in the Leg synthesizing organisms. Is the PtmE but not the other genes found in all cases, which makes it better as a molecular determinant for bioinformatics predictions of the type of pathway? Also, on whether PtmE has any homology to genes in other pathways (not associated with flagellin glycosylation) and how reliable a marker it is to differentiate Leg biosynthesis from Neu5Ac biosynthesis pathways.

      GlmU is a potential bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase that can be part of both Pse and Leg pathway (PMID 19282391). Accordingly, a GlmU ortholog is found in C. crescentus and B. subvibrioides that we showed are producing Pse and Leg, respectively. Thus, GlmU cannot be attributed to a Leg pathway signature. On the other hand, PtmE is barely found in the organisms from which PseI orthologs restore the motility of C. crescentus ΔpseI cells.

      PtmF, PtmA, PgmL and GlmS are proposed to act upstream of the production of GlcN-1-P that is a precursor of both UDP-GlcNAc and GDP-GlcNAc, the precursors of Pse and Leg respectively. In addition, orthologs of these genes are not prevalent in the Leg synthetizing organisms present in Table S2 using BlastP analyses with C. jejuni proteins as templates.PtmE ortholog is found in most of the Leg synthetizing organisms as shown in Table S2 and often genetically linked with other genes coding for proteins involved in Leg production (shown with the asterisk * in table S2). Of note, PtmE is found not only in organisms that modify flagellin(s) with Leg but also in organisms that add Leg on capsule such as A. baumannii LAC-4.

      It is not clear from the methods or the figure legends how many times the immunoblotting, motility experiments were done; how many experiments/trials are the images representative of? The motility pictures are representative of three independent experiments. The immunoblots are representative of at least two independent experiments. This information is now added in the Experimental procedures section.

      Minor comments:

      1. The gene for GlcN-1-P guanylyltransferase in the Leg-specific enzymatic biosynthesis step is already known as PtmE from the work of Schoenhofen's group. For the sake of consistency, it would be better to retain the nomenclature as PtmE throughout the manuscript instead of introducing the name LegX, which makes it sound like it is a previously unknown gene.

      2. Text in abstract, lines 15-17: "Sialic acids commonly serve as glycosyl donors, particularly pseudaminic (Pse) or legionaminic acid (Leg) that prominently decorate eubacterial and archaeal surface layers or appendages" The glycosyl donor is the nucleotide sugar and not the nonulosonic acid or sialic acid... rephrasing required for accuracy. Done

      3. Text in abstract, lines 18: "a new class of FlmG protein glycosyltransferases that modify flagellin" The authors are presumably referring to FlmG as the new class of protein glycosyltransferases... rephrasing required for accuracy Corrected
      4. Text in introduction, lines 41-42 "Pse and Leg derivatives synthesized in vitro can be added exogenously in metabolic labeling experiments" It should be "derivatives of Pse and Leg precursors" and not "Pse and Leg derivatives" corrected
      5. Text in introduction, line 46 "Pse- or Leg-decorated flagella may also be immunogenic." This sentence is not referenced and it is not clear why it is written here.

      6. Text in introduction, lines 63-66 "The synthesis of CMP-Pse or CMP-Leg proceeds enzymatically by series of steps [20-22], ultimately ending with the condensation of an activated 6-carbon monosaccharide (typically N-acetyl glucosamine, GlcNAc) with 3-carbon pyruvate (such as phosphoenolpyruvate, PEP) by Pse or Leg synthase paralogs, PseI or LegI, respectively" The synthesis begins with activated GlcNAc. The substrate for condensation is not activated GlcNAc. It is 2,4-diacetamido-2,4,6-trideoxy-D-mannopyranose in case of LegI and 2,4-diacetamido-2,4,6-trideoxy-b-L-altropyranose in case of PseI. Indeed, we modified the sentence.

      7. Text in introduction, line 70 "for used as glycosyl donors" Typographical error, "for use as glycosyl donors" Corrected
      8. Text in Results, line 102, "C. crescentus only encodes only PseI" Do the authors mean "only one PseI"? Corrected
      9. Text in Results, lines 108 and 109, "Such modifications could occur before the PseI synthase acts or afterwards. In the latter case, most (if not all) synthases would be predicted to produce the same Pse molecule," Do the authors know of any reports of modifications occurring before the PseI synthase? Please cite references, if known. Why "most (if not all)"? If the former case is true, the PseI synthase might not be able to accept the substrate. Correct. Because we cannot test all enzymes we must keep the statement non-committing.

      “Most (if not all)” refers to the latter case i.e. the modification occurs after PseI synthase. In this context, PseI should do the same reaction, however, there might be some exceptions.

      There is, to our knowledge, no reports showing that modifications occur before the PseI synthase. The glyco-profiling experiments all suggest that modification occurs after Pse production based on our motility readout. It is possible that PseI enzymes that condense a modified precursor would not be functional in our motility assay.

      Text in Results, lines 141-143, "our bioinformatic searches using C. jejuni 11168 as reference genome identified all six putative enzymes in the B. subvibrioides ATCC15264 genome (CP002102.1) predicted to execute the synthesis of Leg from GDP-GlcNAc" Not clear how this was done. Do the authors mean that they used the genes from C. jejuni 11168 as the query genes to identify homologs in B. subvibrioides ATCC15264 genome (CP002102.1)? Or did they use putative genes from B. subvibrioides ATCC15264 genome (CP002102.1) and pull out homologs from C. jejuni 11168 by using C. jejuni 11168 as the reference genome? We now have modified the sentence to make it clearer.

      At first reading, the flow of the manuscript is difficult to follow due to the figures not appearing in full in order of their occurrence. For instance, Figures 5B and 5C are discussed only in the end of the manuscript after the results of Figures 6 and 7. Other instances also exist. The authors may consider re-ordering the figure parts if possible so that all parts of each figure appear in order of occurrence in the manuscript text. Thanks for raising this issue. We have now tried to address this concern by re-organizing the order of occurrence of the figures. Notably we have now exchanged Figure 5 (on Leg pathway reconstitution and FlmG rewiring) with Figure 7 (on LegB and LegH). We modified the text accordingly. We hope that it makes the manuscript and corresponding figures easier to follow.

      Reviewer #3 (Significance (Required)):

      The nonulosonic acids, Pseudaminic acid and Legionaminic acid, are abundant in bacterial systems in the capsular and lipopolysaccharides as well as in glycoprotein glycans. The Ser/Thr-O-nonulosonic acid glycosylation of flagellins has been studied with respect to the system of Maf glycosyltransferases in Campylobacter jejuni, C. coli, Helicobacter pylori, Aeromonas caviae, Magnetospirillum magneticum, Clostridium botulinum and Geobacillus kaustophilus, and recently with respect to the system of FlmG glycosyltransferases by Viollier's group in Caulobacter crescentus. However, the determinants that govern the glycosyltransferase function are not still well known. Kint et al have performed excellent work using bacterial genetics tools to (1) highlight the "functional insulation" of the Leg and Pse biosynthesis pathways, (2) demonstrate the modularity of the FlmG glycosyltransferase proteins with respect to the flagellin binding and glycosyltransferase domains. This work makes a significant advance in the field with respect to (1) understanding flagellin glycosylation by FlmG, (2) making designer protein Ser/Thr-O-glycosyltransferases, and (3) bioinformatics analysis of genomes with respect to the Pse/Leg/Neu nonulosonic acid biosynthetic potential encoded. The findings will be of great interest to scientific audiences working in the areas of glycobiology and bacteriology. My area of expertise: Maf flagellin glycosyltransferases

    1. Author Response

      Reviewer #1 (Public Review):

      The software presented in this paper is well documented and represents a significant achievement that breaks new ground in terms of what is possible to render and explore in the web browser. This tool is essential for the exploration of SC2 data, but equally useful for the tree of life and other tree-like data sets.

      Thank you for reviewing my work and for this generous assessment.

      Reviewer #2 (Public Review):

      This manuscript describes a web-based tool (Taxonium) for interactively visualizing large trees that can be annotated with metadata. Having worked on similar problems in the analysis and visualization of enormous SARS-CoV-2 data sets, I am quite impressed with the performance and "look and feel" of the Taxonium-powered cov2tree web interface, particularly its speed at rendering trees (or at least a subgraph of the tree).

      Thank you for the kind words.

      The manuscript is written well, although it uses some technical "web 2.0" terminology that may not be accessible to a general scientific readership, e.g., "protobuf" (presumably protocol buffer) and "autoscaling Kubernetes cluster". The latter is like referring to a piece of lab equipment, so the author should provide some sort of reference to the manufacturer, i.e., https://kubernetes.io/.

      Thank you for flagging this. I have now replaced the colloquial "protobuf" with "protocol buffer". I have now provided a URL for Kubernetes. It is always difficult to judge how much to explain technical terms. I certainly agree that many people will be unfamiliar with, for instance, protocol buffers, but an explanation of what they are (which may not be particularly important for understanding Taxonium) can sometimes overshadow more important details. So my preference in that particular case is for an interested reader to research the unfamiliar term.

      In other respects, the manuscript lacks some methodological details, such as exactly how the tree is "sparsified" to reduce the number of branches being displayed for a given range of coordinates.

      This is an important point also raised by Reviewer 3. I have added a new section in the Materials and Methods which discusses this in some detail.

      Some statements are inaccurate or not supported by current knowledge in the field. For instance, it is not true that the phylogeny "closely approximates" the transmission tree for RNA viruses.

      I agree that this was an overly broad claim, and have softened it, now saying:

      "The fundamental representation of a viral epidemic for genomic epidemiology is a phylogenetic tree, which approximates the transmission tree and can allow insights into the direction of migration of viral lineages."

      Mutations are not associated with a "point in the phylogeny", but rather the branch that is associated with that internal node.

      I have changed this as suggested.

      A major limitation of displaying a single phylogenetic tree (albeit an enormous one) is that the uncertainty in reconstructing specific branches is not readily communicated to the user. This problem is exacerbated for large trees where the number of observations far exceeds the amount of data (alignment length). Hence, it would be very helpful to have some means of annotating the tree display with levels of uncertainty, e.g., "we actually have no idea if this is the correct subtree". DensiTree endeavours to do this by drawing a joint representation of a posterior sample of trees, but it would be onerous to map a user interface to this display. I'm raising this point as something for the developers to consider as a feature addition, and not a required revision for this manuscript.

      I entirely agree with this point. I have added a sentence in the discussion:

      "Even where sequences are accurate, phylogenetic topology is often uncertain, and finding ways to communicate this at scale, building on prior work [Densitree citation] would be valuable."

      The authors make multiple claims of novelty - e.g., "[...] existing web-based tools [...] do not scale to the size of data sets now available for SARS-CoV-2" and "Taxonium is the only tool that readily displays the number of independent times a given mutation has occurred [...]" - that are not entirely accurate. For example, RASCL (https://observablehq.com/@aglucaci/rascl) allows users to annotate phylogenies to examine the repeated occurrence of specific mutations. Our own system, CoVizu, also enables users to visualize and explore the evolutionary relationships among millions of SARS-CoV-2 genomes, although it takes a very different approach from Taxonium. Taxonium is an excellent and innovative tool, and it should not be necessary to claim priority.

      I agree that comparisons with existing tools are difficult and often provide a sense of unnecessary competition. I attempted to be quite careful in the specific section focused on comparison, but may have been less careful earlier on. The intent with this first sentence in the abstract was to provide a succinct description of the gap that Taxonium was developed to fill with "however, existing web-based tools for analysing and exploring phylogenies do not scale to the size of datasets now available for SARS-CoV-2". I have now removed the words "analysing and", focusing on the exploration of phylogenies. I think this new sentence is defensible in that valuable tools such as CoVizu intentionally do not explore a phylogeny directly but instead take a multi-level approach, and this new sentence better matches the comparisons in the paper. In the second sentence, I have removed the phrase "is the only tool that", which I agree adds little and may not be accurate, depending on one's interpretation of "readily". Thank you for these points.

      Although the source code (largely JavaScript with some Python) is quite clean and has a consistent style, there is a surprising lack of documentation in the code. This makes me concerned about whether Taxonium can be a maintainable and extensible open-source project since this complex system has been almost entirely written by a single developer. For example, usher_to_taxonium.py has a single inline comment (a command-line example) and no docstring for the main function. JBrowsePanel.jsx has a single inline comment for 293 lines of code. There is some external documentation (e.g., DEVELOPMENT.md) that provides instructions for installing a development build, but it would be very helpful to extend this documentation to describe the relationships among the different files and their specific roles. Again, this is something for the developers to consider for future work and not the current manuscript.

      This is an entirely fair comment. The version of Taxonium presented in the manuscript is "2.0", which is a new version built from scratch with considerably less technical debt than the version that preceded it. Its technical strengths are that (with the exception of the backend) it is relatively well-modularised into functional components. But the limitations that the reviewer notes with respect to commenting are entirely fair. What I would say is that in the time since this manuscript was submitted, several important features have been added by an external collaborator, Alex Kramer, most notably the Treenome Browser (https://www.biorxiv.org/content/10.1101/2022.09.28.509985v1). I hope that the ability of Alex to add these features with little need for support provides some evidence of Taxonium's extensibility. But I acknowledge there is room for improvement.

      Reviewer #3 (Public Review):

      The paper succinctly provides an overview of the current approaches to generating and displaying super-large phylogenies (>10,000 tips). The results presented here provide a comprehensive set of tools to address the display and exploration of such phylogenies. The tools are well-described and comprehensive, and additional online documentation is welcome.

      The technical work to display such large datasets in a responsive fashion is impressive and this is aptly described in the paper. The author rightly decides that displaying large phylogenies is not simply a matter of rendering "more nodes", and so in my eyes, the major advancement is the approach used to downsample trees on-the-fly so that the number of nodes displayed at one time is manageable. This is detailed only briefly (Results section, 1st paragraph, 2 sentences). I would like to see more discussion about the details of this approach.

      Thank you for this point, also raised by Reviewer 2. I have now added a lengthy section on this in the Materials and Methods, which I hope is helpful. The approach is not especially sophisticated, but it does the job and runs quickly.

      Examples that came up while exploring the tool: the (well implemented) search functionality reports results from the entire tree (e.g. in Figure 4, the number of red circles is not a function of zoom level), how does this interact with a tree showing only a subset of nodes?

      Yes, this is an important feature which I perhaps did not do justice to in the write-up. I have included in the new section in the Materials and Methods a paragraph discussing search results:

      "In order to ensure that search results are always comprehensive, but at the same time to avoid overplotting, we take the following approach::

      ● Searches are performed across every single node on the tree to select a set of nodes that match the search. The total number of matches is displayed in the client.

      ● If fewer than 10,000 matches are detected, these are simply displayed in the client as circles

      ● If more than 10,000 matches are detected, the results are sparsified using the method above, and then displayed.

      ● Upon zooming or panning, the sparsification is repeated for the new bounding box."

      How is the node order chosen with regards to "nodes that would be hidden by other nodes are excluded" and could this affect interpretations depending on the colouring used?

      This perhaps was slightly sloppy language which did not directly describe the implementation. I have now rephrased this to "only nodes that overlap other nodes are excluded", as we don't in fact consider a notion of z-index when doing this. The way the sparsification works (now better described) means that the nodes excluded are determined essentially by position and I don't foresee this introducing particular biases, but this was an insightful point to raise.

      Taxonium takes the approach of displaying all available data (sparsification of nodes notwithstanding). Biases in the generation of sequences, especially geographical, will therefore be present (especially so in the two main datasets discussed here - SARS-CoV-2 and monkeypox). This caveat should be made explicit.

      This is certainly true. I have added this new paragraph in the Discussion:

      "A further challenge is the vastly different densities of sampling in different geographic regions. Because Cov2Tree does not downsample sequences from countries which are able to sequence a greater proportion of their cases, the number of tips on a tree is not indicative of the size of an outbreak and in some cases even inferences of the directionality of migration may be confounded. There would be value in the development of techniques that allow visual normalisation of trees for sampling biases, which might allow for less biased phylogenetic representations without downsampling."

      Has the author considered choosing which nodes to exclude for sparsified trees in such a way as to minimise known sampling biases?

      The last sentence of the new paragraph above alludes to a sort-of-similar approach. I hadn't directly considered the approach the reviewer suggests. It is an interesting idea. The downsampling algorithm has to be very computationally inexpensive but it would be interesting to explore ways to do this. I am tracking this in https://github.com/theosanderson/taxonium/issues/437.

      Interoperability between different software tools is discussed in a technical sense but not as it pertains to discovering the questions to ask of the data. As an example, spotting the specific mutations shown in figure 3 + 4 is not feasible by checking every position iteratively; instead, the ability to have mutations flagged elsewhere and then seamlessly explore them in Taxonium is a much more powerful workflow. This kind of interoperability (which Taxonium supports) enhances the claim of "providing insights into the evolution of the virus".

      Thank you for flagging this point -- I am very excited by the growing ecosystem of interoperable tools. You are absolutely right that most of the insights Taxonium can bring into evolution rely also on this broader ecosystem. I have added a florid sentence in the concluding paragraph: "It forms part of an ecosystem of open-source tools that together turn an avalanche of sequencing data into actionable insights into ongoing evolution."

      The prosaic reason I don't discuss Taxonium's interoperability features in more detail in this manuscript is that it aims to describe the version of Taxonium I initially developed, and these features were developed collaboratively by a broader group later on (and after deposition of this preprint).

      Taxonium has been a fantastic resource for the analysis of SARS-CoV-2 and this paper fluently presents the tool in the context of the wider ecosystem of bioinformatic tools in use today, with the interoperability of the different pieces being a welcome direction.

    1. We have spent too much time on inward-lookingdebates that pit distant against close reading, and not enough time understandingconnections to other disciplines.

      Of course with innovation comes back lash, it is within human nature to want to not have/want change.

      Ie. Technology such as the newest phones and older generations not wanting to learn/ not knowing how to understand then.

      We tend to pit every new idea to an assortment of way/methods/things we already know rather than exploring them for what they were thought to be made for. I think seeing that there was an inward debate on the subject wasn't much of surprise but rather a given! Knowing of critics such as Stephen Marche and Stanly Fish, it is easy to see way it was the way it is.

      I do however wonder which other disciplines we could better connect to? And whether it would be a better use of our time just understanding distant reading at its surface level or to keep a comparative with others too? (I would say comparing to others may help in the overall scheme of things).

    1. Winston Churchill's "Blood, Toil, Tears, and Sweat" Speech On Friday evening last I received from His Majesty the mission to form a new administration. It was the evident will of Parliament and the nation that this should be conceived on the broadest possible basis and that it should include all parties. I have already completed the most important part of this task. A war cabinet has been formed of five members, representing, with the Labour, Opposition, and Liberals, the unity of the nation. It was necessary that this should be done in one single day on account of the extreme urgency and rigor of events. Other key positions were filled yesterday. I am submitting a further list to the king tonight. I hope to complete the appointment of principal ministers during tomorrow. The appointment of other ministers usually takes a little longer. I trust when Parliament meets again this part of my task will be completed and that the administration will be complete in all respects. I considered it in the public interest to suggest to the Speaker that the House should be summoned today. At the end of today's proceedings, the adjournment of the House will be proposed until May 21 with provision for earlier meeting if need be. Business for that will be notified to MPs at the earliest opportunity. I now invite the House by a resolution to record its approval of the steps taken and declare its confidence in the new government. The resolution: "That this House welcomes the formation of a government representing the united and inflexible resolve of the nation to prosecute the war with Germany to a victorious conclusion." To form an administration of this scale and complexity is a serious undertaking in itself. But we are in the preliminary phase of one of the greatest battles in history. We are in action at many other points — in Norway and in Holland — and we have to be prepared in the Mediterranean. The air battle is continuing, and many preparations have to be made here at home. In this crisis I think I may be pardoned if I do not address the House at any length today, and I hope that any of my friends and colleagues or former colleagues who are affected by the political reconstruction will make all allowances for any lack of ceremony with which it has been necessary to act. I say to the House as I said to ministers who have joined this government, I have nothing to offer but blood, toil, tears, and sweat. We have before us an ordeal of the most grievous kind. We have before us many, many months of struggle and suffering. You ask, what is our policy? I say it is to wage war by land, sea, and air. War with all our might and with all the strength God has given us, and to wage war against a monstrous tyranny never surpassed in the dark and lamentable catalogue of human crime. That is our policy. You ask, what is our aim? I can answer in one word. It is victory. Victory at all costs — Victory in spite of all terrors — Victory, however long and hard the road may be, for without victory there is no survival. Let that be realized. No survival for the British Empire, no survival for all that the British Empire has stood for, no survival for the urge, the impulse of the ages, that mankind shall move forward toward his goal. I take up my task in buoyancy and hope. I feel sure that our cause will not be suffered to fail among men. I feel entitled at this juncture, at this time, to claim the aid of all and to say, "Come then, let us go forward together with our united strength."

      Important speech by Winston Churchhill

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The reviews are on balance an accurate, thoughtful, thorough assessment of the manuscript. We appreciate the careful engagement with the B cell differentiation aspect of our work. We identify 2 major critiques from the reviews:

      1. The manuscript should make stronger connections with existing literature on ____in-vitro _and _in-vivo ____B cell differentiation. We agree the manuscript should be revised to interact more holistically and carefully with relevant B cell differentiation research. In this respect, the reviewers both help by pointing to high-quality and relevant literature that will be discussed and cited.

      The cytokine mixture we used on the B cells was not defined / described in the manuscript. This fact hinders the interpretation of the data because B cells will respond to diverse stimuli in quite different ways.

      We agree this hinders interpretation of the data, and the reviewers bring up astute points about different types of stimuli (TD vs. TI vs. TLR vs. BCR). Unfortunately, the manufacturer of the product, Stem Cell Technologies, will not disclose exactly what is in the product. Given we are in strong agreement with the reviewers on this point, we analyzed the cytokine contents of the cocktail and our cell culture supernatants using a luminex cytokine panel. We present a discussion of our findings on this data in a supplementary note and figure. We acknowledge this analysis is non-exhaustive, because it does not include possible additions of non-cytokine stimulants. However, we maintain it adds much clarity to the interpretation of the data.

      We note that the contents of the stimulation cocktail are knowable and well-defined. These attributes are in contrast to almost all B cell stimulation protocols of which are aware. Typical stimulation protocols use various types of feeder cells, cytokines, and FBS (Fetal Bovine Serum). In particular, the feeder cells and FBS, are highly variable between labs, lots, and even experiments. FBS has a myriad of issues which are described here (____https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8349753/____). Major variability, from genomic to phenotypic, has been described in laboratory cell lines like the ones used as feeder cells. With respect to B cells specifically, large differences in B cell activation programs are observed between lots of FBS, as described here (____https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7854248/#r5____). Additionally, we have observed the presence of bovine viruses and other contaminants in FBS (unpublished data). Thus, the stimulation protocol we used is reproducible and robust in ways generally unseen by us in B cell stimulation literature. In summary, we view this cocktail as useful in a similar way to how FBS is useful to biologists – a major difference being that this cocktail is better defined and controlled. We provide similar thoughts in our supplementary note.

      A final general point we will make is about the significance of our work, which appears to be lost on Reviewer #1. Similar technical and conceptual advances by our lab have been cited 1000s of times. Thus, we think the impact of our scientific approach speaks for itself. Many of our results confirm and expand on previous literature about B cells. We deliberately chose to make this novel technical and conceptual advance in the well-studied system of B cell differentiation. This allows us to integrate our findings with prior literature and helps validate the general approach. Reviewer #1 has performed a scholarly service by independently verifying our findings are coherent with existing literature, and we thank them for that.

      In response to the reviews, we have edited the manuscript to reference even more of the papers in the field which report similar findings. Thus, our concordance with prior literature should be viewed as a strength of the manuscript. It shows readers of the manuscript the conceptual framework we use here is valid and can generate similar insights in less well-studied systems. For example, the approach developed here could be used in non-B cells, non-human immune systems, or even non-model organisms. In response to the reviewers critique, we modified the discussion of our work in multiple places to emphasize these points.

      Description of the planned revisions
      

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      1) Which B cell activation protocol was used? No information is provided in the main text or supplementary information. Yet, this information is key to fully understand many of the conclusions of this work (e.g., ... memory B cells are intrinsically two-fold more persistent in vitro (2A)), which largely depend on the nature of the stimuli used in the in vitro B cell culture.

      We used the B cell activation protocol developed by StemCell Technologies as described in our methods section. We agree the reader and scientific community would benefit from additional information about this cocktail. To this end, we added a discussion of the cocktail to the supplementary information. We also used a cytokine analysis panel to analyze the cocktail, which provided detailed although non-exhaustive information about what is in the cocktail.

      2) It would be informative to use more than one B cell activation program, e.g., CD40L with or without a cytokine as well as CD40L vs. CpG-DNA. Authors make broad statements about B cell fates without discussing the impact of a given signal on a given B cell fate. For instance, do memory B cells follow the same differentiation program upon stimulation with CD40L, IL-4 or a combination of CD40L and IL-4? How about differences between a TD signaling program such as that provided by CD40L and IL-10 and TI signaling program such as that provided by CpG-DNA and IL-10?

      This is a good point. We agree stimulation using a panel of different agents would be a worthwhile experiment. It stands as a goalpost for future studies. Currently, performing single cell RNA sequencing on so many samples is both beyond the scope of this manuscript and very resource intensive. ____

      3) Page 3, first line: After low quality and non-B cells (Fig S1A & B). What does this statement mean? The sentence seems incomplete.

      Thank you for catching this typo. It is now clarified in the manuscript that we removed these cells bioinformatically.

      4) First, we noted that non-B cells present in the input rapidly became undetectable by day 4, which shows the specificity of the cytokines for B cell expansion. Which cytokines are we talking about? No detail is provided.

      We now provide our analysis of the stimulation cocktail, in Supplementary note 1 and Supplementary figure 1A. We still believe it is an interesting observation that this cocktail specifically stimulates B cells because many cytokines are not specifically B cell division signals, there were some impurities in the input population, and many cytokines are produced by the cultured cells themselves.

      5) Plasmablasts were not distinguished from plasma cells.

      We agree this is an interesting and important distinction to make. We have now distinguished between these classes of cells.

      6) Critically, we observed no appreciable evidence of hypermutation in vitro (S2C), consistent with prior literature (Bergthorsdottir et al. 2001). This statement is vague, misleading and likely inaccurate for the following reasons. (a) The B cell culture conditions used by the authors are completely unknown. (b) It was shown that SHM can be achieved under specific in vitro B cell culture conditions that include the presence of activated CD4+ T cells (PMID: 9052835; PMID: 10092799; PMID: 10878357; PMID: 12145648). Did authors try to recapitulate those culture conditions?

      We see how this statement could be misunderstood. We only claim not to observe evidence of hypermutation in our specific culture conditions, which is important for the inferences we make. We added language to make this more clear.

      We did not try to recapitulate the conditions in the references supplied by the reviewer. We note that these references use cell lines and not B cells. While there is immensely valuable work done on cell lines, they behave very differently from actual cells and these findings may not be relevant to our human B cells.

      7) Some of the reported findings are repetitive of previously published results and provide no additional new information. For example:

      1. a) "Interestingly, we found mutated B cells were far more likely to express genes involved in T cell interaction (2B), suggesting Memory B cells are intrinsically licensed to enter an inflammatory state which activates T cells". This evidence is already published (PMID: 7535180 among many other published studies).

      We will cite this paper, which is a landmark study. We don’t claim we are the first to discover a propensity of mutated B cells to present help to T cells, but note that we were able to observe this fact via lineage tracing in a single experiment, which is a conceptual and technical advance. Additionally, we report an entire transcriptional module of genes which are upregulated in memory B cells vs. Naive B cells exposed to the same stimulus. This adds to the systematic understanding of the Memory B Cell activation program.

      1. b) "Instead, Naive B cells were biased toward expressing lectins and CCR7, suggesting Naive B cells are intrinsically primed to home into the lymphatic system and germinal centers (2B)". This evidence is already published (PMID: 9585422 among many other published studies).

      While this is an interesting and important paper referenced by the reviewer, we are unable to find anything similar to our claim about naive B cells in the reference provided. The investigators do not discuss intrinsic differences between memory and naive subsets when responding to the same stimulus.

      8) We quantified the in vitro dynamics of CSR through the lens of mutation status, which revealed strongly different fate biases between germline and mutated cells (2D). Most strikingly, B cells which switched to IGHE were almost exclusively derived from germline progenitors: the ratio of germline IGHE cells to mutated IGHE cells was (8-fold - inf, 95 % CI). Also this evidence is not novel (PMID: 34050324 among other published studies) and, again, must reflect the presence of specific culture conditions that remain completely undisclosed. This is incredibly confusing.

      Thank you for providing this reference, we were not aware of this interesting study. These studies are quite different and complementary. Differences between these studies likely reflect the fact that their B cells are isolated from a niche, rather than generated ____in-vitro_. Most of the tissue-resident cells in their study are quite mutated, and thus are not the Naive B cells we are making a claim abountj. In fact, despite their claim of low mutational load, these cells would fall into the “mutated” or even “heavily mutated” categories we defined in our paper. Cells with mutation levels of 5% are not thought to be Naive in any classification scheme. Our study showed that, _in vitro____, IGHE B cells effectively came exclusively from germline progenitors, their study shows no such result. The novelty of this finding was appreciated by reviewer 2.

      9) Authors should mention that non-switched memory B cells include IgDlowIgM+CD27+ and IgD-IgM+CD27+ memory B cells. Some authors define these distinct memory B cell fractions as marginal zone (MZ) or MZ-like B cells (please, notice that splenic MZ B cells recirculate in humans) and IgM-only B cells, respectively (PMID: 28709802; PMID: 9028952; PMID: 10820234; PMID: 11158612; PMID: 26355154; PMID: 15191950; and PMID: 24733829 among many other published studies).

      We appreciate these points. We attempted to classify our B cells within this taxonomy and found no such separation clearly exists in single-cell RNA-seq profiles. Instead, we opted to re-classify our data with a state-of-the-art algorithm called celltypist (DOI: 10.1126/science.abl5197____ )____ which harmonizes cell annotations across a growing number of single-cell RNA sequencing studies. While this classification system is not currently mutually exclusive / completely exhaustive, we believe using this system provides standardization and data availability that are key for sharing results. As single-cell RNA-seq and flow/mass cytometry harmonize their classification systems, anyone should be able to transfer their preferred classification scheme to the cells profiled here.

      10) Thus, CSR from IGHM cells did not meaningfully contribute to the abundance of IGHA+ cells in the population. Also this conclusion may be misleading and/or inaccurate. Indeed, an efficient class switching to IgA requires the exposure of naïve B cells to the cytokine TFG-beta in addition to a robust TD (CD40L) or TI (CpG DNA or BAFF or APRIL) co-signal. Was TGF-beta present in this culture?

      This is a good point about TGF-beta and switching to IgA. Here is a clear example of the novelty and power of our approach, as well as the benefits of using a well-characterized system such as B cell differentiation. Lineage tracing clarifies between two explanations for why there are IgA cells in the output population. One explanation is that non-IgA B cells in the input switch to IgA, driven by TGF-beta. Another explanation is that IgA cells in the input expand modestly and account for IgA cells in the output. Lineage tracing offers clear evidence that the latter explanation is true. Following from this, our approach allows us to make a strong inference that TGF-beta is not present in the incompletely determined cytokine mixture. We are not sure how this conclusion may be misleading or inaccurate, as it is a clear and simple description of our data, not a claim about what factors are necessary for switching.

      11) In contrast, we noted that many intraclonal class-switching events appeared to be directly from IGHM to IGHE. Explanations involving unobserved cells with intermediate isotypes notwithstanding, these data illustrate the relative ease with which B cells can switch directly to IGHE. It is very difficult to interpret this statement, as no information regarding the B cell-stimulating conditions used is provided. In addition, relevant literature is not quoted (e.g., PMID: 34050324).

      We clarify our discussion here to claim the ease with which peripheral blood IGHM B cells switch to IGHE. Again, lineage tracing has allowed us to distinguish between two very different population-level phenomena. One explanation is that undetected IGHE+ progenitors in the input population expanded rapidly and account for the IGHE+ cells. Another explanation is that cells class-switch to IGHE. Our data are consistent with the latter. We note that this validates the conceptual use of lineage tracing to understand rapid population dynamics in immune responses and cell differentiation protocols. This is a strength of our manuscript. We appreciate the the reviewer has furnished relevant studies, which we will cite.

      12) Our data for IGHE cells contrasts with in vivo data which show IgE B cells to be: (1) very rare, (2) apparently derived from sequential switching (e.g. from IgG1 to IgE) (Horns et al., 2016; Looney et al., 2016), and (3) often heavily hypermutated (Croote et al., 2018).

      While this reviewer agrees with the first comment (switching to IgE is relatively rare in vivo, at least in healthy individuals), the other statements are quite inaccurate. Indeed, unmutated extrafollicular naïve B cells from tonsils and possibly other mucosal districts directly class switch from IgM to IgE in healthy individuals, thereby generating a low-affinity IgE repertoire. In principle, low-affinity IgE antibodies may protect against allergy by competing with high-affinity IgE specificities. In allergic patients, high-affinity IgE clones emerge from class-switched and hypermutated memory B cells that sequentially switch from IgG1 or IgA1 to IgE as a result of specific environmental conditions, including an altered skin barrier (PMID: 22249450; PMID: 30814336; PMID: 32139586).

      Moreover, in contrast to what stated by authors, sequential IgG1/IgA1-to-IgE class switching mostly occurs in allergic patients but is less frequent in healthy individuals, where IgE specificities are less mutated (PMID: 30814336). Along the same lines, IgE is heavily mutated only in allergic individuals with significant molecular evidence of sequential IgG1/IgA1-to-IgE class switching (PMID: 30814336; PMID: 32139586). Overall, the data provided by Swift M. et al. are largely confirmatory of previously published evidence.

      We appreciate the clarification of this complex field and will cite the relevant literature. We also agree with the reviewers assessment that our data are validated by other approaches and groups. We see that our discussion of IgE B cells should have included that caveat that we are discussing IgE B cells detected in the peripheral blood. We have restricted claim to the suggestion that if our conditions mimic such niches where B cells switch to IgE, there are clearly efficient mechanisms which limit the amount of circulating IGHE B cells mechanisms in comparison to other isotypes.____

      Taken together, these data suggest that while direct switching to IGHE from Naive progenitors is trivial in vitro, niche factors or intrinsic death programs efficiently limit their generation or lifetime in vivo." I cannot understand this conclusion, which seems to contradict earlier statements.

      We hope we have clarified via the above comment.

      13) I am not sure I learned much regarding the "cell-intrinsic" fate bias and transcriptional memory of B cells after reading this elegantly presented but confusing and superficially discussed manuscript (please, see also comments 15-23).

      We understand the reviewer is confused about various aspects of our manuscript and appreciate the opportunities to clarify. We show cells with broad identities (such as germline vs. mutated or naive vs. memory) respond differently to the same stimulus. These are cell intrinsic fate biases. We quantify them and provide statistical bounds on the effect sizes of these differences, which to our knowledge has not been done. We agree with the reviewer that in the case of memory and naive B cells, much is already known about their biases. We recapitulate some of this knowledge, while adding a quantitative and an unbiased transcriptomic lens with which to view the biases. However, our analysis moves beyond cell types broadly defined, and focuses on the concept that each clone is a cell state or identity, where some of the identity may be faithfully propagated over generations and other information may not be. To this end, we tracked the transcriptome of clones during differentiation. We show that B cell clones share highly similar cell fates, implicating cell-intrinsic heterogeneity as a major contribution to diversity in immune responses. We note this reviewer did not critique this aspect of our work. The review also did not critique Figure 3 or 4, in which we present a quantitative analysis of which transcriptional programs are maintained by B cells and contribute to their clonal identity. Finally, via our analysis of human long-lived plasma cells, we report these transcriptional identities are observed in-vivo, over long time scales. This type of cell-intrinsic bias has not been studied or described to our knowledge. These findings were of particular interest to reviewer 2 and other readers of the manuscript.

      MINOR COMMENTS

      Thank you for reading the manuscript carefully and providing these comments and observations. We have fixed all clerical errors that were pointed out. We also responded to some of these minor comments here, and made changes to the manuscript to clarify.

      1) Figures 1B, S1C and S1D are not referred to in the text. x

      2) 2B in the Text is 2D in the Figure. x

      3) 2D in the Text is 2E in the Figure. x

      4) Figure 2D seems to show only 10 genes. Please, clarify.

      We clarify in the manuscript that we present the top differentially expressed genes

      5) 2E in the text is 2F in the Figure. x

      6) Figure S3B is not indicated in the text. x

      7) Figure 3E is not indicated in the text. x

      8) Figure S4A is not indicated in the text. x

      9) In some sections of the text, Figure panels are not sequentially discussed, which makes the text very difficult to follow. x

      Reviewer 2:

      Major comments:

      On p.3 the authors assume that a B cell with an unmutated BCR in the time course arose from a naive B cell progenitor. However, it is also possible that it arose from a IgM memory B cells since they also contain a non-negligible proportion of cells with 0 mutations. This was initially seen already in the Klein, J Exp Med, 1998 paper and later confirmed by e.g. Weller et al, J Exp Med, 2008 and Wu et al, Front Immunol, 2011. And since the authors herein and others have demonstrated that IgM memory B cells have a high proliferative capacity it is possible that IgM memory B cells are overrepresented among those unmutated BCRs seen in the cultures.

      The finding that IgM memory B cells are highly proliferative is not novel. It has been demonstrated by other groups before and one good example is Seifert et al, PNAS, 2015 where IgM memory B cells proliferated significantly more to BCR stimulation than naive or IgG memory B cells. However, it is also shown that IgG memory B cells are more responsive to TLR9 stimulation than IgM memory B cells as demonstrated by e.g. Marasco et al, Eur J Immunol, 2017. This is not discussed by the authors and should be added into the discussion for context of their finding by scRNAseq methods.

      These are astute points. We incorporated a more nuanced discussion of the prior literature about highly proliferative IgM memory B cells, which have been reported before. We also added a figure which identifies the genes associated with proliferative clones in Figure 3d, which adds to our understanding of the gene regulatory networks which govern IgM memory B cell behavior. We appreciate the reference to the Seifert et al paper, which is relevant and high quality work. We concur that a discuss of Marasco would be helpful, especially because it is unknown if a TLR9 agonist is in the stimulation cocktail, but their data would suggest there is not.

      The notion that a memory transcriptional program can be induced without SHM is not novel and this should be brought up in the discussion. One paper showing a memory transcriptional program in unmutated memory B cells is Kibler et al, Front Immunol, 2022.

      We were not aware of this literature and have now cited it in our discussion of this finding.

      The observation that memory B cells are more likely to enter an inflammatory state and support T cells has been suggested by other groups (Seifert et al, PNAS, 2015; Magri et al, Immunity, 2017, Grimsholm et al, Cell Reports, 2020).

      We have now cited and discussed a number of papers which contain similar findings. We note that we add to the holistic understanding of this phenomenon via our single cell transcriptomic approach.

      Please provide the age distribution of the peripheral blood samples as well.

      We have now provided the age distribution of the peripheral blood samples

      Please show flow cytometry analysis of the cultures to assist in assessing subset distribution, viability and plasma cell differentiation for each time point. This can be provided as supplementary information.

      We did not use flow cytometry for subset distribution and measurements of differentiation per se, only to exclude non-viable cells and we have now made this clearer in the methods section. We also now include representative plots show our sorting strategy.

      The stimulation cocktail used for this study, what does it contain? This needs to be specified in the manuscript and not only referring to the manufacturer. This has major impact on the results since different stimulatory agents will induce different pathways.

      This is a valid point that we addressed in our response to reviewer 1. See supplementary note 1 and Figure S1A for our analysis of the stimulation cocktail.

      Minor comments:

      Please avoid the term plasma B cells, does it refer to plasmablasts and/or plasma cells?

      Thank you for the suggestion, we have modified our language to refer to plasmablasts and plasma cells separately.

    1. Author Response

      Reviewer #1 (Public Review):

      In this manuscript by Kim et al., the authors use live-cell imaging of transcription in the Drosophila blastoderm to motivate quantitative models of gene regulation. Specifically, they focus on the role of repressors and use a 'thermodynamic' model as the conceptual framework for understanding the addition and placement of the repressor Runt, i.e. synthetic insertion of Runt repressor sites into the Bicoid-activated hunchback P2 enhancer. Coupled with kinetic modeling and live-cell imaging, this study is a sort of mathematical enhancer bashing experiment. The overarching theme is measuring the input/output relationship between an activator (bicoid), repressor (runt), and mRNA synthesis. Transcriptional repression is understudied in my opinion. One finding is that the inclusion of cooperativity between trans-acting factors is necessary for understanding transcriptional regulation. Most, if not all, of the tools used in this paper have been published elsewhere, but the real contribution is a deep, quantitative dissection of transcriptional regulation during development. As such, the only real questions for this referee are whether the modeling was done rigorously to produce some general biological conclusions. By and large, I think the answer is yes.

      We thank the reviewer for this thoughtful evaluation of our work. We agree with the reviewer’s assessment that transcriptional repression, especially the quantitative dissection of transcriptional repression, is understudied compared to transcriptional activation.

      Comments:

      Fig. 6 was the most striking figure for this referee, specifically that different placements of Runt molecules on the enhancer lead to distinct higher order interactions. I am wondering if the middle data column in Fig. 6 represents a real difference from the other two, and if so, it seems that the positioning - as opposed to simply the stoichiometry - is essential in cooperativity. This conclusion implies that transcriptional regulation is more precise than what some claim is just a mushy ball of factors close to a promoter. In other words, orientation may matter. Proximity may matter. Interactions in trans matter.

      We thank the reviewer for pointing out a feature of our data that we did not emphasize enough originally. Indeed, the construct in the middle column, which we termed [101], could be better recapitulated with the simplest model of zero free parameters than the other two constructs. As the reviewer pointed out, this raises an interesting question about the “grammar” of an enhancer: the placement and orientation of binding sites for transcription factors might matter yet we do not have a clear understanding of the logic. We have now incorporated a discussion of this topic in the Discussion section.

      There needs to be at least one prediction which is validated at the level of smFISH / mRNA in the embryo. Without detracting from the effort the authors have expended in looking directly at transcription, if the effects can't be felt by the blastoderm at the level of mRNA/cell, it becomes difficult to argue for the relevance to development. Also, I feel there is little chance that these measurements can be quantitatively replicated unless translated to the level of total protein or mRNA. Such a measurement (orthogonal quantitative confirmation of the repressor cooperativity result) would also assuage my concern about the time averaging as shown in Fig. S3.

      Our study focused on predicting the initial rate of transcription because it is the measurable quantity that most directly relates to the binding and action of the transcriptional activators and repressors used in this study. We argue that the action of transcription factors would be more accurately assessed by monitoring the rate of transcription, rather than the accumulated mRNA, which could be confounded by the dynamics of the whole transcription cycle—initiation, elongation and termination—as well as nuclear export, diffusion and degradation of transcripts. We are, of course, excited to eventually be able to predict a whole pattern of cytoplasmic mRNA over space and time from knowledge of the enhancer sequence. However, if we cannot predict the initial rate of RNA polymerase loading dictated by an enhancer, we argue that there is little hope in predicting such cytoplasmic patterns. We emphasized this point in the Discussion (Line XX-YY). Regardless, to assuage the reviewer’s concern, we have performed additional analyses to assess the effect of repression at the level of accumulated mRNA.

      First, we have quantified the accumulated mRNA during nuclear cycle 14, which is the time window that we have focused on in this study. To make this possible, we have integrated the area under the curve of MS2 time traces which has been already shown to be a reporter of the total amount of mRNA produced by FISH (Garcia et al., Current Biology 23:2140, 2013;Lammers et al., PNAS 17:836, 2020). This integration reporting on accumulated mRNA is now shown for all constructs in the presence and absence of Runt protein in the new Figure S17. This figure clearly shows that the consequences of repression are present in the blastoderm, not just at the level of transcriptional initiation, but also at the level of accumulated mRNA.

      We then compared the accumulated mRNA profiles shown in Figure S17 to the initial rate of RNAP loading at each position of the embryo along the anterior-posterior axis for all constructs in the presence and absence of Runt protein. These new results are shown in a new figure, Figure S19. Interestingly, we saw a good correlation (Pearson correlation coefficient of 0.90) between these two metrics. Thus, we argue that our conclusion that higher-order cooperativity is necessary to account for the initial rate of RNA polymerase loading would still hold for predicting the accumulated mRNA.

      Reviewer #3 (Public Review):

      The authors have presented results from carefully planned and executed experiments that probe enhancer-drive expression patterns in varying cellular conditions (of the early Drosophila embryo) and test whether standard models of cis-regulatory encoding suffice to explain the data. They show that this is not the case, and propose a mechanistic aspect (higher order cooperativity) that ought to be explored more carefully in future studies. The presentation (especially the figures and schematics) are excellent, and the narrative is crisp and well organized. The work is significant because it challenges our current understanding of how enhancers encode the combinatorial action of multiple transcription factors through multiple binding sites. The work will motivate additional modeling of the presented data, and experimental follow-up studies to explore the proposed mechanisms of higher order cooperativity. The work is an excellent example of iterative experimentation and quantitative modeling in the context of cis-regulatory grammar. At the same time, the work as it stands currently raises some doubts regarding the statistical interpretation of results and modeling, as outlined below.

      We thank the reviewer for noting the significance of our work. We tried our best to address the concerns of the reviewer regarding the statistical interpretation of results and theoretical modeling throughout our responses below.

      The results presented in Figure 5 are used to claim that the data support (i) an unchanging K_R regardless of the position of the Runt site in the enhancer and (ii) an \omega_RP that decreases as the site goes further away from the promoter, as might be expected from a direct repression model. This claim is based on only testing the specific model that the authors hypothesize and no alternative model is compared. For instance, are the fits significantly worse if \omega_RP is kept constant and the K_R allowed to vary across the three sites. If different placements of the Runt site can result in puzzling differences in RNAP-promoter interaction, it seems entirely possible that the different site placements might result in different K_R, perhaps due to unmodeled interference from bicoid binding. Due to these considerations, it is not clear if the data indeed argue for a fixed K_R and distance-dependent \omega_RP.

      We apologize for the lack of justification in assuming that Kr remains constant and wrp varies depending on the position of the Runt binding sites. Following the reviewer’s suggestion, we tested the alternative scenarios where we either fix or vary different combinations of wrp and Kr for our one-Runt binding site constructs. The result is now shown in a new figure, Figure S16. In short, as reported by the Akaike Information Criterion (AIC) in Figure S16F, the MCMC fit explains the data best in the scenario of fixed Kr and different wrp values for one-Runt binding site constructs. Furthermore, we also performed the MCMC inference in the case where we varied both Kr and wrp values across constructs. From this analysis, we obtained similar values of Kr while having different values of wrp across constructs as shown in Figure S16G. Overall, we believe that this evidence strongly supports our assumption of having consistent Kr values but different wrp values for the one-Runt binding site constructs.

      Results presented in Figure 6 make the case that higher order cooperativity involving two DNA-bound molecules of Runt and the RNAP is sufficient to explain the data. The trained values of such cooperativity in the three tested enhancers appear orders of magnitude different. As a result, it is hard to assess the evidence (from model fits) in a statistical sense. Indeed, if all of the assumptions of the model are correct, then using the high-order cooperativity is better than not using it. To some extent, this sounds statistically uninteresting (one additional parameter, better fits). It is not the case that the new parameter explains the data perfectly, so some form of statistical assessment is essential.

      The inferred cooperativity values are indeed orders of magnitude different. However, the cooperativity terms can be also written as “w = exp(-E/(kBT))”, where the E is the interaction energy, kB is the Boltzmann constant, and T is the temperature. As a result, we should compare the magnitude of the different cooperativities on a log-scale. In brief, the interaction energies wrr from the three two-Runt binding site constructs range between 0 and 1kBT, and the higher-order cooperativity wrrp has an energy between -2 and 4kBT. Interestingly, these energies are of the same order of magnitude as the interaction energies typically reported for bacterial transcription factors (e.g., Dodd et al., Genes and Development 18:344-54, 2004). It is important to note that our inferred interaction energies could be either positive or negative, suggesting that both cooperativity and anti-cooperativity can be at play depending on the architecture of the two Runt binding sites. We now report on these interactions in the language of energies Table S1 and elaborate on this in the Discussion section (Line XX-YY).

      Finally, following the reviewer’s suggestion on statistical assessment of whether addition of parameters indeed explains the data better, we adopted the Akaike Information Criterion (AIC) as a metric to compare different models used in Figure 6 and now show the results in a new panel, panel G. Briefly, AIC is calculated by assessing the model’s ability to explain the data while penalizing for having more parameters. The smaller the AIC value is, the better the model explains the data. As we have claimed, the AIC showed a dramatic decrease when adopting the higher-order cooperativity as shown in Figure 6G. Thus we argue that the addition of higher-order cooperativity, while not being able to completely explain the data, is indeed capable of increasing the agreement between experiments and theory across all our two-Runt site constructs.

      Moreover, it is not the case that the model structure being tested is the only obvious biophysics-driven choice: since this is the first time that such higher order effects are being tested, one has to be careful about testing alternative model structures, e.g., repression models that go beyond direct repression and pairwise cooperativity that goes beyond the traditional approach of a single (pseudo)energy term.

      We agree with the reviewer that alternative models with different mechanisms of repression should be mentioned. We have clarified this point further in Discussion (Line XX -YY). In summary, we tested both “competition” and “quenching” models of repression as proposed in Gray et al, (Genes and Development 8:1829, 1994). Interestingly, Figure S5 shows that the “competition” model gives a worse fit compared to the “direct repression” and “quenching” models for the one-Runt binding site cases. We further tried to test these alternative models in the case of two-Runt binding sites constructs. The result is shown in Figure S7 (competition) and S8 (quenching). These figures also reveal that the “competition” model underperformed compared to the “direct repression” or “quenching” models. For the “quenching” model to fit the data, we also had to invoke higher-order cooperativity that is beyond pairwise cooperativity. Thus, we believe that the requirement of higher-order cooperativity holds regardless of the choice of the specific model. Of course, our models of repression are very likely an oversimplification of how repressors actually work. However, given that these simple models have been a prevalent choice of proposed mechanisms for repression in the field of transcriptional repression for the past decades, we believe that the significance of our work lies in the fact that we challenged these models by turning them into precise mathematical statements (in the form of widespread thermodynamics models) and confronting them with quantitative data.

      The general theme seen in Figure 6 is seen again in Figure 7, when a 3-site construct is tested: model complexities inferred from all of the previous analyses are insufficient at explaining the new data, and new parameters have to be trained to explain the results. The authors do not seem to claim that the higher order cooperativity terms (two parameters) explain the data, rather that such terms may be useful.

      We agree that our previous approach was confusing. Figure 7A indeed incorporated all inferred parameters from the previous rounds of inference (Kb, wbp, p, R, as well as Kr, wrp, wrr, and wrrp). However, it is clear that this set of parameters, even including the higher-order cooperativity from two-Runt binding sites cases, was not enough to explain the data from three-Runt binding sites case. Thus, we had to invoke another free parameter, which we termed wrrrp, to explain the data. We have revised Figure 7B such that it is now showing the “best” MCMC fit which explains the data quite well (instead of just showing the “improvement” of fits).

    1. Author Response

      Reviewer #1 (Public Review):

      This paper introduces a new statistical framework to study cellular lineages and traits. Several new measures are introduced to infer selection strength from individual lineages. The key observation is that one can simply relate cumulants of a fitness landscape to population growth, and all of this can be simply computed from one generating function, that can be inferred from data. This formalism is then applied to experimental cell lineage data.

      I think this is a very interesting and clever paper. However, in its current form the paper is very hard to read, with very few explanations beyond the mathematical observations/definitions, which makes it almost unreadable for people outside of the field in my opinion. Some more intuitive explanations should be given for a broader audience, on all aspects : definitions of fitness « landscape », selection strength(s), connections between cumulants and other properties (including skewness) etc... There are many new definitions given with names reminiscent of classical concepts in evolutionary theory, but the connection is not always obvious. It would be great to better explain with very simple, intuitive examples, what they mean, beyond maths, possibly with simple examples. Some of this might be obvious to population geneticists, and in fact some explanations made in discussion are more illuminating, but earlier would be much better. I give more specific comments below.

      We thank the reviewer for calling our attention to the lack of accessible explanations on the significant terms and quantities in this framework. Following the suggestion in the comments below, we added Box 1, providing intuitive and plain explanations on the terms of fitness, fitness landscape, selection, selection strength, and cumulants. In each section, we explain the standard usage of these terms in evolutionary biology and clarify the similarities and differences in this framework. We also added a figure to Box 1 and provided a schematic explanation of the relationships among chronological and retrospective distributions, fitness landscapes, and selection strength. We believe that these explanations and a figure would better clarify the meanings and functions of these quantities.

      Major comments :

      1) the authors give names to several functions, for instance before equation (1) they mention « fitness landscape », then describe « net fitness » , which allows the authors to define « fitness cumulants ». Later on, a « selection » is defined. Those terms might mean different things for different authors depending on the context, to the point there are sometimes almost confusing. For instance, why is h a « landscape » ? For me, a landscape is kind of like a potential, and I really do not see how this is connected to h. « fitness cumulants » is particularly jargonic. There are also two kinds of selection strengths, which is very confusing. I would recommend that the authors make a glossary of the term, explain intuitively what they mean and maybe connect them to standard definitions.

      We appreciate the suggestion of making a glossary of the terms. Following the suggestion, we added Box 1 to provide intuitive and plain explanations of the terms used in this framework.

      In Box 1, we explain why we called h(x) a fitness landscape, referring to its standard usage in evolutionary biology. In evolutionary biology, fitness landscapes (also called adaptive landscapes) are visual representations of relationships between reproductive abilities (fitness) and genotypes. The height of landscapes corresponds to fitness. Since constructing "genotype space" is usually difficult, fitness is often mapped on an allele frequency or phenotype (trait) space to depict a "landscape." Fitness landscapes introduced in our framework are analogous to those in evolutionary biology in that fitness differences are mapped on trait spaces. Although fitness landscapes in evolutionary biology are usually metaphorical or conceptual tools for understanding evolutionary processes, the landscapes in our framework are directly measurable from division count and trait dynamics on cellular lineages.

      We also explain "selection" and "selection strength" in Box 1. As pointed out, we define three kinds of selection strength measures. These three measures share a similar property of reporting the overall correlations between traits and fitness. However, they also have critical differences regarding additional selection effects they represent: S_KL^((1)) for growth rate gain, S_KL^((2)) for additional loss of growth rate under perturbations, and their difference S_KL^((2))-S_KL^((1)) for the effect of selection on fitness variance. We restructured the sections in Results and clarified these important meanings of the different selection strength measures.

      We removed the term "fitness cumulants" as this is non-general and might cause confusion to readers. We now rephrased this more precisely as "cumulants of a fitness landscape (with respect to chronological distribution)." Besides, we added a general explanation of "cumulants" to Box 1 and clarified what first, second, and third-order cumulants represent about distributions.

      2) Along the same line, it would be good to give more intuitive explanations of the different functions introduced. For instance I find (2) more intuitive than (1) to define h . I think some more intuition on what the authors call selection strengths would be super useful . In Table 1 selection strengths are related to Kublack Leibler divergence (which does not seem to be defined), it would be good to better explain this.

      In addition to Box 1, we included more intuitive explanations on fitness landscapes and selection strength where they first appear in the Theoretical background section. As pointed out, descriptions of the linkage between the selection strength measures and Kullback-Leibler divergence were only in the Supplemental Information in the original manuscript. We now explicitly show this linkage where we first define the selection strength.

      Following this comment, we also changed the definition of a fitness landscape from the original one to h(x)≔τΛ+ln⁡〖Q_rs (x)/Q_cl (x)〗 (Eq. 1), using the chronological and retrospective distributions introduced in the preceding paragraph. This definition is mathematically equivalent to the previous one, but we believe it is more intuitive.

      3) It seems to me the authors implicitly assume that, along a lineage, one would have almost stationary phenotypes (e.g. constant division rate) . However, one could imagine very different situations, for instance the division rates could depend on interactions with other cells in the growing population, and thus change with time along a lineage. One could also have some strong random components of division rate over time . I am wondering how those more complex cases would impact the results and the discussion

      We thank the reviewer for pointing out our insufficient explanation of an essential feature of this framework. As we now explain in the "Examples of biological questions" section (L62-65) and Discussion (L492-493), this framework does not assume stationary phenotypes (traits) on cellular lineages. On the contrary, we developed this framework so that one can quantify fitness and selection strength even for non-stationary phenotypes (traits) due to factors such as non-constant environments and inherent stochasticity.

      In fact, if traits are stationary in cellular lineages, this framework becomes essentially identical to the individual-based evolutionary biology framework (see ref. 26, for example). Our framework assumes a cell lineage as a unit of selection and any measurable quantities along cellular lineages as lineage traits, whether they are stationary or non-stationary. Therefore, our framework can evaluate fitness landscapes and selection strength without explicitly taking the environmental conditions around cells into account. This means that h(x) and S[X] in this framework extract the correlations between the traits of interest and division counts among various factors that could potentially influence division counts. On the other hand, this framework has a limitation due to this design: it cannot say anything about the influence of factors such as non-quantified traits and potential variations in environmental conditions. We now explain these important points explicitly in the revised manuscript (L493-496).

      Likewise, stochasticity in division rate does affect division count distributions, and its influence appears as differences in the selection strength of division count S[D]. As stated in the text, S[D] sets the maximum bound for the selection strength of any lineage trait (L143-145). Therefore, S_rel [X]≔S[X]/S[D] reports the relative strength of the correlation between the trait X and lineage fitness in a given level of S[D] in each condition.

      To clarify the influence of stochasticity in division rate, we present a cell population model in which cells divide stochastically according to generation time (interdivision time) distributions in Appendix 2 (we moved this section from the Supplemental Information with modifications). We can confirm from this model that the shapes of generation time distributions influence the selection strength S[D]. Importantly, one can understand from this model that stochasticity in generation times constantly introduces selection to cell populations and modulates the growth rate and selection strength even in the long-term limit. We now clarify this important point in the Discussion (L519-526).

      4) « Therefore, in contrast to a common assumption that selection necessarily decreases fitness variance, here we show that under certain conditions selection can increase fitness variance among cellular ». This is a super interesting statement, but there is such a lack of explanations and intuition here that it is obscure to me what actually happens here.

      When a decrease in fitness variance by selection is mentioned in evolutionary biology, an upper bound and inheritance of fitness across the generations of individuals are usually assumed. In such circumstances, selection drives the fitness distribution toward the maximum value, and the selection eventually causes fitness variance to decrease. However, even in this process, a decrease is not assured for every step; whether selection reduces fitness variance at each step depends on the fitness distribution at that time.

      In our argument, we compared fitness variances between chronological and retrospective distributions. We showed both theoretically and experimentally that there are cases where the variances of the retrospective distributions (distributions after selection) become larger than those of the chronological distributions (distributions before selection). The direction of variance change depends on the shape of chronological distributions, primarily on the skewness of the distributions (positive skew for increasing the variance and negative skew for decreasing the variance). The direction of variance changes can also be probed by the difference between the two selection strength measures S_KL^((2))-S_KL^((1)). Notably, we can demonstrate that there are cases where retrospective fitness variances are larger than chronological fitness variances even in the long-term limit, as shown by a cell population model in Appendix 2.

      We now explain what kind of situations are usually premised when reduction of fitness variance is mentioned and clarify that, in our framework, we compare the fitness variances between chronological and retrospective distributions (L542-548). We also explain that a selection effect on fitness variance generally depends on fitness distribution and that a larger fitness variance in retrospective distribution is possible even in the long-term limit (L548-557).

      Reviewer #2 (Public Review):

      The paper addresses a fundamental question: how do phenotypic variations among lineages relate to the growth rate of a population. A mathematical framework is presented which focuses on lineage traits, i.e. the value of a quantitative trait averaged over a cell lineage, thus defining a fitness landscape h(x). Several measures of selection strengths are introduced, whose relationships are clarified through the introduction of the cumulant generating function of h(x). These relationships are illustrated in analytical mathematical models and examined in the context of experimental data. It is found that higher than third order cumulants are negligible when cells are in early exponential phase but not when they are regrowing from a stationary phase.

      The framework is elegant and its independence from mechanistic models appealing. The statistical approach is broadly applicable to lineage data, which are becoming increasingly available, and can for instance be used to identify the conditions under which specific traits are subject to selection.

      We appreciate the reviewer for the positive evaluation. We will reply to your specific comments below.

      Reviewer #3 (Public Review):

      In this work the authors have constructed a useful mathematical framework to delineate contributions leading to differences in lineages of populations of cells. In principle, the framework is widely applicable to exponentially growing populations. An attractive feature is that the framework is not tailored to particular growth models or environmental conditions. I expect it will be valuable for systems where contributions from phenotypic heterogeneity overwhelm contributions from intrinsic stochasticity in cellular dynamics.

      I am generally very positive about this work. Nevertheless, a few specific concerns:

      1) In here, lineages are considered as fitter if they have more division events. But this consideration neglects inherent stochasticity in division events. Even in a completely homogeneous population, the number of division events for different lineages is different due to intrinsic stochasticity, but applying the methods discussed in this manuscript may lead to falsely assigning different fitness levels to different lineages. The reason why (despite having different number of division events) these lineages ought be assigned the same fitness level is that future generations of these cells will have identical statistics, in contrast with those of cells that are phenotypically different. Extending the idea to heterogeneous populations, the actual difference in fitness levels may be significantly different from what is obtained from the mathematical framework presented here, depending on the level of inherent stochasticity.

      We thank the reviewer for the comment on the point of which our explanation was insufficient in the original manuscript. Intrinsic stochasticity in interdivision time (generation time) is, in fact, critical for selection. For example, if a cell divides with a generation time shorter than the average due to stochasticity, this cell is likely to have more descendant cells in the future population on average than the other cells born at the same timing, even if the descendants follow identical statistics. Therefore, the properties of intrinsic stochasticity, including shapes of generation time distributions and transgenerational correlations, significantly affect the overall selection strength S_KL^((1)) [D] (and also S_KL^((2)) [D]). We now explain this important point in the Results section, referring to the analytical model in Appendix 2 (L327-334), and also in Discussion (L519-524).

      Importantly, even when cell division processes seem purely stochastic, different states in some traits might underlie these variations in generation times. In such cases, evaluating h(x) and S_rel [X] can still unravel the correlations between the trait values and fitness. Especially, the relative selection strength S_rel [X]≔S_KL^((1) ) [X]/S_KL^((1) ) [D] extracts the correlation of the trait values in a given level of division count heterogeneity in each condition. We now clarify this important aspect of the framework in Discussion (L524-526).

      When a cell population is composed of heterogeneous subpopulations each of which follows a distinct statistical rule, our framework evaluates the combined effects from the heterogeneous rules and the inherent stochasticity of each subpopulation. Untangling these two contributions is generally challenging unless we have appropriate markers for distinguishing the subpopulations. However, when the subpopulations follow significantly distinct statistics, the division count distribution should become skewed or multimodal, and the difference between the two selection strength measures S_KL^((2) ) [D]-S_KL^((1) ) [D] can suggest the existence of such subpopulations. Therefore, detailed analyses using all the selection strength measures and the fitness landscapes can provide insights into cell populations’ internal structures and selection.

      We now explain the effect of inherent stochasticity in generation times (L327-334 and L519-524) and discuss how we can probe the existence of subpopulations based on the selection strength measures (L508-512). Please also refer to our reply to the comment 3 of reviewer #1.

      2) In one of the sections the authors mention having performed analytical calculations for a cellular population in which cells divide with gamma distributed uncorrelated interdivision times. It's unclear if 1) within specific sub-populations, cells with the sub-population divide with the same division time, and the distribution of division times is due to the diverse distribution of sub-populations; or 2) if there are no such sub-populations and all cells stochastically choose division time from the same distribution irrespective of their past lineage. If the latter, then I do not see the need for a lineage-based mathematical formulation when the problem can dealt with in much simpler traditional ways which so not keep track of lineages.

      We dealt with the situation of 2) in this model. As noted by the reviewer, we can calculate the chronological and retrospective mean fitness and the population growth rate by a simpler individual-based age-structured population model (see ref. 10, for example). However, applying this framework to this model can clarify the utility of the cumulant generating function, the meaning of the differences between these fitness measures, and the effect of statistical properties of intrinsic stochasticity on long-term growth rate and selection. Therefore, we kept this model in Appendix 2 (the section is moved from Supplemental Information) with additional clarification of our motivation for analysis and the implication of the results.

      3) The analytical calculations provided seem to be exact only for trajectories of almost infinite duration (or in practice, duration much greater than typical interdivision time). For example, if the observation time is of the order of division time, this would create significant artifacts / artificial bias in the weights of lineages depending on whether the cell was able to divide within the observation time or not. Thus, the results claiming that contributions of higher order cumulants become significant in the regrowth from a late stationary phase are questionable, especially since authors note that 90% of cells showed no divisions within the observation time.

      We thank the reviewer for an insightful comment. It is true that the duration of observation influences the results. In the regrowing experiments with E. coli, we aimed to compare the two cell populations regrowing from different stages of the stationary phase. Therefore, it is appropriate to fix the time windows between the two conditions. Even though a significant fraction of cell lineages remains undivided, the regrowing cells already divide several times within this time window. Therefore, the results are valid if we compare and discuss the selection levels in this time scale. However, clarification of the selection in the longer time scales requires a more detailed characterization of lag time distributions under both conditions.

      We now clarify the range of validity of the results and the limitations on prediction for the long-term selection without knowing the details of the lag time distributions in Discussion (L536-539).

    1. Author Response:

      Reviewer #1 (Public Review):

      Here, Servello et al explore the role of temperature and the temperature-sensing neuron AFD in promoting protection against peroxide damage. Unlike many other environmental threats, peroxide toxicity is expected to be temperature-dependent, since its chemical reactivity should be enhanced by higher temperatures. The authors convincingly and rigorously show that transient exposure to 25C, a condition of mild heat stress in C. elegans, activates animals' defenses against peroxides but potentially not other agents. Interestingly, this response requires the temperature-sensing AFD neurons, though whether temperature-dependent AFD activity is itself involved in this regulation is not explored. Further, the authors find that temperature regulates AFD's expression of the insulin ins-39 and provide evidence supporting the idea that repression of ins-39 at 25C contributes to enhanced peroxide defense. The authors use transcriptomic approaches to explore gene expression changes in animals in which AFD neurons are ablated, providing evidence that the FoxO-family transcription factor DAF-16 potentiates AFD signaling. However, because AFD ablation triggers effects broader than transient 25C exposure, the significance of these findings for temperature-dependent peroxide defense is somewhat unclear. Additionally, the possibility that DAF-16 (as well as another protective factor, SKN-1) function in parallel to temperature stress is consistent with many of the results shown but is not as thoroughly considered. Together, these studies identify a fascinating example of pre-emptive threat response triggered by the detection of a potentiator of that threat, a phenomenon they term "enhancer sensing." While some predictions of the specificity of this phenomenon remain untested, the paper provides intriguing insight into the potential mechanisms by which it may occur.

      Major issues:

      The dependence of the enhancer-sensing phenomenon on AFD leads the authors to conclude that the 25C stimulus is sensed by AFD itself, but this needs to be directly tested. To do this, they could ask whether tax-4 function is required in AFD, or use mutants in which AFD's thermosensory function is compromised.

      We thank the reviewer for suggesting these experiments. As requested, we determined whether previously identified mechanisms for temperature perception by the AFD neurons were required for the temperature-dependent regulation of peroxide resistance using gcy-18 gcy-8 gcy-23 triple mutants and the respective single mutants. The findings from the new experiments lead us to conclude that temperature perception by AFD via the GCY-8, GCY-18, and GCY-23 receptor guanylate cyclases, which are exclusively expressed in the AFD neurons, contributes to the temperature-dependent regulation of peroxide resistance in C. elegans. These experiments are detailed in the following new paragraph in the results section:

      “Last, we determined whether previously identified mechanisms for temperature perception by the AFD neurons were required for the temperature-dependent regulation of peroxide resistance. The AFD neurons sense temperature using receptor guanylate cyclases, which catalyze cGMP production, leading to the opening of TAX-4 channels (Goodman and Sengupta, 2019). Three receptor guanylate cyclases are expressed exclusively in AFD neurons: GCY-8, GCY-18, and GCY-23 (Inada et al., 2006; Yu et al., 1997) and are thought to act as temperature sensors (Takeishi et al., 2016). Triple mutants lacking gcy-8, gcy-18, and gcy-23 function are behaviorally atactic on thermal gradients and fail to display changes in intracellular calcium or thermoreceptor current in the AFD neurons in response to temperature changes (Inada et al., 2006; Ramot et al., 2008; Takeishi et al., 2016; Wang et al., 2013; Wasserman et al., 2011). We found that when grown and assayed at 20°C, gcy-23(oy150) gcy-8(oy44) gcy-18(nj38) triple null mutants survived 43% longer in the presence of tBuOOH than wild-type controls (Figure 3J). In contrast, at 25°C, the gcy-23 gcy-8 gcy-18 triple mutants showed a 12% decrease in peroxide resistance relative to wild-type controls (Figure 3K). Therefore, the three AFD-specific receptor guanylate cyclases influenced the temperature dependence of peroxide resistance, lowering peroxide resistance at 20°C and slightly increasing it at 25°C. At 20°C, the gcy-8(oy44), gcy-18(nj38), and gcy-23(oy150) single mutants increased peroxide resistance by 10%, 51%, and 21%, respectively, relative to wild-type controls (Figure 3L). Therefore, each of the three AFD-specific receptor guanylate cyclases regulates peroxide resistance. We conclude that temperature perception by AFD via GCY-8, GCY-18, and GCY-23 enables C. elegans to lower their peroxide resistance at the lower cultivation temperature.”

      The enhancer-sensing model is fascinating, but as it stands it is somewhat oversold. The authors could tone down the writing, indicating that this model is suggested rather than shown. Alternatively, they could more carefully test some of its predictions - for example by exploring the response to other threats (e.g. some of the toxicants described in Fig. S5) at 20C and 25C in WT and AFD-ablated animals.

      We edited the manuscript and expanded the manuscript’s discussion to address these concerns as well as similar concerns from reviewer #3. In the paper we show that the regulation of the induction of H2O2 defenses in C. elegans is coupled to the perception of temperature (an inherent enhancer of the reactivity of H2O2). To understand the significance of this finding in an evolutionary context, and to explain why such a regulatory system would evolve, we introduced in the discussion a new conceptual framework, “enhancer sensing,” and devoted a section of the discussion to demonstrating that the phenomenon that we observed could not be adequately explained by existing frameworks used to understand the evolutionary origins of the regulatory systems for defense responses.

      We now realize that we did not sufficiently and clearly explain the scope for the criterion for establishing a phenomenon represents enhancer sensing, leading to incorrect predictions by reviewer’s 1 and 3 about (a) whether what we observed in C. elegans is an instance of enhancer sensing (or more proof is needed) and (b) what the enhancer sensing model for the coupling of temperature perception to H2O2 defense would predict about how temperature and the AFD neurons would affect resilience to other chemicals. We regret failing to adequately explain the model’s scope and predictions and believe that we have now explicitly addressed the scope of what constitutes enhancer sensing and the predictions of the model. In particular, we previously did not spell out (a) the distinction between the enhancer sensing strategy and the mechanistic implementation of that strategy; and, importantly, (b) we did not discuss what the enhancer sensing strategy coupling temperature perception to H2O2 defense in C. elegans predicted (and did not predict) about whether a similar strategy would be expected to be used by C. elegans to deal with other temperature-dependent threats. We now address these issues in two new paragraphs in the discussion that read:

      “We show here that C. elegans uses an enhancer sensing strategy that couples H2O2 defense to the perception of high temperature. We expect this strategy’s output (the level of H2O2 defense) to provide the nematodes with an evolutionarily optimal strategy across ecologically relevant inputs (cultivation temperatures) (Kussell and Leibler, 2005; Maynard Smith, 1982; Wolf et al., 2005). This strategy is implemented at the organismic level through the division of labor between the AFD neurons, which sense and broadcast temperature information, and the intestine, which responds to that information by providing H2O2 defense (Figure 9D). Ascertaining that C. elegans relies on this enhancer sensing strategy does not depend on the temperature information broadcast by AFD exclusively regulating defense responses to temperature-dependent threats, because the regulation of defenses towards temperature-insensitive threats could affect defenses towards temperature-dependent threats; for example, suppressing defenses towards a temperature-insensitive threat would be beneficial if those defenses interfered with H2O2 defense or depleted energy resources contributing to H2O2 defense.

      As with any sensing strategy, enhancer sensing strategies are more likely to evolve when sensing is informative and responding is beneficial. In their natural habitat, C. elegans encounter many environmental chemicals that, like H2O2, are inherently more reactive at higher temperatures. It will be interesting to determine the extent to which C. elegans uses enhancer sensing strategies coupling temperature perception to the induction of defenses towards those chemicals, and whether those strategies rely on temperature perception and broadcasting by the AFD neurons. We expect that sensing strategies regulating defense towards those chemicals would be more likely to evolve when those chemicals are common, reactive, and cause consequential damage.”

      We note that our ability to predict survival to other toxicants, such as those that trigger specific gene-expression responses that are AFD-dependent but are unaffected between 20C and 25C (as proposed by the reviewer), is limited not only by our lack of knowledge about the specific mechanisms that protect worms from those toxicants, but also by our lack of knowledge about whether defense towards hydrogen peroxide interferes (or synergizes) with defense towards each of those toxicants and whether defense towards those toxicants interferes (or synergizes) with H2O2 defense. We therefore think that those experiments would be better addressed in future studies.

      The role of ins-39 remains somewhat speculative. Fig 4F shows that ins-39 mutants have a reduced induction of peroxide defense, but it seems that this could be the result of a ceiling effect. The authors' model predicts that overexpression of ins-39, particularly at 25C, should sensitize animals to peroxide damage, a prediction that should be tested directly. Further, the authors seem to assume that AFD is the relevant site of ins-39 function, but this needs to be better supported.

      As requested by all three reviewers, we determined whether ins-39 gene expression in AFD was sufficient to lower peroxide resistance by restoring ins-39(+) gene expression only in the AFD neurons using the AFD-specific gcy-8 promoter. As predicted by the reviewer, these worms were more sensitive to peroxide than wild-type worms. The findings from this experiment lead us to conclude that expression of ins-39 in the AFD neurons was sufficient to regulate the nematode’s peroxide resistance. The new section reads:

      “Next, we determined whether the INS-39 signal from AFD regulated the nematode’s peroxide resistance. The tm6467 null mutation in ins-39 deletes 520 bases, removing almost all the ins-39 coding sequence (Figure 5A), and inserts in that location 142-bases identical to an intervening sequence located between ins-39 and its adjacent gene. In nematodes grown and assayed at 20°C, ins-39(tm6467) increased peroxide resistance by 26% relative to wild-type controls (Figure 5F). To determine whether ins-39 gene expression in AFD was sufficient to lower peroxide resistance, we restored ins-39(+) expression only in the AFD neurons using the AFD-specific gcy-8 promoter (Inada et al., 2006; Yu et al., 1997) in ins-39(tm6467) mutants. Expression of ins-39(+) only in AFD eliminated the increase in peroxide resistance of ins-39(tm6467) mutants (Figure 5F). Notably, the peroxide resistance of the two independent transgenic lines was 28% and 30% lower than that of wild-type controls, likely due to overexpression of the gene beyond wild-type levels. We conclude that the gene dose-dependent expression of ins-39 in the AFD neurons regulated the nematode’s peroxide resistance.”

      The temperature-shift experiments in figure 5G (formerly 4F) indicated that the effect on peroxide resistance at 20C of growth at 25C and of the ins-39 mutation were non additive. We interpreted this epistatic interaction to be due to action in a common pathway. It is possible that while growth at 25C increases the subsequent peroxide resistance at 20C, it could limit the nematodes’ subsequent peroxide resistance at 20C (beyond those peroxide-resistance increasing effects) when in combination with another intervention, even if those interventions acted via parallel mechanisms—a ceiling effect, as proposed by the reviewer. We favor the alternative interpretation, that the mechanisms act sequentially, because of our findings that ins-39 gene expression within AFD was lower at 25C than at 20C, leading us to propose the sequential model in figure 5H (formerly 4G).

      Most of the daf-16 and skn-1 experiments are carried out in AFD-ablated animals, making the relevance of these findings for the 25C-dependent induction of peroxide defense somewhat unclear. As the authors show, AFD ablation causes much more extensive changes than transient 25C exposure, clearly seen in slope of the line in 3C. Further, unlike 25C exposure, AFD ablation is a chronic and non-physiological state. It would be useful for the authors to be cautious in their interpretation of these findings and to be clearer about how strongly they can connect them to the "enhancer sensing" phenomenon. Along these lines, the potentiation idea could be toned down a bit. Much of the data is consistent with parallel function for daf-16 (and skn-1) - for example, Fig 5C indicates additive effects of daf-16 and 25C exposure; 6C shows that AFD ablation still has a clear effect on peroxide sensitivity in the absence of both daf-16 and skn-1; and Fig S8a shows that much of the transcriptional response to AFD ablation (along PC1) is intact in daf-16 animals.

      We have made several adjustments in the text to address these concerns. As the reviewer noted, the experiments with skn-1 were performed only in AFD ablated worms. We have renamed the section heading to “SKN-1/NRF and DAF-16/FOXO collaborate to increase the nematodes’ peroxide resistance in response to AFD ablation” to make that clear.

      In contrast, the peroxide resistance experiments with daf-16 were done also in worms grown at 25C and then shifted to 20C during the peroxide resistance assay. The connection of daf-16 with the temperature dependent regulation of peroxide resistance was established in temperature shifts experiments in daf-16 single mutants (Figure 6C, formerly 5C) and in transgenic worms rescuing the daf-16 mutant only in the intestine (Figure 6F). In the revised text we make it clearer that the effect of the daf-16 mutation is bigger when the nematodes are shifted from 25C to 20C: “The daf-16(mu86) null mutation decreased peroxide resistance in nematodes grown at 25°C and assayed at 20°C by 35%, a greater extent than the 21% reduction in peroxide resistance induced by that mutation in nematodes grown and assayed at 20°C (Figure 6C).”

      As the reviewer noted, daf-16 and skn-1 have a role in peroxide resistance when the AFD neurons are not ablated (albeit a smaller one than when those neurons are ablated). We have made several changes and additions to the text to make that explicit. Most notably, the revised last paragraph of the SKN-1 section now reads: “We propose that when nematodes are cultured at 20°C, the AFD neurons promote signaling by the DAF-2/insulin/IGF1 receptor in target tissues, which subsequently lowers the nematode’s peroxide resistance by repressing transcriptional activation by SKN-1/NRF and DAF-16/FOXO. However, this repression is not complete, because both daf-16(mu86) and skn-1(RNAi) lowered peroxide resistance at 20°C when the AFD neurons were present. It is also likely that DAF-16 and SKN-1 are not the only factors that contribute to peroxide resistance in AFD-ablated nematodes at 20°C, because AFD ablation increased peroxide resistance in daf-16(mu86); skn-1(RNAi) nematodes, albeit to a lesser extent than in daf-16(+) or skn-1(+) backgrounds.”

      The potentiation idea was specific to the effects of DAF-16 on gene expression. As the reviewer noted, much of the transcriptional response to AFD ablation is intact (albeit reduced in magnitude) in AFD-ablated daf-16 mutants, leading to a shift in the PC1 score for the mutant. At the level of the expression of individual genes, we quantified those effects in Figure 8G (formerly 7D). When we did the RNAseq experiments we had expected that lack of daf-16 would eliminate either all the changes in gene expression induced by AFD ablation or eliminate those changes for a subset of genes. Instead, what we found was much more subtle, and unexpected: the size of the gene expression change induced by AFD ablation was reduced by the daf-16 mutation, and that reduction was systematic. Specifically, we found that the bigger the change in gene expression induced by AFD ablation, the bigger the effect of daf-16 in the AFD ablated animals (that is, potentiation), leading to a change in the slope in the regression line in Figure 8G. We revised the paper to ensure we only used the word potentiation in this context (gene expression), even though formally DAF-16 also potentiated the effects of AFD ablation (and temperature shift from 25C to 20C) on peroxide resistance.

      Reviewer #3 (Public Review):

      This paper offers novel mechanistic insights into how pre-exposure to warm temperature increases the resistance of C. elegans to peroxides, which are more toxic at warmer temperature. The temperature range tested in this study lies within the animal's living conditions and is much lower than that of heat shock. Therefore, this study expands our understanding of how past thermosensory experience shapes physiological fitness under chemical stress. The paper is technically sound with most experiments or analyses carried out rigorously, and therefore the conclusions are solid. However, it challenges our current understanding of the role of the C. elegans thermosensory system in coping with stress. The traditional view is that the AFD thermosensory neuron is activated upon sensing temperature rise, and that temperature sensation through AFD positively regulates systemic heat shock response and promotes longevity in C. elegans. Thus, it is quite unexpected that AFD ablation activates DAF-16 and improves peroxide resistance. It also appears counterintuitive that genes upregulated at 25 degrees overlap extensively with those upregulated by AFD ablation at 20 degrees. I feel that it is premature to coin the term "enhancer sensing" for such a phenomenon, as their work does not rule out the possibility that AFD ablation increases resistance to other stresses that are independent of temperature regarding their toxicity or magnitude of hazard. Additional work is necessary to clarify these issues.

      1. Whether the role of AFD in inhibiting peroxide resistance is related to AFD activity needs further clarification. AFD activity depends on the animal's thermosensory experience. As animals in this study are maintained at 20 degrees unless indicated specifically, the AFD displays activities starting around 17 degrees and peaks around 20 degrees. Under such condition, the AFD displays little or no activity to thermal stimuli around 15 degrees. It will be important to test whether cultivation of animals at 20 degrees improves peroxide resistance at 15 degrees, compared to 15 degrees-cultivation/15 degrees peroxide testing. The authors should also test whether AFD ablation further improves survival under peroxides at 15 degrees for animals grown at 20 degrees, whose AFD should show little or no activities at 15 degrees.

      The reviewer raises an interesting point about the relation between the mechanisms that determine AFD activity in response to temperature and those that enable AFD to regulate peroxide resistance. In the revised manuscript we tested whether known mechanisms enabling AFD to sense changes in temperature acutely (receptor guanylate cyclases GCY-8, GCY-18, and GCY-23) played a role in the temperature dependence of peroxide resistance. We found that they did, as detailed in our response to reviewer #1’s point 1.

      As noted by reviewer #2 in their point 1, and in our reply to that comment (and in a new discussion paragraph in the revised manuscript), the relationship between the known mechanisms the acutely regulate the activity of AFD in response to temperature and the mechanisms by which constant cultivation temperature regulates gene expression in AFD (and therefore the expression of peroxide resistance regulating signals like INS-39) is not well understood. Therefore, it is difficult to predict which temperatures will cause induction of peroxide defenses via AFD-dependent mechanisms, or via other mechanisms. While we agree with the reviewer that it will be interesting to characterize the extent to which other cultivation temperatures besides 25C lead to increased peroxide resistance at lower temperatures (including the proposed shifts from 20C to 15C), we think that those questions will be better addressed in future studies.

      2. The importance of the thermosensory function of AFD should be verified. In the current study, the tax-4 mutation was used to infer AFD activity, but tax-4 is expressed in sensory neurons other than AFD. In addition to AFD, AWC can sense temperature and it also expresses tax-4. Therefore, influence on AFD from other tax-4-expressing neurons cannot be excluded. On the other hand, ablation of AFD removes all AFD functions, including those that are constitutive and temperature-independent. Therefore, the authors should test the gcy-18 gcy-8 gcy-23 triple mutant, in which the AFD neurons are fully differentiated but completely insensitive to thermal stimuli. These three thermosensor genes are exclusively expressed in AFD. Compared to the tax-4 mutant that is broadly defective in multiple sensory modalities, this triple gcy mutant shows defects specifically in thermosensation. They should see whether results obtained from the AFD ablated animals could be reproduced by experiments using the gcy-18 gcy-8 gcy-23 triple mutant. The authors are also recommended to investigate ins-39 expression in AFD and profile gene expression patterns in the gcy-18 gcy-8 gcy-23 triple mutant.

      We thank the reviewer for this suggestion. We have performed the requested experiments, as detailed in our response to reviewer #1’s point 1. Briefly, we determined found that gcy-18 gcy-8 gcy-23 triple mutants increased peroxide resistance at 20C but not at 25C, and found that the respective gcy single mutants affected peroxide resistance at 20C. In light of these findings, we concluded that temperature perception by AFD via GCY-8, GCY-18, and GCY-23 enables C. elegans to lower their peroxide defenses at the lower cultivation temperature.

      3. The literature suggests that AFD promotes longevity likely in part through daf-16 (Chen at al., 2016) or independent of daf-16 (Lee & Kenyon, 2009). Whatever it is, various studies show that activation of AFD and daf-16 promote a normal lifespan at higher temperature, and AFD ablation shortens lifespan at either 20 or 25 degrees. Therefore, the finding that DAF-16-upregulated genes overlap extensively with those upregulated by AFD ablation is quite unexpected (Figure 5B). The authors should perform further gene ontology (GO) analysis to identify subsets of genes co-regulated by DAF-16 and AFD ablation, whether these genes are reported to be involved in longevity regulation, immunity, stress response, etc.

      We thank the reviewer for this interesting comment about the complex mechanisms by which AFD regulates longevity. We note that AFD also has additional temperature-dependent roles in lifespan regulation, as Murphy et al. 2003 found that RNAi of gcy-18 increased lifespan in wild-type worms at 20C but not at 25C. Therefore, AFD-specific interventions can also be lifespan extending at 20C.

      We performed WormCat analysis, which is similar to gene ontology, in Figure 8-figure supplement 2 (formerly Figure S8G), which we described in the results section: “we found that the extent to which AFD ablation affected the average expression of sets of genes with related functions (Higgins et al., 2022; Holdorf et al., 2020) was systematically lower in daf-16(mu86) mutants than in daf-16(+) nematodes (R_2 = 86%, slope = 0.67, _P < 0.0001, Figure 8—figure supplement 2).” Visual inspection of the plot and the very high coefficient of determination of 86% indicate that the size of the effect of AFD ablation on gene expression was systematically smaller when the contribution of DAF-16 to gene expression was removed.

      In the revised manuscript we also moved the three panels quantifying the expression of DAF-16 targets and daf-16-regulated genes from the supplement to the main figure. One of those panels (Figure 8F) shows that genes upregulated by daf-16(+) in daf-2 mutants were disproportionally affected by lack of daf-16 in AFD-ablated worms, as we described in the results section: “In addition, in AFD ablated nematodes, lack of daf-16 lowered the expression of genes upregulated in a daf-16-dependent manner in daf-2(-) mutants (Murphy et al., 2003) to a greater degree than in unablated nematodes (Figure 8F).”

      4. I feel that "enhancer sensing" is an overstatement, or at least a premature term that is not sufficiently supported without further investigations. The authors should explore whether AFD ablation or pre-exposure to warm temperature specifically enhances resistance to a stressor the toxicity of which is increased at higher temperature, but does not affect the resistance to other temperature-insensitive threats.

      We edited the manuscript and expanded the manuscript’s discussion to address these concerns as well as similar concerns from reviewer #1. For clarity, we repeat much of our response to reviewer #1’s point 2 here, with the last paragraph of this response specific to this reviewer’s comment.

      In the paper we show that in C. elegans the regulation of the induction of H2O2 defenses is coupled to the perception of temperature (an inherent enhancer of the reactivity of H2O2). To understand the significance of this finding in an evolutionary context, and to explain why such a regulatory system would evolve, we introduced in the discussion a new conceptual framework, “enhancer sensing,” and devoted a section of the discussion to demonstrating that the phenomenon that we observed could not be adequately explained by existing frameworks used to understand the evolutionary origins of the regulatory systems for defense responses.

      We now realize that we did not sufficiently and clearly explain the scope for the criterion for establishing a phenomenon represents enhancer sensing, leading to incorrect predictions by reviewer’s 1 and 3 about (a) whether what we observed in C. elegans is an instance of enhancer sensing (or more proof is needed) and (b) what the enhancer sensing model for the coupling of temperature perception to H2O2 defense would predict about how temperature and the AFD neurons would affect resilience to other chemicals. We regret failing to adequately explain the model’s scope and predictions and believe that we have now explicitly addressed the scope of what constitutes enhancer sensing and the predictions of the model. In particular, we previously did not spell out (a) the distinction between the enhancer sensing strategy and the mechanistic implementation of that strategy; and, importantly, (b) we did not discuss what the enhancer sensing strategy coupling temperature perception to H2O2 defense in C. elegans predicted (and did not predict) about whether a similar strategy would be expected to be used by C. elegans to deal with other temperature-dependent threats. We now address these issues in two new paragraphs in the discussion that read:

      “We show here that C. elegans uses an enhancer sensing strategy that couples H2O2 defense to the perception of high temperature. We expect this strategy’s output (the level of H2O2 defense) to provide the nematodes with an evolutionarily optimal strategy across ecologically relevant inputs (cultivation temperatures) (Kussell and Leibler, 2005; Maynard Smith, 1982; Wolf et al., 2005). This strategy is implemented at the organismic level through the division of labor between the AFD neurons, which sense and broadcast temperature information, and the intestine, which responds to that information by providing H2O2 defense (Figure 9D). Ascertaining that C. elegans relies on this enhancer sensing strategy does not depend on the temperature information broadcast by AFD exclusively regulating defense responses to temperature-dependent threats, because the regulation of defense towards temperature-insensitive threats could affect defenses towards temperature-dependent threats; for example, suppressing defenses towards a temperature-insensitive threat would be beneficial if those defenses interfered with H2O2 defense or depleted energy resources contributing to H2O2 defense.

      As with any sensing strategy, enhancer sensing strategies are more likely to evolve when sensing is informative and responding is beneficial. In their natural habitat, C. elegans encounter many environmental chemicals that, like H2O2, are inherently more reactive at higher temperatures. It will be interesting to determine the extent to which C. elegans uses enhancer sensing strategies coupling temperature perception to the induction of defenses towards those chemicals, and whether those strategies rely on temperature perception and broadcasting by the AFD neurons. We expect that sensing strategies regulating defense towards those chemicals would be more likely to evolve when those chemicals are common, reactive, and cause consequential damage.”

      We note, in the first of the new discussion paragraphs, that the existence of an enhancer sensing strategy is not contingent on whether the AFD neurons (that implement the temperature sensing and temperature-information broadcasting functions regulating peroxide defenses) also do not regulate defense responses to temperature-insensitive threats. For example, it may be beneficial to an animal facing high concentrations of environmental peroxides to suppress defense against a temperature-insensitive threat when those defenses are detrimental towards defense towards hydrogen peroxide. This could occur, for example, because there is an energetic trade off when mounting multiple defense responses, or because specific defenses towards temperature-insensitive threats interfere with peroxide defense. As we noted in our response to reviewer #1’s point 2, our ability to predict survival to threats other than H2O2 (including temperature-independent threats) is limited not only by our lack of knowledge about the specific mechanisms that protect worms from those threats, but also by our inability to predict the extent to which defenses towards different threats operate independently, constructively, or destructively with those that provide hydrogen peroxide defense. We therefore think that those experiments would be better addressed in future studies.

    1. Author Response

      Reviewer #1 (Public Review):

      This study examines whether the D2 receptor antagonist amisulpride and the mu-opioid receptor antagonist naltrexone bias model-based vs model-free behavior in a well-established two-step task of behavioral control. The authors find that amisulpride enhances model-based choices, which is further supported by computational modeling of the data, revealing an increase in the relative contribution of model-based control of behavior. Naltrexon on the other hand had no reliable effect on model-based behavior.

      Overall, this is a very nice study with many strengths, including the task and data analysis. A particular strength of the design is the combination of a between-subject drug administration protocol with two within-subject (baseline vs. drug) sessions. This reduces between-subject variability in baseline model-based vs model-free behavior and enhances the power to detect drug effects.

      The introduction could do a better job articulating the rationale for testing the effect of these two specific drugs. Currently, the rationale is that both transmitter systems targeted by these drugs are involved in drug addiction, which is characterized by an imbalance in model-based vs. habitual control of behavior. This appears somewhat indirect.

      Blood draws were used to determine serum levels for amisulpride and naltrexone but these data are not included as covariates in the analysis.

      We thank the reviewer for the high acclaim of our study, and for the constructive comments to improve it. We acknowledge that the introduction did not motivate the main research goal of the manuscript clearly enough. We have now extended this section and provided further insight into our reasoning behind the study design. Beyond the involvement of opioid and dopamine promoting drugs in addiction, there is abundant evidence from experimental studies showing comparable effects of manipulating receptors of both systems in model-free processes such as reinforcement, and habit formation. Based on this overlap one may predict that both neurotransmitter systems disrupt habit formation in a similar fashion, and that blocking their respective receptors will improve the ability to behave in a model-based manner. However, as we now elaborate in the manuscript, an argument against this could be that disrupting model-free processes might not be enough to promote model-based behaviour, as such behaviour relies heavily on cognitive control. It is therefore especially interesting to compare opioid antagonists, that do not enhance cognitive function, with a D2 antagonist at a dosage that has been shown to increase cognitive control as well as increase the desire to exert cognitive effort.

      This is expressed in the following paragraphs of the Introduction (p.2 §3 and p.3 §1):

      “Opiates, psychostimulants, and most other drugs of abuse increase the release of dopamine along the mesolimbic pathway (Chiara, 1999; Koob & Bloom, 1988), a circuit that plays a central role in reinforcement learning (Schultz, Dayan, & Montague, 1997). On top of this, the reinforcing properties of addictive drugs also depend on their ability to activate the μ opioid receptors (Becker, Grecksch, & Kraus, 2002; Benjamin, Grant, & Pohorecky, 1993; Le Merrer, Becker, Befort, & Kieffer, 2009). This suggests that both the dopamine and the opioid systems might be particularly relevant in model-free reinforcement learning processes that drive the formation of habitual behaviour. Studies in rodents show that activating receptors of both systems across the striatum increases cue-triggered wanting of rewards (Peciña & Berridge, 2013; Soares-Cunha et al., 2016). Conversely, inhibition of both D1-type and D2-type of dopamine receptors (referred to as D1 and D2 from here on) as well as opioid receptors reduces motivation to obtain or consume rewards (Laurent, Leung, Maidment, & Balleine, 2012; Peciña, 2008; Soares-Cunha et al., 2016). This data raises the hypothesis that the drift towards habitual control is enabled by dopamine and opioid receptors via a common neural pathway. Recent work in humans provides some evidence in this direction, whereby systemic administration of opioid and D2 dopamine receptor antagonists causes a comparable reduction of cue responsivity and reward impulsivity (Weber et al., 2016) and decreases the effort to obtain immediate primary rewards (Korb et al., 2020). This suggests that when allocating control between the model-based and model-free system, dopamine or opioid receptor antagonists might comparatively disrupt model-free behavioural strategies and increase model-based behaviour. Yet, no study in humans has directly investigated this. Furthermore, disrupting habit formation might not in itself lead to increased model-based control, without either increasing the perceived value of applying cognitive control or making it easier to do so.”

      We also mention the implications of this direct comparison of the two compounds in the Discussion (p.8 §1):

      “Our findings provide initial evidence for a divergent involvement of the dopamine and opioid neurotransmitter systems in the shift between habitual and goal-directed behaviour. The lack of effects of naltrexone on the model-based/model-free trade-off also provides some support for the notion that simply disrupting neurobiological systems that subserve habitual behaviour might not be enough to increase goal-directed behaviour in this task. An increase in the model-based/model-free weight following amisulpride administration advocates for dopamine playing a decisive role in flexibly applying cognitive control to facilitate model-based behavior and highlights the specific functional contribution of the D2 receptor subtype.”

      Reviewer #3 (Public Review):

      I think this is an interesting study on an important topic. I agree that there is not enough research to understand how the dopaminergic system interfaces with goal-directed planning, and I like the focus on specific types of dopamine receptors. It is interesting that they seem to find a specific effect on just the dopamine antagonist. I also appreciate the clarity with which the authors describe this field of research and their results. However, I also feel that there are several concerns with this paper, both in terms of framing and in terms of the experimental design and analysis. For completeness, I must note that I am not a dopamine expert.

      I felt that the introduction of the paper did not sufficiently motivate the focus on the comparison between neurotransmitters systems, and (for the dopaminergic system) the distinction between D1/D2 receptors. Why is the mapping between stability/flexibility and D1/D2 receptors important? How does this relate to model-based control? Why do the authors predict that model-based control would increase when D2 receptors are blocked? If the hypothesis is about contrasting the contribution of D1 and D2 receptors to goal-directed control, why did the authors not use antagonists directly targeting these two systems?

      In addition, the predictions that are more explicit, for example, that blocking D2 receptors increases MB control by stabilizing goal-relevant information, are fairly specific. However, the current version of the two-step task is not amenable to testing such a specific hypothesis, because it doesn't allow us to measure the specific components of planning (e.g., maintaining goals, the representation of the structure, prospective reasoning). Moreover, MB control in this version of the two-step task is marked by flexibility, because it requires the agent to be sensitive to switching starting states.

      The predictions for the opioid system are also lacking. Why are the authors targeting this system? Why are they comparing the effects of the D2 antagonist with the opioid agonist? Why do the authors predict that amisulpride should have a stronger effect than naltrexone? In my opinion, these predictions were not sufficiently laid out, which made it difficult to appreciate the authors' motivation to run the study.

      We thank the reviewer for their critical take on the manuscript and for clearly pointing out the weaknesses in argumentation. In particular, we appreciate the reviewer’s comment on the lack of clarity in describing why the comparison of dopamine and opioid antagonists’ effects on MB/MF behaviour might be particularly interesting and why we focused on D2 and not D1 receptors. We now extended the introduction section to clarify our rationale for comparing these two compounds (p.2-3). In short, apart from the fact that both systems are implicated in addiction, there is also abundant experimental evidence from human and non-human animal studies that the two systems are involved in processes related to forming habitual responses to primary and secondary rewards. This suggests that blocking receptors of either system might comparatively affect the MB/MF trade-off by impairing model-free processes. We therefore proceeded to compare opioid and dopamine antagonists.

      As we note, using D1 antagonists would likely be detrimental to cognitive control related processes, and therefore more likely to decrease model-based performance. We therefore chose to compare opioid antagonists to D2 receptor antagonists. Another important reason for comparing the effects of opioid and D2 dopamine antagonists is the reasoning that it is not clear whether blocking model-free processes is in itself enough to promote model-based behaviour, without boosting cognitive control related processes. Given the recent evidence for D2 antagonists increasing cognitive effort (Westbrook et al., 2020) and the proposed role of prefrontal D2 receptors in destabilising prefrontal representations (according to the dual state theory of prefrontal dopamine function proposed by Durstewitz & Seamans, 2008)) we reasoned that D2 receptor blockade might also boost the ability (or willingness) to keep the mapping between spaceships and planets online while making choices.

      We incorporated these arguments in the revised Introduction (p.2-3):

      “Opiates, psychostimulants, and most other drugs of abuse increase the release of dopamine along the mesolimbic pathway (Chiara, 1999; Koob & Bloom, 1988), a circuit that plays a central role in reinforcement learning (Schultz et al., 1997). On top of this, the reinforcing properties of addictive drugs also depend on their ability to activate the μ opioid receptors (Becker et al., 2002; Benjamin et al., 1993; Le Merrer et al., 2009). This suggests that both the dopamine and the opioid systems might be particularly relevant in model-free reinforcement learning processes that drive the formation of habitual behaviour. Studies in rodents show that activating receptors of both systems across the striatum increases cue-triggered wanting of rewards (Peciña & Berridge, 2013; Soares-Cunha et al., 2016). Conversely, inhibition of both D1-type and D2-type of dopamine receptors (referred to as D1 and D2 from here on) as well as opioid receptors reduces motivation to obtain or consume rewards (Laurent et al., 2012; Peciña, 2008; Soares-Cunha et al., 2016). This data raises the hypothesis that the drift towards habitual control is enabled by dopamine and opioid receptors via a common neural pathway. Recent work in humans provides some evidence in this direction, whereby systemic administration of opioid and D2 dopamine receptor antagonists causes a comparable reduction of cue responsivity and reward impulsivity (Weber et al., 2016) and decreases the effort to obtain immediate primary rewards (Korb et al., 2020). This suggests that when allocating control between the model-based and model-free system, dopamine or opioid receptor antagonists might comparatively disrupt model-free behavioural strategies and increase model-based behaviour. Yet, no study in humans has directly investigated this. Furthermore, disrupting habit formation might not in itself lead to increased model-based control, without either increasing the perceived value of applying cognitive control or making it easier to do so. Crucially, there are important differences in how each of the two neurochemical systems relate to cognitive control that is pivotal for model-based behaviour. Across a wide range of studies using various dosing schemes, opioid receptor antagonists did not have an effect on tasks that require cognitive control, such as working memory (Del Campo, McMurray, Besser, & Grossman, 1992; File & Silverstone, 1981; Volavka, Dornbush, Mallya, & Cho, 1979), sustained attention(Zacny, Coalson, Lichtor, Yajnik, & Thapar, 1994), or mathematical problem-solving (Del Campo et al., 1992) (see (van Steenbergen, Eikemo, & Leknes, 2019) for a review). Dopaminergic circuits, on the other hand, play a central role in higher cognitive functions and goal-directed behaviour (Brozoski, Brown, Rosvold, & Goldman, 1979). In particular, D1 dopamine receptors in the prefrontal cortex enable maintenance of goal-relevant information and working memory(Goldman-Rakic, 1997; Sawaguchi & Goldman-Rakic, 1991; van Schouwenburg, Aarts, & Cools, 2010; Williams & Goldman-Rakic, 1995), while the D2 dopamine receptor activity disrupts prefrontal representations(Durstewitz & Seamans, 2008). In support of this, decreased working memory performance was observed after blocking prefrontal D1, but not prefrontal D2 receptors (Arnsten, 2011; Sawaguchi & Goldman-Rakic, 1991; Seamans & Yang, 2004). In humans, systemic administration of D2 antagonism increased the ability to maintain and manipulate working memory representations (Dodds et al., 2009; Frank & O’Reilly, 2006) and increased the value of applying cognitive effort (Westbrook et al., 2020). This data suggests that blocking D2 receptors, in contrast to blocking opioid receptors, could further facilitate model-based behaviour through enabling or encouraging flexible use of cognitive control.”

      Another important point that the reviewer stresses is that the two-step task we use does not allow us to draw any conclusions through which mechanisms amisulpride increases model-based behaviour. Although we base our hypothesis that D2 might promote model-based behaviour (on top of disrupting habit formation) on previous work showing D2 blockade increasing cognitive effort and the ability to manipulate working memory representations, we completely agree that our setup does not give any definite answers about which of these cognitive processes mediated the increase in model-based weights. In the discussion we try to interpret our findings in the context of the dual-state hypothesis framework and within the framework of striatal control of adaptive behaviour (p.8 §3-4), whereby we centre our argumentation around dopaminergic circuits that subserve one or the other mechanism.

      We agree with the reviewer that the task requires a high degree of flexible planning and that the dual-state theory might not be enough to account for our effects. We mention this in the Discussion (p. 8 §3):

      “The effects of D2 antagonism on model-based/model-free behaviour in our study can be interpreted within this [dual-state] framework to result from increased ability to maintain prefrontal representation of the mapping between the spaceships and the planets online. However, this is difficult to reconcile with the fact that model-based behaviour in dynamic learning paradigms, such as the one used here, also requires flexible updating of action values.”

      We also elaborate on the general limitations of drawing inference about the underlying cognitive/computational mechanisms in the Discussion (p. 14 §2):

      “Importantly, it should also be acknowledged that the behavioural setup in our study does not allow us to draw definite conclusions about the mechanisms that mediate amisulpride’s effects on model-based or model-free behaviour. For example, it is not clear whether amisulpride increases the perceived benefit of applying cognitive control, or whether it increases the participant’s ability to do so through various possible complementary processes, such as goal maintenance or planning abilities. Future studies should further elucidate the mechanistic contributions of dopamine receptors to the distinct coding and utilisation of task relevant representations (Langdon, Sharpe, Schoenbaum, & Niv, 2018; Stalnaker et al., 2019).”

      Related to this, I felt that the introduction was a bit too quiet on the genetic markers. Their discussion in the results was a bit surprising, and it wasn't quite clear why the authors decided to investigate these interaction effects.

      We appreciate this comment as we were quite uncertain ourselves on how much weight to give to those data. Previous research had indeed shown profound variability in MB/MF behaviour across genotypes related to baseline dopamine function. The main purpose of the genetic analysis was to control for potential baseline differences and to explore the drug genotype interactions. However, including the serum data as a covariate in analyses, as suggested by the other reviewers, made most results relating to the genetic analysis disappear, even when using less conservative priors that likely understate the variance of posterior distributions of group effects. We have therefore opted to keep coverage of the genetic data to a minimum, but still report the results and make the data available online for future studies.

      I found some of the core results confusing. Most importantly, why does amisulpride make people less like to stay after a reward when the first-stage state is the same? When first-stage states repeat, both an MB agent and an MF agent will be more likely to stay after a reward. To me, this kind of behavior doesn't seem particularly model-based. Why does this behavior occur under amisulpride? I was surprised that the authors did not really address it.

      We agree that these results have been somewhat difficult to reconcile. However, adding amisulpride serum levels to our analyses now allow us to get a better understanding. It seems that across both serum groups model-based behaviour was increased, however, only in the high serum group did we additionally observe increased exploration. We also note that increased exploration was related to a reduced effect of previous points in the first same state trials, whereas the interaction term (effect of previous points in diff vs. same state trials) was more strongly associated with the model-based weight. In the manuscript this is described in the results section and in the discussion.

      The following text is included in the Results (p.6):

      “We first observed that the more model-based choices the participants made, the more money they earned (r = 0.65, 95% CI [0.53, 0.76]). This serves as a validity check of the task, which was designed to make cognitive control pay off (literally)45. We then looked at how the model parameters relate to the random slopes from the behavioural analysis of staying behaviour and found that the participant-level (random effect) slope for the effect of previous points on staying behaviour in different vs. same first state trials was most strongly related to ω (d = 0.493, P < 10e-3) and negatively related to the inverse temperature parameter η (d = -0.328, P < 10e-3), and the slope for trials with same first states was mostly related to η (d = 0.822, P < 10e-3), and less so to ω (d = 0.235, P < 10e-3).”

      The following text is included in the Discussion (p.8 §2):

      “Interestingly, amisulpride also increased choice stochasticity parametrised by the softmax inverse temperature parameter. In a paradigm with two choice options, it cannot be definitively determined whether this indicates higher decision-noise or increased exploration of alternative choices. We can however speculate that increased decision noise would lead to overall detrimental effects on learning in both trial types with same and different consecutive first stage states, which we do not observe in our data. The effect on the choice stochasticity parameter was only present in participants with a higher effective dose75, suggesting that the effect was more likely to be post-synaptic. Similarly, in the same effective dose group, we found some evidence that amisulpride reduces response stickiness indicating increased switching between actions. This is well in line with a prominent model of the cortico-striatal circuitry implicating post-synaptic D2 receptors in exploration/exploitation65 and supported by empirical data. In animal studies, activation of D2 receptors was shown to lead to choice perseverance and more deterministic behaviour, whereas D2 receptor inhibition increases the probability of performing competing actions and increases randomness in action selection76. In humans, a recent neurochemical imaging study showed that D2 receptor availability in the striatum correlated with choice uncertainty parameters across both reinforcement learning and active inference computational modelling frameworks77. Increased choice uncertainty was also observed in a social and non-social learning tasks in a study using 800 mg of sulpiride, a dose that is known to exert post-synaptic effects54,78. We note, however, that the evidence for the difference in exploration between the low and high serum groups was not robust (p=0.066). Furthermore, it has been suggested that increased striatal dopamine is also related to tendency for stochastic, undirected exploration79,80, arising due to overall uncertainty across available options79 or through increasing the opportunity cost of choosing the wrong option68,71. This suggests that the same biological signature that leads to increased cognitive effort expenditure also promotes choice exploration. In line with this, both prior studies that investigated the effect of increasing dopamine availability with L-DOPA on model-based/model-free behaviour observed increase choice exploration as well as increased model-based behaviour (although in one it was only present in individuals with a higher working memory capacity)55,58.”

      With regards to the design, it is unfortunate that the order of drug administration is not counterbalanced. As far as I understand, model-based control is always measured without a drug in the first session, and then with the drug (or placebo) in the second. The change between sessions is then tested for all three conditions. Of course, it is possible that the increase in model-based control in the amisulpride condition is only driven by the drug. However, given the lack of counterbalancing, it's also possible that amisulpride increases model-based control only after the experience with the task. That is, if the authors had counterbalanced the drug effect, they may have found that amisulpride had a different effect if it was administered in the first session. That would have changed their interpretation quite a bit! As it stands, they are unable to verify their (admittedly simpler) hypothesis that there is only a main effect.

      We thank the reviewer for this comment. Indeed, a full within-subject design would have been statistically more powerful and would have enabled us to exclude the possibility that amisulpride’s effect on model-based behaviour is indirect. We have now included the following paragraph in the discussion that aims to highlight the limitation of not counterbalancing the drug administration (p.10):

      “One of the strengths of our design is a baseline measure, and the fact that the participants were all introduced to the task under no administration, thus avoiding potential effects of the treatment on task training. Although this design allowed to reduce between-subjects variability, we cannot completely exclude order effects. Although unlikely, it is possible that the effects of the treatment that we observe come indirectly from the effects of the two drugs on either skill transfer from the previous session, or simply on the effect of the drugs on the part of the experiment that preceded the task. For instance, participants under amisulpride could be less tired from other tasks and therefore more willing to exert effort in the task presented here. Speaking against this is the observation that we found no differences in mood between amisulpride and placebo regardless of low or high serum levels.”

    1. the essay is like a journey, we may be more mindful of our intended audience, with whom we are bringing along as fellow travelers.

      this is an interesting way to think about it, but it kind of helps!

    1. notasclearandneatasitmightseem.Toactasthoughitwereistoinvite allkindsoftrouble.Ifwepretendthatourrolebehaviorissomehownotconnectedtowhowereallyare,forexample,thenweavoidtakingrespon­sibilitynotonlyfortherolebutalsoforourportionoftheplay

      In this passage I found this example of social life as theater very interesting and well said. The way we participate in social life is based on the action of our behavior and how we use it to interact with each other. Although our actions may differ based on the social system that we are in or the people we are interacting with, the way an individual acts still represents who they are as a person because they are choosing to act that way on their own. They are choosing to act that way because that's how they want other people to view them as. However, I do think that this concept can also be a little confusing to people. This is because in specific environments where individuals are pretending to act in a certain way because of different circumstances, may argue against that concept and say that they had a reason to act like that. Whether it was to get out of that situation or because they were uncomfortable or simply because they wanted to fit in. Ultimately, I think that it is still associated with who you are as a person.

    1. Author Response

      Reviewer #1 (Public Review):

      This study presents a series of experiments that investigate maternal control over egg size in honey bees (Apis mellifera). Honey bees are social insects in which a single reproductive female (the queen) lays all the eggs in the colony. The first set of experiments presented here explore how queens change their egg size in response to changes in colony size. Specifically, they show that queens have relatively larger eggs in smaller colonies, and that egg size changes when queens are transplanted into colonies of a different size (i.e. confirming that egg size is a plastic trait in honey bee queens). The second set of experiments investigates candidate genes involved in egg size determination. Specifically, it shows that Rho1 plays a role in determining egg size in honey bee queens.

      In principle, we agree with this summary, although we find the experimental demonstration that perceived colony size affects egg size (first set of experiments) and the overall proteomic comparison of ovaries that produce small and large eggs (second set of experiments that indicate the upregulation of metabolism, protein transport, cytoskeleton organization, and a few other processes in large egg-producing ovaries) also important.

      A strength of the study is that it combines both manipulative field (apiary) experiments and molecular studies, and therefore attempts to consider broadly the mechanisms of plasticity in egg size. The link between these two types of dataset in the manuscript, however, is not strong. While the two parts are related, the molecular experiments do not follow from the conclusions of the field experiments but rather run in parallel (both using the same initial treatments of queens from large v small colonies).

      We would welcome suggestions on how to further strengthen the integration between the field experiments and our molecular studies. We sought to explore the molecular basis of the observed plasticity in reproductive behavior and thus focused on samples from the first set of experiments for our proteome comparisons, realizing that every additional field experiment could have entail a similar molecular follow-up. We attempted to bring molecular studies and field experiments back together with the RNAi-mediated knock-down of Rho1 in queens that produce eggs in differently-sized colonies under realistic apicultural conditions. There may be better, additional opportunities for a closer integration of molecular and field experiments, but we could not conceive of them.

      Another strength of the study is the focus on social cues for egg size control in a social insect. Particularly interesting is data showing that queens suddenly exposed to the cues of a larger colony (even where egg-laying opportunities did not actually increase) will decrease their egg size, in the same way as queens genuinely transplanted to larger colonies. That honey bee queens can control their egg sizes in response to cues in the colony is not unexpected, given that queens are known to vary egg size based on the cell type they are laying into (queen, drone or worker cell). Nevertheless, it is interesting to show that worker egg sizes over time are also mediated by social cues.

      We thank the reviewer for this positive assessment and want to highlight that this experiment not only controls for egg laying opportunities, but also for potentially greater resource availability in larger colonies. These results are therefore important for the key argument that egg size is actively regulated by honey bee queens.

      A weakness of the study is that the consequence of egg size on egg development and survival in honey bees is not made clear. The assumption is that larger egg size compensates for smaller colonies in some way. Do smaller eggs (i.e. those laid in large colonies) fare worse in smaller colonies than they do in large colonies? Showing that the variation in egg size is biologically relevant to fitness is an important piece of the puzzle.

      We agree that the consequences of egg size variation are important to address beyond our previously published data set and the benefits demonstrated in other contexts by other authors. However, to comprehensively resolve the consequences requires considerable additional experiments that exceed the scope of our current study, which is primarily focused on the causes of the queens’ reproductive plasticity.

      Also, the relationship between egg number and egg size in honey bees remains rather murky. Does egg size depend at least in part on daily egg laying rate (which is sure to be greater in larger colonies)? The study makes an effort to explore this by preventing queens from laying for two weeks and then comparing their egg size when they resume to those that did not have a pause in laying. Although egg size did not vary between the groups in this case, it is unclear whether the same effect would be seen if queens had simply been restricted from laying at such high rates (e.g. if available empty brood cells had been reduced rather than removed entirely).

      We agree that the relation between egg number and egg size is complicated. We have added more data that show that egg laying rates can be higher in larger colonies than in smaller colonies. We also report now that the egg size is negatively correlated to egg number, although not in all instances, which partially supports (and partially contradicts) our previous findings (Amiri et al. 2020). We have modified the discussion of our results to account for the additional results and point out the limitation of the experiment with caged queens. It is important to realize though that the queens were caged on comb and not restricted in typical, small queen cages that are used for queen transport. It is not clear whether this treatment resulted in a downregulation of the reproductive efforts and/or the resorption of eggs.

      Overall this study makes new contributions to our understanding of maternal control over egg size in honey bees. It provides stepping stones for further investigation of the molecular basis for egg size plasticity in insects.

      We agree that we could not resolve everything in this study and that more investigations are needed.

      Reviewer #2 (Public Review):

      This paper builds on recent work showing that honeybee queens can change the size of the eggs they lay over the course of their life. Here the authors identified an environmental condition that reversibly causes queens to change their egg sizes: namely, being in a relatively small or large colony context. Recently published work demonstrated the existence of this egg size plasticity, but it was completely unknown what signaled to the queen. In a series of simple and elegant experiments they confirmed the existence of this egg size plasticity, and narrowed down the set of environmental inputs to the queen that could be responsible for signaling the change in the environment. They also began the work of identifying genes and proteins that might be involved in controlling egg size. They did a comparative proteomic analysis between small-egg-laying ovaries and large-egg-laying ovaries, and then selected one candidate gene (Rho1). They showed that it is expressed during oogenesis, and that when it is knocked down, eggs get smaller.

      This is a good summary, although we think that it is fair to add that the expression of Rho1 is specific to the egg growth stage, and that we found an almost perfect correlation of Rho1 mRNA levels and egg size in two separate experiments (in addition the difference between large and small egg-producing ovaries at the protein level).

      The experiments on honeybee colonies are well-designed, and they provide fairly strong evidence that the queens are reversibly changing egg size and that it is (at least some component of) their perception of colony size that is the signal. One minor but unavoidable weakness is that experiments on honeybees are necessarily done with small sample sizes. The authors were clear about this, however, and it was very effective that they showed all individual data points. Alongside the previous work on which this paper builds, I found their core results to be rather convincing and important.

      We thank the reviewer for this positive evaluation.

      I found the parts of the paper on oogenesis to be useful, but overall less informative in answering the questions that the authors set out for those sections. On balance, I think the best way to interpret the oogenesis results is as "suggestive and exploratory". For instance, the experiment aimed at understanding the relationship between egg-laying rate and egg size does not include a direct measurement of egg-laying rate, but instead puts queens in a place with no suitable oviposition sites. The proteomic analysis was fine, but since they were using whole ovaries, with tissue pooled across all stages of oogenesis including mature oocytes, I would be cautious in interpreting the results to mean that they had identified proteins involved in making larger eggs. These proteins might just as easily be the proteins that are put into larger eggs. In fact, for the one candidate gene that is examined, its transcripts seem as though they are predominantly in the oocyte cell itself rather than in the supporting cells that actually control the egg size (although it is hard to tell from the micrographs without a label for cell interfaces).

      We have added data on the number of eggs produced in the first experiment, which actually show a negative correlation between egg size and egg number. In addition, we have cautioned our wording about the conclusions that can be drawn from the oviposition restriction experiment. Concerning the expression and role of Rho1, we apologize for the lack of a cell membrane marker. However, we share the reviewer’s interpretation that the mRNA is located in the oocyte. While we also agree that egg loading from the nurse cells is important, transport of vitellogenin from the follicle cells may also be quite significant for egg size (Wu et al. 2021 – doi:10.3389/fcell.2020.593613 and Fleig 1995 - doi:10.1016/0020-7322(95)98841-Z), a process that could be controlled by Rho1 in the documented location. We have added to the discussion to clarify this point.

      On that note, with the caveat that the sample sizes are quite small, I agree that there is some evidence that Rho1 is involved in honeybee oogenesis. If this was the only gene they knocked down, and given that it results in a small size change with such a small sample size, it strikes me as a bit of a stretch to say that these results are evidence that Rho1 plays an important role in egg size determination. It is essential to know if this is a generic result of inhibiting cytoskeletal function or a specific function of Rho1. That is beyond the scope of this study, but until those experiments are done, it is hard to know how to interpret these results. For context, in Drosophila, there are lots and lots of genes such that if you knock them down, you get a smaller or differently shaped egg, including genes involved in planar polarity, cytoskeleton, basement membrane, protrusion/motility, septate junctions, intercellular signaling and their signal transduction components, muscle functions, insect hormones, vitellogenesis, etc. This is helpful, perhaps, for thinking about how to interpret the knockdown of just one gene.

      We thank the reviewer for this perspective and have consequently cautioned our wording. The role of Rho1 in regulating the cytoskeletal function has been established in other organisms, but we do not have the tools to study the corresponding pathways and establish causality in honey bees. We have added to the discussion to alert the reader to the point that additional experiments are necessary.

      Overall, I found the results to be technically sound, and there are several clever manipulations on honeybee colonies that will doubtless be repeated and elaborated in the future to great effect. The core result-that queens can change the size of their eggs quickly and reversibly, in response to some perceived signal-was honestly pretty astonishing to me, and it reveals that there are non-nutritive plastic mechanisms in insect oogenesis that we had no idea existed. I look forward to follow-up studies with interest.

      We thank the reviewer for the overall evaluation and encouragement to continue our research.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors compared the various multinucleated cells, osteoclasts, LCG and FBGC. Overall, the manuscript shows rigor in the analyses, and also very interesting approaches for retrieving mononuclear cells, for instance using DC-STAMP siRNA. This work adds very much to understanding the biological differences, as summarized in figure 6h. A lot of work in osteoclast field with for instance qPCR is hampered because, inevitably, a mix of mononuclear and multinucleated cells is always measured. Here, a solid attempt to separate those mixes with cell sorting and subsequent analysis on the mononuclear and multinucleated isolates, really adds. Choice of figures is good, also the extra info of the supplementary figures is relevant and makes it easy to read.

      Major and minor concerns:

      1. For osteoclasts, various markers exist for their biological characterization, for instance the ability to resorb bone. What, apart from the arrangement and number of nuclei, were the biological parameters that confirmed that the cells made by addition of IFN or IL-4 were LCG and FBGC? [Authors’ reply]. In order to address this point, we focused on gene sets that characterize LCGs and FBGCs. By doing so, we aimed to identify (i) lineage dependent factors and (ii) markers of LGCs and FBGCs. (See new Supplementary Figure 1B and C, New Supplementary Table 1 and highlighted text in Results). As expected, and in line with the lineage-determining factors, the transcriptomics comparison between mononucleated/multinucleated IFN-γ and IL-4-differentiated macrophages showed predominance of IFN-γ and IL-4-related pathways, respectively (Supplementary Figure 1B and C and Supplementary Table 1). Among known LGC and FBGC markers, we confirmed up-regulation of CCL7 [1] and CD86 [2], respectively.* As per the biological parameters, we indeed confirm that FBGCs show enhanced phagocytosis properties (Figure 5C) while LGCs can form granuloma-like clusters in vitro (Figure 4D and E). Altogether, we characterize LGCs and FBGCs with (i) polykaryon-specific nuclear arrangement, (ii) polykaryon-specific gene expression markers, (iii) previously shown and new phenotypic characteristics such as LGCs’ unique ability to form in vitro clusters containing CD3+ cells. *

      In fig 2c: did the authors perform stainings with isotype control antibodies? In my experience, quite often, antibodies stain mononuclear cells much intenser, since the cytoplasm is much more condense, less spread over a large area.

      [Authors’ reply]. According to the reviewer’s suggestion, we provide isotype control staining for MRC1 in IFN-g-stimulated mononucleated/multinucleated cells by ImageStream (left panel) and immunofluorescence in LGCs, FBGCs and osteoclasts (right panel). There was negligible staining with the isotype control antibody for MRC1 in both settings (Figure provided to the journal).

      *We did not observe a potential artefact of staining in multinucleated cells when compared to mononuclear cells. In fact, some markers of multinucleation such as B7-H3 is augmented in LGCs (Figure 4E). *

      Resorption assay in 6 is not clear. It is weird that osteoclasts apparently display so limited resorption? Also the traces are not typical for osteoclasts. Please explain.

      [Authors’ reply]. Human osteoclasts are cultured for 2 days on hydroxyapatite-coated plates and the amount of resorption is dependent on the healthy donor the peripheral blood is derived from. In addition to genetic variability, the support (hydroxyapatite) is different from dentine, which is also widely used for measuring osteoclast resorptive activity. The visualization of the human osteoclast resorption is made by transparency (area not coated by hydroxyapatite due to its resorption) on image J.

      Provide a better image Supplementary 2A, even at 250% the lettering is vague. What do the colours in 2A mean?

      [Authors’ reply]. *According to the reviewer’s suggestion, we now provide the Supplementary Figure 2A with better resolution. In STRING protein-protein interaction analysis, there is no particular meaning of the node color itself. *

      CROSS-CONSULTATION COMMENTS

      I have read the comments of the other two reviewers, and together. I absolutely agree with their additions, Indeed, supplementary tables are lacking, as well as there could be a bit more emphasis on the fact that it is all in vitro work. Together, I think the three of us are complementary in our comments, with good overlap as well. Any effort to stain for instance pathology material with the markers that have been found, would be great, especially for the LGC and the FBGC, that are much less studied in the field of MNGs. Having said that ,I can also live without this addition, but then it could be highlighted in the discussion that these are the future avenues that should be considered. Collaborate with Pathology!

      [Authors’ reply]. We appreciate that the reviewer provides cross-consultation comments which we address in our revised manuscript. As such, we discuss future avenues regarding the translatability of these results to human pathology involving MGCs.

      Reviewer #1 (Significance (Required)):

      This manuscript is particularly interesting to those who are interested in the BIOLOGY of MNCs. In essence, three types of MNCs were cultured and compared, with each of them a specific function.

      I am an osteoclast expert (76 publications), and have two publications on FBGCs

      [Authors’ reply]. *We sincerely thank the reviewer for his/her pertinent comments, enthusiasm for our findings and for providing us an overall summary of our findings in view of all other reviewer comments. *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, the authors performed a comparative transcriptome analysis of mononuclear and multinuclear human osteoclasts, LGCs and FBGCs. They found that multinucleation triggers a significant downregulation of macrophage identity in all three types of MGCs. Furthermore, RNA-seq data and in-vitro functional analysis of multinucleated cells showed that macrophage cell-cell fusion and multinucleation enhance phagocytosis and contribute to lysosome-dependent intracellular iron homeostasis. Furthermore, multinucleation of osteoclasts promoted mitochondrial activity and oxidative phosphorylation, resulting in maximal respiration. This unique and interesting study addresses the fundamental question of how cell-cell fusion and multinucleation contribute to cellular activity and biological homeostasis.

      Major comments

      1 The authors generated mature multinucleated cells by stimulating human PBMC-derived macrophages with either IFN-g, IL-4, or RANKL. However, no quantitative data have been presented to determine how effectively IL-4, IFN-g, and RANKL can induce multinucleated giant cells from mononuclear macrophages. Quantitative data showing induction efficiency would provide a more detailed picture of the overall experiment.

      [Authors’ reply]. According to the reviewer’s suggestion, quantitative data showing the efficiency of these cytokines to induce multinucleation (i.e. fusion index) is now provided as part of the revised Supplementary Figure 1A (right panel).

      2 The authors mentioned, "The distinct morphological appearance of these three types of MGCs (Figure 1B) suggested cell type-specific functional properties and shared mechanisms underlying macrophage multinucleation". However, there is no discussion or data showing how the nuclear arrangement and intracellular location affect the biological function of multinucleated cells.

      [Authors’ reply]. This is good point and is now discussed in the revised manuscript (see highlighted text in revised manuscript and below).

      Whether MGC-specific nuclear arrangements and/or numbers are indicative of specialized function is currently unclear. Intracellular nuclei arrangement is likely to be important for the sealing zone formation in a polarized bone-resorbing osteoclast. Furthermore, whether distinct transcriptional activities are assigned to different nuclei of the MGC also remain to be tested. Recent elegant work performed in multinucleated skeletal myofibers suggest transcriptional heterogeneity among the different nuclei of the polykaryon [3].

      3 Based on the results of DC-stamp knockdown experiments, the authors concluded that cell-cell fusion and multinucleation suppress the mononuclear phagocytic gene signature. However, to strengthen this hypothesis, it would be necessary to provide at least data showing that DC-stamp knockdown reduces the number of multinucleated cells.

      [Authors’ reply]. According to the reviewer’s suggestion, we provide data showing that DCSTAMP knockdown reduces multinucleation in LGCs and FBGCs (see below and new Supplementary Figure 2F). For human osteoclasts, the data was included in our previously published paper ([4] and figure provided to the journal).

      4 In Figure4, the authors showed that transcripts in LGCs were enriched for antigen presentation and adaptive immune system pathways. In addition, multinucleation of LGCs increased the surface expression of B7-H3 (CD276) and colocalized with CD3+ cells, suggesting that LGC multinucleation potentiates T cell activation. However, the authors did not present enough data to demonstrate the antigen-presenting ability of LGCs or their specific T cell activating capacity.

      [Authors’ reply]. We agree with the reviewer that our data on a potential role of LGCs’ on T cell activation is based on increased surface expression of B7-H3 and the unique CD3+ cluster forming ability of LGCs. In order to check for further markers of antigen presentation, we have performed MHC-1 and MHC-2 quantification by ImageStream in 3 types of MGCs (figure provided to the journal).

      Although there was no difference in MHC-I/MHC-2 between the mononucleated and multinucleated macrophages, the mean fluorescent intensity (MFI) range was the highest in IFN-g-stimulated macrophages, suggesting that LGCs may be better equipped for antigen presentation than the other 2 types of MGCs. A more comprehensive analysis of antigen presentation requires enzymatic digestion and isolation and phenotyping of LGCs from clusters in vitro and human tissues in vivo. This is a program of research that we have initiated as part of a separate study, which will focus on the in vivo relevance of the current findings such as the unique Ag presentation ability of LGCs in a non-sterile tissue environment.

      5 Figure 6 clearly shows that mature multinucleated osteoclasts exhibit increased ATP production and maximal respiration. However, the glycolytic pathway did not differ between mononuclear and multinuclear osteoclasts. No explanation for this observation has been provided. It is easy to understand that osteoclasts acquire ATP through aerobic respiration during multinucleation. But how NADPH, which is essential for its redox reaction, is produced? Is it by acquiring αKG from the glutamine pathway?

      [Authors’ reply]. This is a point worth expending (see also discussion; highlighted text). Osteoclast multinucleation is characterized by increased mitochondrial gene expression which also translates into increased spare respiratory capacity (SRC or maximal respiration). This metabolic rewiring does not modify glycolysis and basal respiration rate. As the reviewer correctly states, increased SRC may be a way to supply more ATP to the energy-demanding polykaryon.

      As per the production of NAD(P)H as an electron source for ETC, it could indeed be through glutamine rather than glucose usage in multinucleated osteoclasts. Furthermore, as iron is an essential cofactor for ETC activity through activity of iron-sulfur clusters, the mitochondrial concentration of iron is likely to be critical for the mitochondrial activity of multinucleated osteoclasts (see also discussion).

      Minor comments:

      6 Supplementary tables 1-6 were not provided.

      [Authors’ reply]. We apologize for this. The revised versions of supplementary tables are provided as part of the revised manuscript.

      7 Figure 2D right panel, difficult to see DAPI+ nuclei.

      [Authors’ reply]. Thanks for pointing this out. We have now replaced Figure 2D with a more pronounced DAPI+ nuclei.

      Reviewer #2 (Significance (Required)):

      Although it is well known that multinucleation of cells constantly occurs, especially in osteoclasts, skeletal muscle, and trophoblasts of the placenta, the biological significance of multinucleation and the intracellular functions of multinucleation are not well understood. In this unique study, three types of multinucleated cells were generated from human peripheral blood to elucidate the genetic and functional differences between mononucleated and multinucleated cells. Furthermore, by demonstrating the possibility that the morphological peculiarity of multinucleation can regulate cell function, this paper provides clues to understanding the underlying biology of multinucleated cells and how they maintain cell function in homeostatic and pathological settings.

      [Authors’ reply]. We thank the reviewer for finding our study unique and biologically meaningful. We also thank the reviewer for all the suggestions that improved significantly the overall message of the manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      The manuscript of Ahmadzadeh and Pereira et al is an interesting study of the fusion process key to the formation of multinucleated giant cells (MGCs). Our current ability to discriminate between different types of MGCs is limited, and there are gaps in our understanding of the molecular determinants of cell fusion. In this study, the authors isolated different MGC variants - osteoclasts, Langhans giant cells (LGCs) and foreign body giant cells (FBGCs) and identified common, as well as MGC-specific genes and pathways involved in the process of cell fusion. The approach of isolating and comparing different types of MGCs is novel, and the manuscript is well presented and written. However, due to the in vitro nature of the study, the physiological significance of the findings is unclear. I have further minor and major points for the authors to address, as detailed below.

      Minor comments:

      1. The approach to isolate the different MGCs using FACS and imaging technique is highly novel. However the difference between MGC subtypes isolated isn't immediately apparent beyond the morphological comparisons. In my opinion some of the results of MGC-specific assays from Figures 4, 5 and 6 can be included in Figure 1, e.g. TRAP staining and hydroxyapatite resorption for osteoclasts, to provide evidence of purity and specificity of each MGC subtype early on in the manuscript. Classical or canonical genes associated with each MGC subtype can also be highlighted in the volcano plots in Figure 1C, e.g ACP5, CTSK, TNFRSF11A for osteoclasts. [Authors’ reply]. We thank the reviewer for this point and we agree it is important to highlight markers for each polykaryon early in the manuscript. In accordance with this reviewers’ comment (and also with Reviewer 1’s point), we first verified existence of lineage-dependent factors and markers of LGCs and FBGCs as these cells are relatively less well-defined compared to osteoclasts. (New Supplementary Figure 1B and C and New Supplementary Table 1). As expected, and in line with the lineage-determining effects, the transcriptomics comparison between mononucleated/multinucleated IFN-γ and IL-4-differentiated macrophages showed predominance of IFN-γ and IL-4-related pathways, respectively (New Supplementary Figure 1B and C and New Supplementary Table 1). Among known LGC and FBGC markers, we confirmed up-regulation of CCL7 [1] and CD86 [2], respectively (New Supplementary Table 1). We have added this information in the revised manuscript (see highlighted text). Osteoclast phenotyping is provided by TRAP staining and resorption assay (Figure 6C) and we also confirm that CTSK is indeed significantly up-regulated upon multinucleation (LogFc=1.69; P=9.2 x 10E-6; highlighted in the revised manuscript).

      The overall decrease in phagocytic identity of all the MGCs, and the specific upregulation of phagocytic pathways in the FBGCs are conflicting. Are there subsets of phagocytic pathways that were down and upregulated during the formation of FBGCs?

      [Authors’ reply]. This is a very good point. As the reviewer indicates, the results suggest that subsets of phagocytic pathways are changed upon multinucleation. All three types of MGCs show a downregulation of transcripts that belong to Fc receptors and complement C1Q family. However only FBGCs show an up-regulation of S. Aureus bioparticle-mediated phagocytosis. Hence the exact surface receptors responsible for this pathogen clearance remain to be identified. FBGC phagocytosis is a complex process including non-canonical phagocytosis pathways and participation of increased membrane area and endoplasmic reticulum [5, 6]*. Whether these pathways are specifically induced in human FBGCs remain to be identified. We now discuss this point in the revised manuscript (see highlighted text in Discussion). *

      What are the identities of the mononuclear cells in each of the MGC experiment? They appeared to be quite heterogeneous based on the DEGs identified, beyond the common phagocyte signature. Can the authors comment on the difference between the mononuclear cells and whether this will affect the DEG analysis?

      [Authors’ reply]. This is also a very relevant point that we now address in the revised manuscript (New Supplementary Figure 1B and C; New Supplementary Table 1 and highlighted revised text in Results). The reviewer is correct that MGC-specific pathways are in line with the known function of each polykaryon (Figure 4A, 5A and 6A). To what extent lineage-dependent effects (e.g. IFN-g and IL-4) are conserved between the mononucleated and multinucleated state is yet to be determined. In order to address this point, we compared DEG in IFN-g and IL-4-differentiated mononucleated macrophages to the ones obtained in multinucleated macrophages (New Supplementary Figure 1B and C; New Supplementary Table 1). The results showed that the multinucleated cell state preserves the majority of the lineage-dependent pathways which are very significantly represented at the mononucleated cell state (e.g. IFN-g and IL-4-related pathways). Interestingly, although less significant, this analysis also showed pathways that were specific to the mononucleated or multinucleated state in IFN-γ-differentiated macrophages when compared to IL-4-differentiated ones and vice versa. (Supplementary Figure 1B and C). For instance, TRAF3-dependent IRF activation pathway is specific to mononucleated IFN-g-differentiated macrophages (Supplementary Figure 1B).

      The authors should also frame/discuss the findings in the context of diagnostic and therapeutic potentials to highlight the clinical significance of this study.

      [Authors’ reply]. We thank the reviewer for this point and we now discuss our results from a clinical/diagnostic perspective (see highlighted text in the Discussion and below).

      From a clinical perspective, since lysosome-regulated intracellular iron homeostasis appears to be a general condition for macrophage multinucleation across different tissues, its blockade may hold therapeutic potential. However, it is still unclear whether granulomatous disease can benefit from targeting LGC fusion. For non-granulomatous inflammatory diseases, inhibiting MGC formation by targeting lysosomes may be a therapeutic avenue. This approach would avoid FBGC-related adverse effects during foreign body reaction or inhibit the formation of MGCs of white adipose tissue during obesity. v-ATPase inhibitors have been previously proposed to inhibit osteoclast activity and bone resorption [7]* so their selective targeting in the lysosomal compartment may be generalized to other MGCs such as FBGCs. In addition to potential clinical translation, the results presented in this study require confirmation in tissues originating from human pathology involving MGCs. *

      Major comments:

      • As mentioned before, the physiological significance of the findings is unclear. Some form of in vivo data is needed to support some of the key conclusions of the study, e.g validating some of the markers of the pathways identified (common and MGC subtype-specific), and the role of lysosome-mediated iron homeostasis in multinucleation. The authors can make use of the FACs and imaging approaches they developed to look at MGCs in relevant tissues. [Authors’ reply]. This is an important point that we would like to explore in a comprehensive way. We have initiated a 2-year program to undertake a Multiplexed Immunohistochemistry (mIHC) using MILAN (Multiple Iterative Labeling by Antibody Neodeposition) https://www.lpcm.be/multiplex-ihc-milan approach in human biopsies using >100 antibodies. The current study is pivotal in selecting the gene targets (i.e. common and MGC-specific markers) for prioritization. We foresee to gain critical pathophysiological information about the tissue characteristics of MGCs. The reviewer would acknowledge that these high-throughput and biopsy-based initiatives are lengthy and not the primary scope of our current findings which set the foundation of major cellular events governing multinucleation in macrophages.

      Reviewer #3 (Significance (Required)):

      Significance:

      • The approach of isolating and comparing different types of MGCs is novel, and the findings certainly improved our understanding of the fusion processes of MGCs. However, the physiological role of these processes in health and disease that involve MGCs is still unclear due to the lack of in vivo data. The findings were discussed in quite a bit of detail in the context of current literature, though clinical impact was not explored. [Authors’ reply]. *We are grateful to Reviewer 3 for raising relevant and constructive points regarding the main findings. His/her review significantly improved the clarity of the overall manuscript. *

      We recognize our study lacks human clinical association, but we highlight the prospective translatability of our findings and the usage of donor-based human macrophages throughout the manuscript. As also recommended by Reviewer 1 in his/her cross-consultation, we discuss the potential clinical impact of our findings in the Discussion of our revised manuscript.

      • My background is bone biology with a very keen interest in osteoclast biology so arguably my knowledge on other MGCs eg LGCs and FBGCs is limited. References

      • Chen Y, Jiang H, Xiong J, Shang J, Chen Z, Wu A, Wang H: Insight into the Molecular Characteristics of Langhans Giant Cell by Combination of Laser Capture Microdissection and RNA Sequencing. J Inflamm Res 2022, 15:621-634.

      • McNally AK, Anderson JM: Foreign body-type multinucleated giant cells induced by interleukin-4 express select lymphocyte co-stimulatory molecules and are phenotypically distinct from osteoclasts and dendritic cells. Exp Mol Pathol 2011, 91(3):673-681.
      • Petrany MJ, Swoboda CO, Sun C, Chetal K, Chen X, Weirauch MT, Salomonis N, Millay DP: Single-nucleus RNA-seq identifies transcriptional heterogeneity in multinucleated skeletal myofibers. Nat Commun 2020, 11(1):6374.
      • Pereira M, Ko JH, Logan J, Protheroe H, Kim KB, Tan ALM, Croucher PI, Park KS, Rotival M, Petretto E et al: A trans-eQTL network regulates osteoclast multinucleation and bone mass. Elife 2020, 9.
      • McNally AK, Anderson JM: Multinucleated giant cell formation exhibits features of phagocytosis with participation of the endoplasmic reticulum. Exp Mol Pathol 2005, 79(2):126-135.
      • Milde R, Ritter J, Tennent GA, Loesch A, Martinez FO, Gordon S, Pepys MB, Verschoor A, Helming L: Multinucleated Giant Cells Are Specialized for Complement-Mediated Phagocytosis and Large Target Destruction. Cell Rep 2015, 13(9):1937-1948.
      • Qin A, Cheng TS, Pavlos NJ, Lin Z, Dai KR, Zheng MH: V-ATPases in osteoclasts: structure, function and potential inhibitors of bone resorption. Int J Biochem Cell Biol 2012, 44(9):1422-1435.
    1. Author Response

      Reviewer #1 (Public Review):

      In this manuscript the authors describe an approach for controlling cellular membrane potential using engineered gene circuits via ion channel expression. Specifically, the authors use microfluidics to track S. cerevisiae gene expression and plasma membrane potential (PMP) in single cells over time. They first establish a small engineered gene circuit capable of producing excitable gene expression dynamics through the combination of positive and negative feedback, tracking expression using GFP (Figure 1). Though not especially novel or complex, the data quality is high in Figure 1 and the results are convincing. Note that the circuit is excitable and not oscillatory; it is being driven periodically by a chemical inducer. I think the authors could have done a better job justifying the use of an excitable engineered gene circuit system, since you could get a similar result by just driving a promoter with the equivalent time course of inducer.

      We restructured the manuscript by presenting the open-loop version of our synthetic circuit and demonstrate that closed loop system integrating feedbacks performs significantly better than its open-loop version (revised Figures 1 and 3). This open-loop system is based on Mar proteins that can synchronizes gene expression on extended spatiotemporal scales (PerezGarcia et al., Nat Comm, 2021). Other driven systems (i.e., TetR, AraC, LacI) can temporally synchronize gene expression in single bacteria cells to successive cycles of inducer. However, over time these bacterial systems build substantial delays in phases between cells, partially due to noise that ultimately led to desynchrony between individual cells even though they tend to follow the common inducer. This is clearly not the case in Mar-based systems (Perez-Garcia et al., Nat Comm, 2021) as eukaryotic cells synchronize to each other under guidance of common environmental stimuli with neglectable phase drift. Furthermore, in revised version we show that dual feedback strategy provides a robust solution to control ion channel expression and associated changes in PMP (see Conclusions lines 231-237).

      The authors then use a similar approach to produce excitable expression of the bacterial ion channel KcsA, tracking membrane voltage using the voltage-sensitive dye ThT rather than GFP fluorescence (Figure 2). The experimental results in this figure are more novel as the authors are now using the expression of a heterologous ion channel to dynamically control plasma membrane potential. While fairly convincing, I think there are a few experimental controls that would make these results even more convincing. It is also unclear why the authors are now using power spectra to display observed frequencies compared to the much more intuitive histograms used in Figure 1.

      Now we use violin plots with period distributions consistently in all figures to ease the comparison between scenarios.

      Finally, the authors move on to use a similar excitable engineered gene circuit approach to produce inducible control of the K1 toxin which influences the native potassium channel TOK1 rather than the heterologous ion channel KcsA (Figure 3). I have a similar reaction to this figure as with Figure 2: the results are novel and interesting but would benefit from more experimental controls. Additionally, the image data shown in Figure 3b is very unclear and could be expanded and improved.

      In revised version we have decided to remove K1 toxin data as we are aware that we cannot modulate K1 degradation rate due to its extracellular nature. Instead, we have decided to perform additional experiments in which we directly plugged our circuit to TOK1 native potassium channel to demonstrate that our feedback-integrating synthetic circuit is capable of controlling TOK1 dosage and associated PMP changes (revised Figure 3, and lines 209-220). We believe these new data make more direct connection between synthetic circuits phytohormones and native channel expression than presented earlier K1-based scenario.

      Overall, in my opinion the claims in the abstract and title are a bit strong. I would deemphasize global coordination and "synchronous electrical signaling" since the authors are driving a global inducer. To make the claim of synchronous signaling I would want to see spatial data for cells near vs. far from K1 toxin producing cells in Figure 3 along with estimates of inducer/flow timescale vs. expression/diffusion of K1 toxin. As I read the manuscript, I see that most of the synchronicity comes from the fact that all cells are experiencing a global inducer concentration.

      We agree with the Reviewer, synchronicity and global coordination comes from phytohormone sensing feedback circuit that is guided by cyclic environmental changes. We have revised definition of synchronous signaling as suggested, focusing on the macroscopic synchronization of ion channel expression achieved by external modulation, which is the key message coming from this work.

      Reviewer #2 (Public Review):

      The authors present a novel method to induce electrical signaling through an artificial chemical circuit in yeast which is an unconventional approach that could enable extremely interesting, future experiments. I appreciate that the authors created a computer model that mathematically predicts how the relationship between their two chemical stimulants interact with their two chosen receptors, IacR/MarR, could produce such effects. Their experimental validations clearly demonstrated control over phase that is directly related to the chemical stimulation. In addition, in the three scenarios in which they tested their circuit showed clear promise as the phase difference between spatially distant yeast communities was ~10%. Interestingly, indirect TOK1 expression through K1 toxin gives a nice example of inter-strain coupling, although the synchronization was weaker than in the other cases. Overall, the method is sound as a way to chemically stimulate yeast cultures to produce synchronous electrical activity. However, it is important to point out that this synchronicity is not produced by colony-colony communication (i.e., self-organized), but by a global chemical drive of the constructed gene-expression circuit.

      The greatest limitation of the study lies in the presentation (not the science). There are two significant examples of this. First, the authors state this study 'provides a robust synthetic transcriptional toolbox' towards chemo-electrical coupling. In order to be a toolbox, more effort needs to be put into helping others use this approach. However little detail is given about methodological choices, circuit mechanisms in relation to the rest of the cell, nor how this method would be used outside of the demonstrated use case. Second, the authors stress that this method is 'non-invasive', but I fail to see how the presented methodology could be considered non-invasive, in in-vivo applications, as gene circuits are edited and a reliable way to chemically stimulate a large population of cells would be needed. It may be that I misunderstood their claim as the presentation of method and data were not done in a way that led to easy comprehension, but this needs to be addressed specifically and described.

      We apologize Reviewer for a potential misunderstanding. By ‘non-invasive’ we meant that such systems would not need, for instance, the surgical installation of light components to control ion channel activity. Nonetheless, we have removed these confusing sentences from the revised manuscript.

      The rational for using Mar-based system with feedback strategy data has been now presented in more structured and comprehensive way across the revised manuscript to demonstrate benefits from integrating feedback as well as potential of such systems for excitable dynamics with noise-filtering capability and faster responsiveness. We also show how system can be coupled to native potassium channels, opening ways to integrate synthetic circuit into other organisms.

      In terms of classifying the synchronicity, while phase difference among communities was the key indicator of synchronization, there were little data exploring other aspects of coupled waveforms, nor a discussion into potential drawbacks. For example, phase may be aligned while other properties such as amplitude and typical wave-shape measures may differ. As this is presented as a method meant for adoption in other labs, a more rigorous analytical approach was expected.

      In the revised manuscript, we have analyzed synchronicity using several different approaches:

      (1) we calculate cumulative autocorrelations of response between communities.

      (2) to complement autocorrelation analysis, we developed a quantitative metric of ‘synchrony index’ defined as 1 - R where R is the ratio of differences in subsequent ThT peak positions amongst cell communities (phase) to expected period. This metrics describes how well synchronized are fungi colonies with each other under guidance of the common environmental signal.

      (3) we analyzed amplitudes and peak widths for all presented scenarios and we conclude that while periods and peak widths are robust across communities there is noticeable variation in amplitudes (i.e. Figure 3E).

      We therefore believe that this multistep quantitative approach is rigorous in identifying oscillatory signal characteristics.

      Reviewer #3 (Public Review):

      We are enthusiastic about this paper. It demonstrates controlled expression of ion channels, which itself is impressive. Going a step further, the authors show that through their control over ion channel expression, they can dynamically manipulate membrane potential in yeast. This chemical to electrophysiological conversion opens up new opportunities for synthetic biology, for example development of synthetic signaling systems or biological electrochemical interfaces. We believe that control of ion channel expression and hence membrane potential through external stimuli can be emphasized more strongly in the report. The experimental time-lapse data were also high quality. We have two major critiques on the paper, which we will discuss below.

      First, we do not believe the analyses used supports the authors' claims that chemical or electrical signals are propagating from cell-to-cell. The text makes this claim indirectly and directly. For example, in lines 139-141, the authors describe the observed membrane potential dynamics as "indicative of the effective communication of electrical messages within the populations". There are similar remarks in lines 144 and 154-156. The claim of electrical communication is further established by Figure 2 supplement 3, which is a spatial signal propagation model. As far as we can tell, this model describes a system different from the one implemented in the paper.

      Second, it is not clear why the excitable dynamics of the circuit are so important or if the circuit constructed does in fact exhibit excitable dynamics. Certainly, the mathematical model has excitable dynamics. However, not enough data demonstrates that the biological implementation is in an excitable regime. For example, where in the parameter space of Figure 1 supplement 1 does the biological circuit lie? If the circuit has excitable dynamics, then the authors should observe something like Figure 1 supplement 1B in response to a nonoscillating input. Do they observe that? Do they observe a refractory period? Even if the circuit as constructed is not excitable, we don't think that's a major problem because it is not central to what we believe is the most important part of this work - controlling ion channel expression and hence membrane potential with external chemical stimuli.

      We thank Reviewer for encouraging comments and positive evaluation of our work.

    1. So that it is not because God is unmindful of their wickedness,

      This statement can be viewed as a warning to all people because it is telling us that no matter what we do or how well we think we are hiding something, even though no one on earth may know, God knows everything and every sin we have committed.

    1. Posted byu/raphaelmustermann9 hours agoSeparate private information from the outline of academic disciplines? .t3_xi63kb._2FCtq-QzlfuN-SwVMUZMM3 { --postTitle-VisitedLinkColor: #9b9b9b; --postTitleLink-VisitedLinkColor: #9b9b9b; --postBodyLink-VisitedLinkColor: #989898; } How does Luhmann deal with private Zettels? Does he store them in a separate category like, 2000 private. Or does he work them out under is topics in the main box.I can´ find informations about that. Anyway, you´re not Luhmann. But any suggestions on how to deal with informations that are private, like Health, Finances ... does not feel right to store them under acadmic disziplines. But maybe it´s right and just a feeling which come´ out how we "normaly" store information.

      I would echo Bob's sentiment here and would recommend you keep that material like this in a separate section or box all together.

      If it helps to have an example, in 2006, Hawk Sugano showed off a version of a method you may be considering which broadly went under the title of Pile of Index Cards (or PoIC) which combined zettelkasten and productivity systems (in his case getting things done or GTD). I don't think he got much (any?!) useful affordances out of mixing the two. In fact, from what I can see looking at later iterations of his work and how he used it, it almost seems like he spent more time and energy later attempting to separate and rearrange them to get use out of the knowledge portions as distinct from the productivity portions.

      I've generally seen people mixing these ideas in the digital space usually to their detriment as well—a practice I call zettelkasten overreach.

    1. Constrains block our thinking and idea generation. Naturally, we consider constraints as soon as an idea germinates, so eliminating even some of these constraints can encourage creative idea generation; for example, ask participants “What if there is no gravity, how can we improve the flying experience?”

      I would say this if quite useful. What is a constraint or limitation for us is not necessarily for others. Others may possibly offer solutions, so when we remove the constraint and start to think and research along the path that we did not think about before due to the existence of the constraints, we will often have a different insight.